Morning Overview

Anthropic opened its Project Glasswing to AWS, Apple, Cisco, Google and JPMorgan, handing them an AI built to hunt and fix critical software flaws

Anthropic has opened early access to Project Glasswing, its AI system designed to detect and patch critical software vulnerabilities, to five major organizations: AWS, Apple, Cisco, Google, and JPMorgan. The decision places a powerful defensive tool in the hands of companies that collectively manage infrastructure touching billions of users, at a moment when autonomous AI agents are proving they can turn known security flaws into working exploits. The move raises a sharp question: whether concentrating advanced defensive AI among a handful of well-resourced firms will shrink the global attack surface or simply widen the gap between protected and exposed organizations.

Why selective Glasswing access changes the defensive calculus

The five firms chosen for Glasswing access share a common trait: each operates software ecosystems so large that a single unpatched flaw can cascade across millions of endpoints. The 2021 Colonial Pipeline attack illustrated this dynamic at national scale. That incident, which disrupted fuel supplies across the U.S. East Coast, prompted a sustained federal response that U.S. cyber officials later chronicled, including new mandatory reporting requirements for critical infrastructure operators. Glasswing is built to catch the kinds of vulnerabilities that made Colonial Pipeline possible before attackers reach them.

The timing is not incidental. Researchers at UC Berkeley’s Center for Responsible, Decentralized Intelligence built ExploitGym, a benchmark containing 898 real-world vulnerabilities, and used it to test how well frontier AI models can convert disclosed security flaws into functional exploits. Claude Mythos Preview and GPT-5.5 were both evaluated under time constraints, and the results showed measurable exploit success. Anthropic responded by restricting access to one of its models, a signal that the company views offensive capability as a near-term risk serious enough to limit its own products.

A testable prediction follows from this arrangement. If Glasswing performs as intended, the five partner organizations should show meaningfully lower successful exploit rates on future ExploitGym-style evaluations compared with organizations of similar size and complexity that lack access. A reduction of at least 25 percent within 12 months would represent a credible signal, measurable through public benchmark updates. That threshold is not arbitrary: it reflects the gap between current AI exploit success rates and the detection speed that an always-on defensive system would need to close.

ExploitGym results and the AISI warning that shaped the rollout

The Berkeley ExploitGym research provides the clearest public evidence for why Anthropic chose this moment to distribute Glasswing. The benchmark did not test abstract scenarios. It drew from 898 real vulnerabilities, each with documented exploit paths, and measured whether AI agents could reproduce those attacks autonomously. The comparative testing of Claude Mythos Preview and GPT-5.5 under time limits produced concrete exploit counts, demonstrating that frontier models can already function as semi-autonomous penetration tools. A related technical preprint from the Berkeley group framed the central question directly: can AI agents turn security vulnerabilities into real attacks? The answer, based on the data, was yes, with measurable and improving success rates.

The UK AI Safety Institute reached a parallel conclusion through its own evaluation work. AISI published findings examining the pace of autonomous cyber capability, and the trajectory it described was steep. Taken together, the Berkeley and AISI findings create a dual pressure: offensive AI tools are getting better at exploiting known flaws, while the window between vulnerability disclosure and active exploitation is shrinking. Glasswing is Anthropic’s bet that a defensive AI, trained on the same vulnerability data, can close that window faster than human security teams working alone.

The choice to restrict access to its own model after the ExploitGym results also reveals an internal tension at Anthropic. The company is simultaneously building tools that could be used offensively and selling defensive systems to counter that exact threat. Granting Glasswing to five specific partners, rather than releasing it broadly, suggests Anthropic is treating the tool itself as a dual-use technology that requires controlled distribution. That mirrors long-standing debates in cybersecurity over whether powerful intrusion tools should be widely shared for research or tightly held to prevent abuse.

Gaps in the evidence and what to watch next

No primary technical report from Anthropic has been published detailing Glasswing’s detection rates, false-positive metrics, or performance against the ExploitGym benchmark. The five partner organizations have not released statements confirming integration timelines or measured vulnerability reductions. Without that data, the claim that Glasswing can meaningfully reduce exploit success rates rests on the logic of the system’s design rather than on demonstrated results.

The CISA and AISI sources establish the severity of the threat and the speed of offensive AI advancement, but neither contains specific data on Glasswing’s performance. The Berkeley ExploitGym results predate the Glasswing rollout, so they measure the problem Glasswing is meant to solve without measuring whether it actually solves it. This leaves a gap between the urgency of the threat, which is well documented, and the evidence that a particular defensive intervention is working.

Several milestones could help close that gap. The first would be independent benchmarking of Glasswing or comparable systems on public datasets derived from real vulnerabilities. Even if proprietary codebases prevent full transparency, red-team exercises using disclosed flaws could provide partial visibility. A second milestone would be aggregate reporting from the five partners on mean time to patch, exploit prevalence, and the share of critical bugs first flagged by AI rather than by human analysts.

Policy signals will matter as well. If regulators begin to treat advanced defensive AI as a best practice for critical infrastructure, Glasswing-like systems could become de facto requirements for operators of pipelines, payment networks, cloud platforms, and telecommunications backbones. That, in turn, would pressure Anthropic and its peers to broaden access beyond a small club of early adopters, or at least to offer less powerful but still effective versions to smaller organizations.

Concentration risk and the wider security ecosystem

The decision to place Glasswing in the hands of five dominant firms raises distributional questions that go beyond technical performance. If the system works, those firms will enjoy a measurable security advantage over competitors and public institutions that lack comparable tools. Attackers, facing hardened targets at the core of the digital economy, may redirect their efforts toward softer ones: hospitals, regional utilities, municipal systems, or small software vendors that supply critical components.

In that scenario, Glasswing could reduce systemic risk at the very top of the stack while increasing relative risk for everyone else. The overall effect on global cyber harm would depend on how quickly defensive AI diffuses outward and whether open or lower-tier tools can approximate the protection enjoyed by early adopters. History offers mixed precedents: some security innovations, like automatic browser updates, spread rapidly and lifted the baseline, while others, like sophisticated intrusion detection, remained concentrated in organizations with the budgets and staff to use them.

There is also a strategic dimension to consider. By partnering with AWS, Apple, Cisco, Google, and JPMorgan, Anthropic is aligning its defensive roadmap with companies that already shape standards and expectations across cloud computing, mobile platforms, networking, and finance. If Glasswing becomes deeply embedded in their development pipelines, its assumptions and interfaces could influence how future security tooling is built, potentially locking in particular approaches to vulnerability management and disclosure.

For now, the most important fact may be the one Anthropic has not yet provided: verifiable evidence that Glasswing materially changes real-world outcomes. Until that arrives, the system will stand as a high-profile experiment in concentrated defensive AI. Its success or failure will help answer a broader question hanging over the next phase of cybersecurity: whether the same frontier models that make exploitation easier can, when carefully channeled, tip the balance back toward defense without leaving the rest of the digital world further behind.

More from Morning Overview

*This article was researched with the help of AI, with human editors creating the final content.