Anthropic opened its Project Glasswing to AWS, Apple, Cisco, Google and JPMorgan, handing them an AI built to hunt and fix critical software flaws

Anthropic has opened early access to Project Glasswing, its AI system designed to detect and patch critical software vulnerabilities, to five major organizations: AWS, Apple, Cisco, Google, and JPMorgan. The decision places a powerful defensive tool in the hands of companies that collectively manage infrastructure touching billions of users, at a moment when autonomous AI agents are proving they can turn known security flaws into working exploits. The move raises a sharp question: whether concentrating advanced defensive AI among a handful of well-resourced firms will shrink the global attack surface or simply widen the gap between protected and exposed organizations.

Why selective Glasswing access changes the defensive calculus

The five firms chosen for Glasswing access share a common trait: each operates software ecosystems so large that a single unpatched flaw can cascade across millions of endpoints. The 2021 Colonial Pipeline attack illustrated this dynamic at national scale. That incident, which disrupted fuel supplies across the U.S. East Coast, prompted a sustained federal response that U.S. cyber officials later chronicled, including new mandatory reporting requirements for critical infrastructure operators. Glasswing is built to catch the kinds of vulnerabilities that made Colonial Pipeline possible before attackers reach them.

The timing is not incidental. Researchers at UC Berkeley’s Center for Responsible, Decentralized Intelligence built ExploitGym, a benchmark containing 898 real-world vulnerabilities, and used it to test how well frontier AI models can convert disclosed security flaws into functional exploits. Claude Mythos Preview and GPT-5.5 were both evaluated under time constraints, and the results showed measurable exploit success. Anthropic responded by restricting access to one of its models, a signal that the company views offensive capability as a near-term risk serious enough to limit its own products.

A testable prediction follows from this arrangement. If Glasswing performs as intended, the five partner organizations should show meaningfully lower successful exploit rates on future ExploitGym-style evaluations compared with organizations of similar size and complexity that lack access. A reduction of at least 25 percent within 12 months would represent a credible signal, measurable through public benchmark updates. That threshold is not arbitrary: it reflects the gap between current AI exploit success rates and the detection speed that an always-on defensive system would need to close.

ExploitGym results and the AISI warning that shaped the rollout

The Berkeley ExploitGym research provides the clearest public evidence for why Anthropic chose this moment to distribute Glasswing. The benchmark did not test abstract scenarios. It drew from 898 real vulnerabilities, each with documented exploit paths, and measured whether AI agents could reproduce those attacks autonomously. The comparative testing of Claude Mythos Preview and GPT-5.5 under time limits produced concrete exploit counts, demonstrating that frontier models can already function as semi-autonomous penetration tools. A related technical preprint from the Berkeley group framed the central question directly: can AI agents turn security vulnerabilities into real attacks? The answer, based on the data, was yes, with measurable and improving success rates.

The UK AI Safety Institute reached a parallel conclusion through its own evaluation work. AISI published findings examining the pace of autonomous cyber capability, and the trajectory it described was steep. Taken together, the Berkeley and AISI findings create a dual pressure: offensive AI tools are getting better at exploiting known flaws, while the window between vulnerability disclosure and active exploitation is shrinking. Glasswing is Anthropic’s bet that a defensive AI, trained on the same vulnerability data, can close that window faster than human security teams working alone.

The choice to restrict access to its own model after the ExploitGym results also reveals an internal tension at Anthropic. The company is simultaneously building tools that could be used offensively and selling defensive systems to counter that exact threat. Granting Glasswing to five specific partners, rather than releasing it broadly, suggests Anthropic is treating the tool itself as a dual-use technology that requires controlled distribution. That mirrors long-standing debates in cybersecurity over whether powerful intrusion tools should be widely shared for research or tightly held to prevent abuse.

Gaps in the evidence and what to watch next

No primary technical report from Anthropic has been published detailing Glasswing’s detection rates, false-positive metrics, or performance against the ExploitGym benchmark. The five partner organizations have not released statements confirming integration timelines or measured vulnerability reductions. Without that data, the claim that Glasswing can meaningfully reduce exploit success rates rests on the logic of the system’s design rather than on demonstrated results.

The CISA and AISI sources establish the severity of the threat and the speed of offensive AI advancement, but neither contains specific data on Glasswing’s performance. The Berkeley ExploitGym results predate the Glasswing rollout, so they measure the problem Glasswing is meant to solve without measuring whether it actually solves it. This leaves a gap between the urgency of the threat, which is well documented, and the evidence that a particular defensive intervention is working.

Several milestones could help close that gap. The first would be independent benchmarking of Glasswing or comparable systems on public datasets derived from real vulnerabilities. Even if proprietary codebases prevent full transparency, red-team exercises using disclosed flaws could provide partial visibility. A second milestone would be aggregate reporting from the five partners on mean time to patch, exploit prevalence, and the share of critical bugs first flagged by AI rather than by human analysts.

Policy signals will matter as well. If regulators begin to treat advanced defensive AI as a best practice for critical infrastructure, Glasswing-like systems could become de facto requirements for operators of pipelines, payment networks, cloud platforms, and telecommunications backbones. That, in turn, would pressure Anthropic and its peers to broaden access beyond a small club of early adopters, or at least to offer less powerful but still effective versions to smaller organizations.

Concentration risk and the wider security ecosystem

The decision to place Glasswing in the hands of five dominant firms raises distributional questions that go beyond technical performance. If the system works, those firms will enjoy a measurable security advantage over competitors and public institutions that lack comparable tools. Attackers, facing hardened targets at the core of the digital economy, may redirect their efforts toward softer ones: hospitals, regional utilities, municipal systems, or small software vendors that supply critical components.

In that scenario, Glasswing could reduce systemic risk at the very top of the stack while increasing relative risk for everyone else. The overall effect on global cyber harm would depend on how quickly defensive AI diffuses outward and whether open or lower-tier tools can approximate the protection enjoyed by early adopters. History offers mixed precedents: some security innovations, like automatic browser updates, spread rapidly and lifted the baseline, while others, like sophisticated intrusion detection, remained concentrated in organizations with the budgets and staff to use them.

There is also a strategic dimension to consider. By partnering with AWS, Apple, Cisco, Google, and JPMorgan, Anthropic is aligning its defensive roadmap with companies that already shape standards and expectations across cloud computing, mobile platforms, networking, and finance. If Glasswing becomes deeply embedded in their development pipelines, its assumptions and interfaces could influence how future security tooling is built, potentially locking in particular approaches to vulnerability management and disclosure.

For now, the most important fact may be the one Anthropic has not yet provided: verifiable evidence that Glasswing materially changes real-world outcomes. Until that arrives, the system will stand as a high-profile experiment in concentrated defensive AI. Its success or failure will help answer a broader question hanging over the next phase of cybersecurity: whether the same frontier models that make exploitation easier can, when carefully channeled, tip the balance back toward defense without leaving the rest of the digital world further behind.

More from Morning Overview

*This article was researched with the help of AI, with human editors creating the final content.

IG

FB

PIN

LI

X

Global Font

Anthropic opened its Project Glasswing to AWS, Apple, Cisco, Google and JPMorgan, handing them an AI built to hunt and fix critical software flaws

Why selective Glasswing access changes the defensive calculus

ExploitGym results and the AISI warning that shaped the rollout

Gaps in the evidence and what to watch next

Concentration risk and the wider security ecosystem

Dorian Maddox

Author

A new find suggests the famous Sutton Hoo helmet was made in England, not abroad

Microsoft’s July update patches a record 570 Windows flaws, and three are already under attack

Diggers at Pompeii turned up the skeleton of a horse in a house frozen by Vesuvius

iSeeCars studied 312 million vehicles and crowned the Toyota Tacoma the most reliable midsize truck

Wearing hearing aids cut the risk of dementia by about a third in a seven-year study

More in AI

AI

Researchers used AI to rebuild the face of a Pompeii victim who died fleeing Vesuvius

AI

OpenAI’s new voice mode can listen and talk at the same time, killing the awkward pause

AI

Microsoft says its AI diagnosed tough cases four times better than a panel of doctors

AI

Regulators just ordered Google to open Android to rival AI assistants on two billion phones

AI

Meta is cutting 8,000 jobs and shifting thousands of workers onto its AI teams

AI

Nearly half of US companies using ChatGPT say it has already replaced workers

AI

A new bill in Congress would bar chatbots from posing as your doctor or lawyer

AI

AI is making romance-and-crypto scams roughly four times more profitable

IG

FB

PIN

LI

X

IG

FB

PIN

LI

X

Anthropic opened its Project Glasswing to AWS, Apple, Cisco, Google and JPMorgan, handing them an AI built to hunt and fix critical software flaws

Why selective Glasswing access changes the defensive calculus

ExploitGym results and the AISI warning that shaped the rollout

Gaps in the evidence and what to watch next

Concentration risk and the wider security ecosystem

Author

Get weekly updates with the latest news and tips!

More in AI

IG

FB

PIN

LI

X