OpenAI built a cybersecurity system that finds and patches software flaws on its own

SoftBank has begun deploying an automated cybersecurity service built on OpenAI technology that diagnoses software weaknesses and determines how to patch them without waiting for human coders. The rollout moves AI-driven vulnerability repair from government-funded research competitions into commercial use, raising immediate questions about how much autonomy defenders should hand to machines when critical infrastructure is at stake.

Why automated patching is entering commercial use now

Most cyberattacks still exploit known flaws that sit unpatched for weeks or months. The gap between discovery and fix has long been a staffing problem: security teams cannot review and repair code fast enough to keep up with the volume of disclosed vulnerabilities. SoftBank’s new service attacks that bottleneck directly. The workflow diagnoses weaknesses and then analyzes what needs to be done to patch them, compressing a process that typically requires specialized engineers and days of triage into an automated sequence.

The timing reflects a broader shift. Government-backed research has already demonstrated that fully autonomous cyber reasoning systems can both discover and remediate vulnerabilities in real codebases. DARPA’s AI Cyber Challenge, known as AIxCC, tested exactly this capability in a structured competition setting. A scholarly paper published on arXiv documents the architectures and evaluation benchmarks those systems used. SoftBank’s commercial deployment suggests the technology has matured enough for paying customers, not just contest environments.

One hypothesis worth tracking is that commercial systems built on OpenAI models may achieve higher remediation success on legacy codebases than DARPA competition entries. The reason is access to data. Contest datasets are standardized and public, designed to be fair and repeatable for all teams. A commercial operator like SoftBank, by contrast, can fine-tune its models on proprietary customer telemetry, including internal logs, configuration files, and historical incident records that competition entries never see. If that advantage holds, it would help explain why enterprise deployments could outperform academic benchmarks even when using similar underlying AI architectures.

SoftBank’s patching workflow and the DARPA baseline

SoftBank’s service follows a two-stage pattern. First, the system scans a codebase to identify weaknesses. Second, it generates an analysis of the specific changes needed to close each flaw. According to reporting from the Associated Press, this sequence was designed to handle both the diagnostic and remediation steps that security teams normally perform manually. The service uses OpenAI’s models as its reasoning engine, though neither SoftBank nor OpenAI has published a technical paper detailing the model’s architecture or benchmark results on production code.

That gap matters because the closest public reference point is the DARPA competition. The AIxCC paper describes fully autonomous cyber reasoning systems that discover and remediate vulnerabilities, using structured benchmarks and evaluation methods to measure success. The competition tested whether AI agents could find bugs in open-source software and generate working patches under time pressure. Several teams demonstrated that capability, but the published analysis focuses on competition design and lessons learned rather than reporting deployment-scale metrics from real enterprise networks.

The distinction between contest performance and commercial reliability is not academic. A patching system that works on curated challenge problems may struggle with the messy, undocumented codebases that large companies actually run. Legacy systems often contain decades of accumulated technical debt, custom configurations, and dependencies that no public dataset captures. SoftBank’s ability to feed its models with real operational data from Japanese corporate clients could close that gap, but no public results confirm it yet. Until such evidence appears, customers have to treat the service as promising but unproven.

Another difference is operational context. DARPA’s challenge ran in a tightly controlled environment with clear scoring rules and defined end states: find vulnerabilities, patch them, and demonstrate that the patched systems withstand scripted attacks. Commercial environments are far less tidy. Patches can interact in unexpected ways with third-party libraries, cloud services, or proprietary protocols. Performance regressions, compliance requirements, and business uptime guarantees all complicate the seemingly simple task of “fix the bug.” Any automated patching workflow must therefore integrate with change-management pipelines, testing suites, and rollback mechanisms that extend well beyond what a competition scenario demands.

Open questions about autonomy, accuracy, and adversarial risk

Several unresolved issues hang over this technology. The first is accuracy. Automated patches that introduce new bugs or break existing functionality could cause more damage than the vulnerabilities they fix. No public benchmark data from SoftBank’s deployment addresses false-positive rates, patch rejection rates, or the percentage of generated fixes that required human revision before going live. Without those numbers, the claim of full autonomy remains aspirational rather than proven.

The second issue is adversarial exposure. The same AI models that find and fix flaws could, in theory, be repurposed to find and exploit them. OpenAI has published high-level safety principles for its models, but the specific guardrails applied to SoftBank’s patching service have not been disclosed. The DARPA competition addressed this concern by operating in controlled environments with limited real-world connectivity and carefully scoped targets. A commercial service running against live production systems does not have that luxury and must assume that attackers will probe it, attempt to reverse-engineer its behavior, or feed it misleading inputs.

A third concern is transparency. Enterprise customers adopting this service need to understand what the AI changed and why. Automated code modifications that lack clear audit trails create compliance risks, especially in regulated industries like finance and healthcare. If a patch later causes an outage or data loss, the question of accountability becomes urgent. Did the AI make a bad decision, or did the human operator fail to review it? Regulators and insurers are unlikely to accept “the model did it” as a sufficient explanation without detailed logs and reproducible reasoning steps.

Governance frameworks will therefore matter as much as technical performance. Organizations considering AI-driven patching will need policies that define which systems the AI may modify, what testing thresholds must be met before deployment, and who signs off on high-impact changes. They will also need clear incident-response playbooks for when an automated patch goes wrong, including rapid rollback procedures and communication plans for affected customers or regulators.

For organizations evaluating this technology, the practical first step is straightforward: request detailed documentation of the system’s accuracy metrics, failure modes, and rollback procedures before granting it write access to production code. No AI patching tool should operate without a human review gate until its error rates are independently validated. Pilot projects should start with low-risk environments, such as non-critical internal applications, and gradually expand scope only after the AI’s behavior has been observed across multiple patch cycles.

The next development to watch is whether SoftBank or OpenAI publishes performance data from real deployments, even if anonymized and aggregated. Concrete metrics-such as mean time to remediate, percentage of vulnerabilities fixed without human edits, and rate of post-patch incidents-would move the debate from speculation to evidence. If those numbers show that AI-driven patching can reliably shrink exposure windows without increasing operational risk, it will strengthen the case for broader adoption.

Until then, automated patching should be treated as a powerful assistant rather than an infallible guardian. The technology promises to narrow one of cybersecurity’s most persistent gaps: the lag between knowing about a flaw and actually fixing it. But closing that gap safely will depend on careful integration, rigorous oversight, and a willingness to slow down or intervene when the machine’s confidence outpaces its demonstrated reliability.

More from Morning Overview

*This article was researched with the help of AI, with human editors creating the final content.

IG

FB

PIN

LI

X

Global Font

OpenAI built a cybersecurity system that finds and patches software flaws on its own

Why automated patching is entering commercial use now

SoftBank’s patching workflow and the DARPA baseline

Open questions about autonomy, accuracy, and adversarial risk

Cassian Holt

Author

UniQure plans to file for approval of its Huntington’s therapy within months

7 smart-home gadgets that may be listening more than you think

9 cars owners say they most regret buying new

Trump invoked wartime production powers to ramp up U.S. munitions output

The deepest part of the ocean stays near freezing and darker than any night

More in Cybersecurity

Cybersecurity

Never trust a ‘loan approved’ text you never asked for, the FTC warns

Cybersecurity

How to wipe an old phone so its data can never be recovered

Cybersecurity

5 things security experts say you should never type into a public computer

Cybersecurity

7 apps that quietly keep tracking your location after you close them

Cybersecurity

The FBI says scammers spoofing bank and police phone numbers stole more than $215 million last year

Cybersecurity

6 settings that lock down your home Wi-Fi against strangers

Cybersecurity

The FBI warns AI voice clones of relatives are now convincing enough to drain bank accounts

Cybersecurity

8 signs your phone may have been quietly hacked

IG

FB

PIN

LI

X

IG

FB

PIN

LI

X

OpenAI built a cybersecurity system that finds and patches software flaws on its own

Why automated patching is entering commercial use now

SoftBank’s patching workflow and the DARPA baseline

Open questions about autonomy, accuracy, and adversarial risk

Author

Get weekly updates with the latest news and tips!

More in Cybersecurity

IG

FB

PIN

LI

X