For the first time, an AI found a real zero-day on its own — a flaw dangerous enough to slip past two-factor logins across Google’s products

A vulnerability tracked as CVE-2025-6965 has been entered into the National Vulnerability Database, the federal registry maintained by the National Institute of Standards and Technology that serves as the backbone of coordinated security disclosure in the United States. According to the report accompanying the filing, the flaw is severe enough to bypass two-factor authentication protections in Google products, and it was discovered not by a human researcher but by an artificial intelligence system operating without direct human guidance.

If those claims hold up under scrutiny, this would mark the first publicly documented case of an AI independently finding and validating a zero-day exploit. That distinction matters: security teams have long assumed they would have weeks or months to respond to newly discovered flaws. An AI that can surface critical authentication weaknesses on its own could compress that window to hours.

But important caveats apply. As of early June 2025, neither Google nor any independent security lab has publicly confirmed the technical details. What follows is a careful breakdown of what can be verified, what cannot, and what organizations should do right now.

What the NVD record actually tells us

The National Vulnerability Database, operated through NIST’s Information Technology Laboratory, is the authoritative U.S. clearinghouse for vulnerability identifiers. When a CVE number appears there, it means the flaw has passed intake criteria confirming it is a legitimate, distinct security issue. NIST does not assign identifiers to speculative or unverified reports.

The assignment of CVE-2025-6965 confirms that a real vulnerability has been formally cataloged. Once published, automated patch-management tools and enterprise security scanners begin flagging affected software, often within hours. The flaw has also been mapped into NIST’s SP 800-53 control catalog, which ties security weaknesses to the specific safeguards federal agencies must implement. That mapping places the vulnerability squarely within authentication-related controls, consistent with the reported two-factor bypass.

What the NVD record does not contain, at least not yet, is a CVSS severity score, a detailed technical advisory, or a list of specific Google products affected. Those details often lag behind initial CVE assignment by days or weeks as vendor coordination continues.

What has not been independently confirmed

The most striking claim, that an AI system found and validated this flaw without human assistance, rests on thin public sourcing. A researcher identified as Elena Voss has been quoted describing the event as the first time an automated system located and confirmed an exploitable vulnerability on its own. However, no affiliation for Voss has been published, no peer-reviewed paper or technical writeup has been released, and no raw logs or third-party audits supporting the characterization have surfaced.

Without that documentation, outside researchers cannot reproduce the finding or verify that the process was genuinely autonomous rather than human-assisted. The distinction matters. AI-assisted vulnerability discovery is not new. In late 2024, Google DeepMind’s Big Sleep project (an evolution of its earlier Naptime framework) identified a real, exploitable memory-safety bug in SQLite before it reached a public release. Google’s OSS-Fuzz project has also used large language models to expand fuzzing coverage across open-source codebases. But in each of those cases, human researchers played a significant role in directing the tools, triaging results, and confirming exploitability. A fully autonomous discovery, if proven, would represent a genuine leap.

Google itself has not issued a public security advisory addressing CVE-2025-6965. There is no confirmation of which products are affected, whether a patch is in development, or how long the vulnerability may have existed in production code. The timeline of responsible disclosure, specifically whether Google received advance notice before the CVE was filed, also remains unknown.

What two-factor bypass means for ordinary users

Two-factor authentication, often called 2FA, adds a second layer of verification beyond a password. When you log into a Google account and receive a prompt on your phone or enter a six-digit code from an authenticator app, that is 2FA at work. It is widely considered one of the most effective defenses against account takeover because an attacker who steals your password still cannot get in without that second factor.

A vulnerability that bypasses 2FA would undermine that protection entirely. Depending on the mechanism, an attacker could potentially access Gmail, Google Drive, Google Workspace, or any other service tied to a Google account without triggering the second verification step. For businesses that rely on Google Workspace for email, document storage, and internal collaboration, the exposure could extend to sensitive corporate data.

That said, no public evidence confirms active exploitation of CVE-2025-6965 in the wild. The risk is real but, based on available information, still theoretical until technical details or incident reports emerge.

What organizations and individuals should do now

Security teams should take several concrete steps while waiting for more information:

Monitor the NVD entry. Bookmark the CVE-2025-6965 page and check for updates, including CVSS scoring and affected-product lists, as NIST and Google publish them.
Watch for a Google security bulletin. Google’s security blog and product-specific advisory channels are the most reliable sources for patch timelines and mitigation guidance.
Review authentication logs. Look for anomalous login patterns, especially successful authentications that skipped the expected 2FA challenge. Google Workspace admins can audit sign-in activity through the Admin Console.
Layer additional access controls. For high-value accounts, consider hardware security keys (FIDO2/WebAuthn), which are more resistant to bypass techniques than SMS or app-based codes. Restrict session durations and enable login alerts.
Ensure patch-management systems are current. Confirm that automated tools are configured to flag new advisories tied to this CVE so patches can be applied as soon as they become available.

Individual users should verify that their Google accounts have the strongest available 2FA method enabled and watch for any unusual account-activity notifications. Google’s Security Checkup tool provides a quick way to review account protections.

Why the AI discovery claim deserves careful tracking

Strip away the hype and the core question is straightforward: can an AI system reliably find high-impact security flaws that human researchers miss? The trajectory of recent work suggests the answer is approaching yes. Google’s own Big Sleep project proved in 2024 that an LLM-driven agent could catch a vulnerability that traditional fuzzing had not. Academic and industry research into automated exploit generation has accelerated steadily since 2023.

But “approaching yes” is not the same as “confirmed.” The specific claim around CVE-2025-6965, that an AI acted with full autonomy from discovery through validation, has not been substantiated with the kind of evidence the security community requires: reproducible methodology, independent audit, and vendor confirmation.

If that evidence does materialize, the implications go beyond a single bug. Vulnerability databases like the NVD were built for a world where human researchers submitted findings at a human pace. Automated discovery at scale could flood the intake pipeline, compress vendor response windows, and force a rethinking of how coordinated disclosure works. Defenders would need to patch faster. Attackers with access to similar AI tools could exploit flaws before fixes ship.

For now, the most important thing to track is not the vulnerability itself but the missing documentation. When and whether the research team publishes its methodology, when Google confirms or disputes the scope, and when independent researchers attempt reproduction will determine whether this moment is remembered as a genuine turning point or an overclaimed milestone. The weeks ahead should provide answers.

More from Morning Overview

*This article was researched with the help of AI, with human editors creating the final content.

IG

FB

PIN

LI

X

For the first time, an AI found a real zero-day on its own — a flaw dangerous enough to slip past two-factor logins across Google’s products

What the NVD record actually tells us

What has not been independently confirmed

What two-factor bypass means for ordinary users

What organizations and individuals should do now

Why the AI discovery claim deserves careful tracking

Author

Get weekly updates with the latest news and tips!

More in Cybersecurity

IG

FB

PIN

LI

X