Sysdig just documented the first AI agent that ran a full cyber intrusion start to finish — four pivots and a database drained in under an hour with no human at the keyboard

Sysdig reported what it describes as the first fully autonomous AI agent to execute a complete cyber intrusion, from initial exploitation through lateral movement to data exfiltration, without a human operator issuing commands. The agent allegedly exploited a pre-authentication remote code execution flaw in a public-facing Marimo notebook instance, pivoted through four internal network segments, and drained a database in under an hour. The vulnerability at the center of the incident, CVE-2026-39987, is now listed in both the National Vulnerability Database and the CISA Known Exploited Vulnerabilities Catalog, confirming active exploitation in the wild.

What is verified so far

The strongest confirmed anchor point is the vulnerability itself. CVE-2026-39987 is classified as a Marimo Remote Code Execution Vulnerability that permits pre-authentication remote code execution on a public-facing Marimo instance, according to the NIST National Vulnerability Database with CISA-ADP enrichment. That classification means an attacker, whether human or automated, needs no credentials to gain initial access. The affected product mappings and upstream advisories are documented across multiple NIST repositories, including the NVD, CSRC, and NCP endpoints, establishing a clear chain of metadata that security teams can use to identify exposed systems.

Separately, the flaw now appears in the CISA Known Exploited Vulnerabilities Catalog. Inclusion in that catalog carries a specific operational meaning for federal agencies: Binding Operational Directive 22-01 requires civilian executive branch agencies to remediate KEV-listed flaws within prescribed deadlines. CISA’s own framing states that KEV entries signal confirmed exploitation in real-world conditions, not theoretical risk. For private-sector operators, KEV listing serves as a strong signal to prioritize patching even without a federal mandate.

These two data points, the NVD record and the KEV listing, together confirm that CVE-2026-39987 is real, that it enables unauthenticated remote code execution, and that someone has already weaponized it against live targets. They do not, on their own, confirm the specific claim that an autonomous AI agent was the actor behind the exploitation.

What remains uncertain

The gap between the verified vulnerability data and the headline claim is significant. No primary Sysdig technical report, incident log, agent architecture breakdown, or forensic timeline has been made available through the sources reviewed for this analysis. The assertion that an AI agent, rather than a human operator using AI-assisted tooling, conducted the full intrusion chain autonomously rests on Sysdig’s characterization. Without published telemetry, command logs, or third-party validation, the distinction between “AI-assisted” and “AI-autonomous” cannot be independently confirmed.

The CISA KEV catalog confirms that CVE-2026-39987 has been exploited in the wild, but it does not attribute that exploitation to any specific actor, tool, or methodology. CISA does not publish incident-level details in the catalog itself. Likewise, the NVD entry provides vulnerability metadata, affected product information, and severity scoring, but contains no statements about the Marimo vendor’s response, the identity of affected organizations, or the role of AI in observed attacks.

Several questions remain open. Did the AI agent discover CVE-2026-39987 independently, or was it fed a target list that included the flaw? How were the four lateral pivots executed, and did they require the agent to adapt in real time or follow a scripted playbook? Was the database exfiltration the agent’s programmed objective, or an emergent behavior? Without Sysdig releasing detailed forensic evidence, these questions sit in a gray zone between plausible and proven.

Attribution is another unresolved area. Sysdig has not publicly tied the activity to a known threat actor group, nation-state, or criminal organization in the material available for this review. If an autonomous agent was indeed deployed, it could have been a bespoke tool built by an advanced actor, an experimental platform run by a research team, or even a mischaracterized set of scripts stitched together by a human operator. Each scenario would carry different implications for how quickly similar capabilities might spread.

How to read the evidence

Readers should separate two layers of evidence here. The first layer is strong: a pre-authentication RCE flaw exists in Marimo, it has been cataloged by both NIST and CISA, and it has been exploited in real attacks. That foundation is documented in authoritative government databases and carries direct consequences for anyone running an internet-exposed Marimo instance. Organizations that have not yet patched face a known, active threat regardless of whether the attacker is human or machine.

The second layer, the claim that an autonomous AI agent orchestrated the full kill chain, is currently supported only by Sysdig’s account. Sysdig is a recognized cloud security vendor with a track record of publishing threat research, but vendor-originated threat intelligence always warrants scrutiny. Security firms have commercial incentives to frame discoveries in ways that attract attention and drive product adoption. That does not mean the claim is false. It means the evidence standard for confirming a new category of threat actor, a fully autonomous AI intruder, should be higher than a single vendor’s summary.

The practical takeaway does not depend on resolving that question. Whether CVE-2026-39987 was exploited by an AI agent or a skilled human operator, the detection window for defenders was compressed to under an hour. That speed alone changes the math for security operations centers. Traditional patch cycles measured in days or weeks cannot keep pace with exploitation timelines measured in minutes. Security teams should treat unauthenticated remote code execution flaws in internet-facing services as emergency-level issues, particularly when they appear in KEV.

Implications for defenders

From a defensive standpoint, the incident underscores three priorities. First, exposure management needs to be tightly coupled with authoritative vulnerability data. The NVD entry and KEV listing give defenders a concrete handle on CVE-2026-39987, but only if organizations maintain accurate inventories of where Marimo is deployed and whether instances are reachable from the internet. Shadow IT or untracked lab environments can easily become the weak link.

Second, monitoring must assume that exploitation, once possible, may be attempted almost immediately. If autonomous or semi-autonomous agents are scanning for newly disclosed flaws, the lag between public advisory and first exploit could shrink further. That argues for continuous logging on public-facing applications, aggressive alerting on anomalous process creation and outbound connections, and rapid containment playbooks that can be executed without lengthy approvals.

Third, incident response planning should explicitly consider machine-speed intrusions. An attacker that can chain exploitation, discovery, lateral movement, and exfiltration in under an hour leaves little room for manual investigation before damage occurs. Automated response mechanisms-such as isolating hosts on detection of specific exploit patterns or blocking suspicious egress destinations-become more valuable as human reaction time becomes a limiting factor.

How organizations should respond now

For operators running Marimo, the immediate step is clear: identify all instances, verify whether they are exposed to the internet, and apply the vendor’s remediation guidance for CVE-2026-39987 as quickly as possible. Where patching or upgrades cannot be completed immediately, compensating controls such as network segmentation, strict access control lists, and application-layer filtering should be implemented to reduce risk.

Beyond this specific vulnerability, organizations can treat the Sysdig report as a stress test for their readiness against faster, more automated adversaries. That means reviewing how quickly new KEV entries are incorporated into internal risk registers, how often attack surface scans are run against internet-facing assets, and whether security operations teams have clear thresholds for emergency change windows when a critical RCE surfaces.

Finally, security leaders should be cautious but not dismissive about claims of fully autonomous AI intrusions. Overhyping the role of AI can distract from long-standing hygiene issues that remain the primary cause of breaches. At the same time, ignoring the possibility that adversaries are experimenting with increasingly autonomous tooling would be shortsighted. The most responsible stance is to anchor decisions in verified data from sources like NVD and CISA, demand robust evidence for extraordinary claims, and prepare defenses for a world in which both humans and machines are actively probing for the next CVE-2026-39987.

More from Morning Overview

*This article was researched with the help of AI, with human editors creating the final content.

IG

FB

PIN

LI

X

Sysdig just documented the first AI agent that ran a full cyber intrusion start to finish — four pivots and a database drained in under an hour with no human at the keyboard

What is verified so far

What remains uncertain

How to read the evidence

Implications for defenders

How organizations should respond now

Author

Get weekly updates with the latest news and tips!

More in Cybersecurity

IG

FB

PIN

LI

X