Morning Overview

Anthropic says mass GitHub takedowns over leaked code were accidental

Anthropic, the AI company behind the Claude chatbot, has acknowledged that a wave of DMCA takedown requests it filed on GitHub mistakenly targeted repositories that had nothing to do with its proprietary code. The mass removals disrupted developers who found their public projects suddenly yanked offline, and the incident has reignited debate over how easily the copyright takedown system can be weaponized, even by accident. With the affected repositories now restored, the episode offers a sharp case study in the tension between corporate IP protection and the open-source ecosystem that millions of developers depend on.

What Went Wrong With Anthropic’s Takedown Requests

According to Anthropic, the company issued a batch of DMCA notices to GitHub after detecting what it believed was leaked proprietary code circulating on the platform. The intent, as the company described it, was to protect trade secrets tied to its AI models. But the automated or semi-automated process that generated those notices cast a far wider net than intended, flagging public repositories that bore no relation to Anthropic’s work. Developers across multiple projects woke up to find their code had been pulled down without warning.

Anthropic has called the overbroad takedowns an error rather than a deliberate strategy. The company has not disclosed the exact number of repositories affected or the specific internal code it was trying to protect. That lack of transparency makes it difficult to assess the full scope of the disruption. What is clear is that once GitHub received the notices, the platform acted swiftly to comply, pulling flagged repositories before any meaningful review could take place. The repos were later reinstated after Anthropic acknowledged the mistake, but the damage to developer trust had already been done.

How the DMCA’s Safe Harbor Enables Fast Removals

The speed with which GitHub removed the flagged repositories is not a quirk of the platform’s policies. It is a direct consequence of how U.S. copyright law structures incentives for online platforms. Section 512 of the Copyright Act (17 U.S.C. § 512) establishes a safe harbor framework that shields service providers from copyright infringement liability, but only if they act promptly to remove content once they receive a valid takedown notice. The statute states that “a service provider shall not be liable for infringement of copyright by reason of the storage at the direction of a user of material,” provided the platform follows the prescribed notice-and-takedown procedure.

This creates a strong incentive for platforms like GitHub to remove first and ask questions later. A platform that delays removal risks losing its safe harbor protection and facing direct liability for any infringing material. From GitHub’s perspective, the rational move when receiving a DMCA notice from a well-known company like Anthropic is immediate compliance. The law places the burden of challenging a takedown on the affected user through a counter-notification process, which can take days or weeks to resolve. For developers whose projects were pulled offline, that gap represents real lost time, broken build pipelines, and potential reputational harm.

The Circumvention Angle and AI Trade Secrets

Anthropic initially invoked protections that extend beyond simple copyright infringement. Chapter 12 of Title 17, whose core anti-circumvention provision is 17 U.S.C. § 1201, addresses the circumvention of technological protection measures, a provision originally designed to prevent people from breaking digital locks on copyrighted works. AI companies have increasingly leaned on these provisions to argue that reverse-engineering or leaking model-related code constitutes a form of circumvention, even when the “technological measure” in question is less like a DVD encryption scheme and more like an internal access control.

This legal framing is worth scrutiny. Applying anti-circumvention rules to leaked AI code stretches the original intent of the statute, which was crafted in the late 1990s to address digital piracy of media content. When an AI company uses these provisions to pull down code on a public platform, it effectively weaponizes a law designed for a different era against a different kind of target. Whether courts would ultimately uphold such claims in a contested proceeding is an open question, but the takedown process does not require judicial review. The notice itself is enough to trigger removal.

Why Open-Source Developers Bear the Cost

The practical fallout from Anthropic’s error landed squarely on individual developers and small teams who had no connection to the company’s proprietary systems. When a repository is taken down on GitHub, it does not just disappear from view. Downstream dependencies can break, collaborative workflows stall, and contributors lose access to their own commit history until the repo is restored. For projects that serve as building blocks for other software, even a brief removal can cascade through the supply chain.

The counter-notification process that the DMCA provides is technically available to affected developers, but it is not a practical remedy for most of them. Filing a counter-notice requires identifying yourself by name and address, consenting to federal court jurisdiction, and waiting 10 to 14 business days for the platform to restore the content, assuming the original filer does not pursue legal action in that window. For a solo developer or a small open-source maintainer, that process is intimidating and slow. The asymmetry is stark: a large corporation can file dozens of takedown notices in a single batch, while each affected developer must individually navigate a legalistic appeals process to get their work back online.

A Pattern Among AI Companies, Not an Isolated Mistake

Anthropic is not the first AI company to use aggressive IP enforcement on code-sharing platforms, and the broader pattern suggests this will not be the last such incident. As AI firms invest billions in training proprietary models, the perceived risk of code leaks has grown. Companies increasingly treat any public appearance of code that resembles their internal systems as a potential breach, even when the similarity is coincidental or the code in question is independently authored.

This defensive posture creates a chilling effect on open-source development. Researchers who publish code related to AI techniques, even those based on publicly available papers, risk having their repositories flagged if the work touches areas where a commercial AI company claims proprietary interest. The DMCA’s notice-and-takedown system was not designed to adjudicate these kinds of disputes. It was built to handle clear-cut piracy, not the gray zones that emerge when an entire industry is built on shared research foundations and open academic literature.

The assumption that dominates current coverage, that better internal processes at AI companies will prevent future mistakes, misses the structural forces at work. As long as the law rewards rapid removal and imposes few penalties for overreach, companies will continue to err on the side of taking down more rather than less. Even with improved internal review, the incentives point toward broad, conservative enforcement that treats open-source repositories as potential liabilities.

Can the DMCA Process Be Rebalanced?

Fixing these problems would require changes at multiple levels. One avenue is procedural reform by platforms like GitHub. They could choose to scrutinize bulk notices more closely, especially when they target widely used repositories or long-standing projects with extensive contributor histories. Pre-removal review by a technical team might catch obvious mismatches between the code alleged to be infringing and the repositories flagged in a notice.

However, platform-level safeguards run up against the same liability concerns that drove GitHub’s rapid compliance in Anthropic’s case. Any added friction in the takedown pipeline increases the risk that infringing content remains online long enough to threaten safe harbor. Without legislative changes that give platforms more room to investigate claims without sacrificing protection, there is a hard limit to how much they can slow down the process.

Lawmakers could also revisit the counter-notification framework to make it less daunting for individuals. Options might include allowing pseudonymous counter-notices in low-risk cases, shortening restoration timelines when the original filer does not respond, or creating penalties for repeat filers whose notices are routinely found to be defective or abusive. Such reforms would not eliminate wrongful takedowns, but they could reduce the duration and impact of errors like Anthropic’s.

What AI Companies Could Do Differently

Even within the current legal structure, AI companies retain significant discretion over how they enforce their IP rights. Anthropic and its peers could narrow the scope of their monitoring tools to focus on exact matches of known leaked code, rather than broad pattern-matching that sweeps up unrelated projects. They could also commit to manual review of any notice that would affect long-lived repositories or high-profile open-source libraries before sending it to a platform.
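The difference between exact matching and broad pattern matching can be made concrete. One way to keep automated monitoring narrow is to flag only files whose contents are byte-for-byte identical to known leaked code, for example by comparing cryptographic hashes. The sketch below is a hypothetical illustration of that approach, not a description of any company’s actual tooling; the directory names and helper functions are invented for the example:

```python
import hashlib
from pathlib import Path

def sha256_file(path: Path) -> str:
    """Return the SHA-256 hex digest of a file's contents."""
    return hashlib.sha256(path.read_bytes()).hexdigest()

def build_leak_index(leak_dir: Path) -> set[str]:
    """Hash every known-leaked file once, so later scans are O(1) set lookups."""
    return {sha256_file(p) for p in leak_dir.rglob("*") if p.is_file()}

def exact_matches(repo_dir: Path, leak_hashes: set[str]) -> list[Path]:
    """Flag only files byte-identical to known leaked code.

    Files that merely resemble the leaked code are deliberately NOT
    flagged, which is what keeps unrelated repositories out of scope.
    """
    return [
        p for p in repo_dir.rglob("*")
        if p.is_file() and sha256_file(p) in leak_hashes
    ]
```

The trade-off is explicit: exact hashing misses copies with even trivial edits, but that is the point of the design. It sacrifices recall to drive false positives on independently authored projects toward zero, which is the failure mode the Anthropic incident exposed.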

Transparency would help rebuild trust. When a company admits to an erroneous takedown, publishing a post-mortem that explains how the mistake occurred, how many projects were affected, and what safeguards are being added can reassure developers that the incident is not being brushed aside. Clear public guidelines about what kinds of code the company will and will not target can also reduce uncertainty for researchers and maintainers working near the same technical areas.

Most importantly, AI firms need to recognize that their success is intertwined with the health of the open-source ecosystem. The models they build rely on open research, shared tools, and community-maintained infrastructure. Treating that ecosystem primarily as a vector of risk, rather than as a partner, invites backlash and erodes the goodwill they depend on.

A Stress Test for the AI–Open Source Relationship

The Anthropic takedown episode is less about one company’s misstep than about a system that makes such missteps almost inevitable. A legal framework designed for media piracy now governs disputes over AI model code. Platforms caught in the middle are pushed to act first and evaluate later. And the people who bear the brunt of errors are often those with the least resources to fight back.

As AI becomes more central to software development, conflicts between proprietary models and open-source code are likely to intensify. Whether that future is defined by recurring clashes and accidental collateral damage, or by more careful stewardship of shared infrastructure, will depend on how companies, platforms, and policymakers respond to incidents like this one. For now, the message to developers is clear: even if you have never touched an AI company’s code, you can still find your work entangled in its efforts to keep that code locked down.

*This article was researched with the help of AI, with human editors creating the final content.