Philadelphia Live News

collapse
Home / Daily News Analysis / The US government’s Anthropic models ban was never about an AI jailbreak

The US government’s Anthropic models ban was never about an AI jailbreak

Jun 25, 2026  Twila Rosenbaum  2 views
The US government’s Anthropic models ban was never about an AI jailbreak

On a quiet Friday afternoon, the U.S. Commerce Department sent a letter to Anthropic that would send shockwaves through the AI industry. The letter invoked an obscure export control directive that effectively banned non-Americans—including the company's own employees—from accessing two of its most advanced models, Fable 5 and Mythos 5. The reason given: an unspecified national security concern. Anthropic, left in the dark, immediately pulled both models offline to ensure compliance. The move was swift, unilateral, and appeared to require no court approval.

The intervention by the Trump administration was not about a technical jailbreak or a critical vulnerability. According to security researcher Katie Moussouris, founder of Luta Security, the incident was triggered by a paper describing a guardrail bypass in Fable 5. The paper, written by security researchers at Amazon and shared privately with Anthropic, detailed how users could ask the model to review code for security issues rather than fix it—a distinction with no meaningful difference in outcome. Moussouris argued that such behavior cannot be fixed without weakening the model for defense purposes, and that the government's reaction was hasty and misguided.

What Actually Happened

The timeline of events reveals a government acting on incomplete information. Axios reported that personality differences between Anthropic and the Trump administration played a larger role than any technical flaw. The export control directive, part of the International Emergency Economic Powers Act (IEEPA), is typically reserved for controlling the spread of weapons technology or sensitive cryptographic tools. Using it against a commercial AI model—one that was already publicly available—set off alarms across the tech industry.

Moussouris publicly criticized the directive, calling it dangerous to national security because it stripped network defenders of advanced cybersecurity capabilities. Dozens of other top security researchers joined her in calling for the order to be revoked. They warned that the move would make the United States less safe by hindering the very tools that protect critical infrastructure from malicious actors.

A Pattern of Overreaction

This is not the first time the U.S. government has overreached with export controls on dual-use technology. During the 2010s, language used to fix export laws covering cybersecurity tools was so broad that it nearly outlawed legitimate security and vulnerability research. The current directive appears to repeat that mistake, albeit with far greater consequences for the AI industry.

The Trump administration's decision seems retaliatory rather than rooted in genuine security concerns. Justin Hendrix, editor of Tech Policy Press, warned that such actions would raise alarms in foreign capitals about the reliability of American AI for critical applications. If the U.S. government can unilaterally shut down a major AI service without due process, no international partner can trust that the models will remain available.

Implications for the AI Industry

The precedent set by this case is chilling for any technology company operating in the United States. The government has demonstrated that it can force a company to pull its products offline with a single letter, bypassing courts and legislative review. This creates an environment of uncertainty and risk for AI labs that need to assure customers of their reliability.

Anthropic, which had built a reputation for safety and transparency, now faces a crisis of trust. Customers who relied on Fable 5 and Mythos 5 for cybersecurity analysis, threat detection, and other defense applications were suddenly cut off. The company scrambled to communicate with users, offering little more than assurances that it was working with the government to resolve the situation.

Meanwhile, the broader AI ecosystem watches closely. If the government can target Anthropic over a paper about a guardrail bypass—one that experts say is fundamental to how AI models work—then no AI company is safe. The message is clear: comply with opaque directives or risk being shut down.

Criticism from Security Experts

Katie Moussouris, a respected cybersecurity veteran, laid out the technical flaws in the government's reasoning. The guardrail bypass described in the Amazon paper involves asking the model to review code for security issues, which the model does by generating the same output it would if asked to fix the code. The end result is identical. Attempting to block this behavior would require removing the model's ability to reason about security altogether, rendering it useless for defensive tasks.

Moussouris called the export control directive a dangerous overreaction that would hurt American cybersecurity. She noted that the U.S. government's own agencies use AI models for defense, and restricting access to these capabilities weakens national security. The irony is that the government's actions may have done more to harm security than any theoretical jailbreak could.

Political Motivations

The involvement of the Trump administration adds a layer of political intrigue. Sources told Axios that personality differences between Anthropic's leadership and the administration had been simmering for months. Some speculate that the directive was a way to pressure the company into aligning more closely with the White House's agenda. Others suggest that Amazon CEO Andy Jassy may have raised concerns with senior officials, either out of competitive spite or a genuine misunderstanding of the research.

What is clear is that the government has not provided a detailed explanation for its actions. The letter remains confidential, and officials have not clarified why an export control directive was the appropriate tool. This lack of transparency only fuels speculation and distrust.

Justin Hendrix observed that the climate is one of suspicion, where senior officials appear to pick favorites based on personal and political factors. The aftermath of this directive will be a chilling effect on innovation, as companies weigh the risk of government interference against the benefits of releasing new models.

A Dangerous Precedent

The U.S. government's intervention in Anthropic's operations is a warning to the entire tech industry. It shows that no company, no matter how careful or compliant, is immune from unilateral government action. The export control directive, originally designed to keep advanced technology out of the hands of adversaries, has been repurposed as a tool of corporate control.

Security researchers and civil liberties groups are already planning legal challenges, arguing that the directive violates due process and free speech protections. The case could redefine the boundaries of government power over software released in the United States.

In the meantime, Anthropic remains in limbo, unsure when or if its models will be restored. The company that once championed safe AI development now finds itself at the mercy of a system that values speed and control over technical nuance. For the rest of the AI industry, the message is unmistakable: the government can shut you down at any moment, for reasons that may have nothing to do with the technology itself.


Source: TechCrunch News


Share:

Your experience on this site will be improved by allowing cookies Cookie Policy