“Defenders need to be able to ask AI to fix the bugs in a file, explain why the fix matters, and write tests that confirm the patch works,” Katie Moussouris wrote, summing the argument that has launched a fierce debate over the government’s decision to restrict Anthropic’s Fable 5.
Department of Commerce export controls and the White House response
Last Friday the Department of Commerce imposed export controls on Anthropic’s Fable 5, a move the Trump administration said was motivated by reports that researchers had jailbroken the model soon after its public release. According to reporting cited by CyberScoop, Amazon and another cybersecurity researcher asserted they had found ways to bypass Fable 5’s guardrails, prompting officials to conclude that techniques available to U.S.-based researchers would also be available to foreign adversaries. The Commerce Department action led Anthropic to shut off the models for all users while the company sought to persuade the White House to reconsider the restrictions.
Anthropic’s guardrails, testing, and access controls
Anthropic has taken several steps it says were designed to limit risk: it declined to publicly release its Mythos model for commercial sale, routed Mythos access to organizations for cyber defense work, and built Fable 5 so that on sensitive topics — including cybersecurity and biological warfare — it defaults answers to older, less powerful models. Anthropic also reports that Fable 5 underwent 1,000 hours of testing from internal and external red teamers and that those reviews did not find a universal jailbreak that would remove the guardrails or let Fable 5 access Mythos for cyber and biology tasks.
Katie Moussouris and the third‑party research she reviewed
Katie Moussouris said she was provided a copy of third‑party research on guardrail bypass techniques for Fable 5. The researchers set three Claude models — Fable 5, Mythos and Claude Opus — to review batches of known, vulnerable open‑source code for security issues. Moussouris said Fable 5 initially refused the request, but that researchers used “a multistep and manual process” to get the model to turn its output into automated scripts capable of testing patches for the vulnerability. She emphasized that the capability demonstrated — having a model find, fix, and test a patch — is precisely the defensive loop security teams need, and she characterized broad export restrictions on Fable 5 as “heavy handed” and “misguided.” Moussouris also noted her prior technical work on the Waasenaar Agreement.
Expert community pushback and the “Free Fable” letter
Dozens of cybersecurity experts signed an open letter Monday urging the Trump administration to “Free Fable.” The signers argue that Mythos‑class models are “quite good” at finding and exploiting software vulnerabilities but “are not uniquely good” compared with other frontier models used for defensive work. The letter highlights that OpenAI’s Daybreak provides similar vulnerability discovery and patching capabilities yet was not included in the Commerce Department’s restrictions.
The researchers also criticized Fable 5’s guardrails as overly sensitive at launch, calling them “a source of humor in the cyber community on launch day” because IT and cyber workers reported the model would not perform basic defensive tasks. The letter questions whether the observed jailbreaking work constitutes offensive capability and notes that similar behaviors can be reproduced in other commercial and open‑source models — including GPT 5.5, Claude Opus, Claude Sonnet and Chinese models such as Kimi 2.7. “The justification for this unprecedented action was that Fable provides a unique ‘uplift’ of capabilities beyond other AI models,” the letter states, arguing that AI has been finding bugs and generating working exploits at superhuman levels since last year.
Public concern and the political backdrop
The White House decision arrives amid growing public pressure for AI regulation. A Johns Hopkins University poll in May found broad, bipartisan support for more rules: 73% of respondents favored bans on AI‑generated images and video, 68% wanted labels on AI content, 75% supported disclosure laws for AI chatbots, and 70% backed the “right to interact with a human rather than an AI in medical, legal, educational and government settings.” A separate global survey of 18,000 people named misuse enabling misinformation, deepfakes, facilitating criminal hacks, and helping terrorists create new weapons among the top four public concerns about AI.
What this means for technologists, policymakers, and defenders
- Technologists and security teams: Researchers argue they need access to frontier models that can “execute the find, fix, and test loop” — capabilities the community says are essential for defensive security and that have been demonstrated across multiple models.
- Policymakers and regulators: The Commerce Department’s move spotlights a dilemma: whether to treat a single model as uniquely dangerous, or to craft rules that address capabilities available across multiple commercial and open‑source systems.
- Defenders and enterprises: Anthropic’s temporary shutdown of model access and reports of oversensitive guardrails have already disrupted how some IT and cyber teams use these tools for routine vulnerability work.
The immediate conflict is clear: the administration acted on reports of early jailbreaking and limited foreign access, while many security experts say the research does not show a unique, exploitable uplift in Fable 5 compared with other frontier models. Anthropic has cut access and pressed the White House to reverse course; the expert community has pressed back with public letters and technical critique. The next concrete step — whether Commerce keeps the export controls in place or adjusts them after the community’s rebuttals — will determine whether this episode becomes an isolated intervention or a precedent for how governments regulate advanced AI capabilities.




