eifachposte

eifachposte

Government concerned that the advanced cyber capabilities of Fable and Mythos would be available ‘to people who shouldn’t have them.’ David Sacks, co-chair of the President’s Council of Advisors on Science and Technology and the administration’s former AI czar, said the U.S. government warned Anthropic that Claude Fable 5 had been jailbroken and that CEO Dario Amodei refused to fix the flaw or pull the model. In a post on X on Saturday, Sacks laid out the administration’s account a day after it ordered both Fable 5 and Mythos 5 disabled worldwide . Sacks said the administration issued the export control “reluctantly” after that refusal, that it wants the restriction lifted once the jailbreak is patched, and that “the ball is in Anthropic’s court.” Sacks claims that a trusted partner of both Anthropic and the U.S. government, testing Fable, came forward with a jailbreak of the guardrails that separate the consumer model from the unrestricted cyber capabilities of Mythos, the model it’s built on. He said the administration asked Amodei to fix the bypass or de-deploy the model, and that Amodei declined. Anthropic instead prioritized keeping its consumer model live over safety, Sacks wrote, calling that inconsistent with the company’s positioning as a safety-first lab that had itself lobbied for Mythos to be regulated as a cyberweapon . I’ve had a number of conversations with folks inside and outside government about the current situation with Anthropic, and here is what I believe to be true: — As we know, Anthropic publicly released its Mythos class models earlier this week under the commercial name Fable. — Fable is Mythos with guardrails. But if those guardrails fail, then you’ve exposed Mythos and its advanced cyber capabilities to people who shouldn’t have them. (Keep in mind that Anthropic itself widely promoted the idea that Mythos was a cyberweapon and needed to be regulated as such. They asked for government regulation of Mythos and championed the guardrails on Fable. If there is a vulnerability — big or small — it is Anthropic’s responsibility to patch.) — A highly credible trusted partner of both Anthropic and the USG who was testing Fable came forward with a jailbreak of those guardrails. The Admin asked Dario to fix the jailbreak or de-deploy the model. Dario refused. — In their blog post, Anthropic defended its decision by saying the jailbreak isn’t serious. That is not what the trusted partner and the USG believe; nor is that kind of minimizing language consistent with Anthropic’s brand as the AI safety company. It’s difficult to fathom how they could claim a jailbreak allowing operability of a cyber weapon could be defined as not “serious.” — In the past, Anthropic has always said that safety must be top priority and taken super seriously. In this case, Anthropic prioritized the continued offering of the consumer model over safety. — In reaction, the Admin issued the export control. The Admin did this reluctantly. It’s been very surprised that Anthropic hasn’t wanted to cooperate with a reasonable safety request (ie fixing the jailbreak issue). Anthropic’s reaction is very much at odds with their branding and ethos as a safe AI research community. — The Admin’s hope now is that Anthropic remediates the safety issue, the export control is lifted, and Fable goes back into general release. The Admin wants all of this to happen as soon as possible. It is frankly bewildered that Anthropic hasn’t wanted to comply with safety requests that it previously said were its highest priority. — Those trying to misdirect and tie this action to the prior DoW/Anthropic issues are wrong. The Admin values Anthropic’s technical capabilities and feels that this issue, while serious, should be easily resolved. The ball is in Anthropic’s court. Sacks also moved to separate the action from Anthropic’s earlier clashes with the government, writing that anyone tying the export control to those disputes is wrong, and that the administration values Anthropic’s technology and sees the issue as easily resolved. A person close to the White House told the news outlet Semafor that Amazon flagged the jailbreak to the government, and that Amazon CEO Andy Jassy had been in contact with the administration about it. Amazon, which has invested billions in Anthropic and supplies much of its cloud computing , didn’t confirm the details, with a spokesperson telling Semafor that governments often seek its counsel on security risks and that it doesn’t discuss those conversations. This isn’t the first time Mythos access has leaked; it happened back in April when unauthorized third parties reached the restricted model using information from a data breach. Anthropic’s public position is that the bypass is narrow and non-universal, that it amounts to asking the model to read a codebase and identify software flaws, and that the same result can be produced on other public models, including OpenAI’s GPT-5.5. The company has stated that it disagrees with the notion that a narrow jailbreak should necessitate the recall of a model used by hundreds of millions of people. Sacks rejects this, arguing that a bypass enabling operation of a cyberweapon is difficult to define as anything other than serious. https://www.tomshardware.com/tech-industry/artificial-intelligence/trump-adviser-david-sacks-says-anthropic-refused-to-fix-fable-5-jailbreak-before-us-export-controls submitted by /u/bakanoace

Originally posted by u/bakanoace on r/ClaudeCode

US government warned Anthropic that Fable 5 had been jailbroken, but firm 'refused' to fix before US implemented export controls — Anthropic defended its decision by saying the jailbreak 'isn’t seriou

US government warned Anthropic that Fable 5 had been jailbroken, but firm 'refused' to fix before US implemented export controls — Anthropic defended its decision by saying the jailbreak 'isn’t seriou