5Models·Jun 17

The White House Wants Anthropic to Block All Jailbreaks. That May Not Be Possible

The Trump administration has conditioned the potential rerelease of Anthropic's Fable 5 model on the company's ability to guarantee that its safety guardrails remain impenetrable. While officials demand a system that cannot be bypassed, security researchers caution that creating an unhackable AI model is technically unfeasible given the current state of adversarial testing. This standoff highlights a growing regulatory push to hold developers strictly accountable for model security, even as technical experts warn that perfect protection against jailbreaking remains elusive.

Covered by 3 sources

WWired AI↗Hugo LowellJun 17
TTechCrunch AI↗Theresa Loconsolo, Anthony Ha, Rebecca Bellan, Sean O'KaneJun 19
TTechCrunch AI↗Theresa LoconsoloJun 19

The White House Wants Anthropic to Block All Jailbreaks. That May Not Be Possible

Covered by 3 sources

Related stories