💻 Technology Live

The White House Wants Anthropic to Block All Jailbreaks. That May Not Be Possible

Trump administration officials tell WIRED that if Anthropic wants to rerelease Fable 5, it will need to ensure the model's guardrails can't be circumvented. Security experts say that can't be done.

Wired

17 Jun 2026 4 days ago 1 min read

The White House Wants Anthropic to Block All Jailbreaks. That May Not Be Possible

Wired — 17 June 2026

Text:

11 0 0

🎙️ AI Podcast — Two-Host Discussion

The White House Wants Anthropic to Block All Jailbreaks. That May Not Be Possib…

Kokoro TTS · ~5 min episode · American English voices

Choose voices for Host A and Host B. Changes take effect on next play.

Host A 🟥

Host B 🟦

Trump administration officials tell WIRED that if Anthropic wants to rerelease Fable 5, it will need to ensure the model's guardrails can't be circumv

Read Full Story at Wired →

⚡ Quickyla Analysis Original editorial context — not sourced from the article above

The White House’s demand that Anthropic prevent all possible "jailbreaks" of its AI model Fable 5 underscores a growing tension in the regulation of artificial intelligence: the near-impossibility of fully securing systems that are inherently designed to adapt and interpret language flexibly. Jailbreaks—methods that bypass safety guardrails to elicit restricted outputs—have become a persistent challenge in AI governance, exposing the limits of technical fixes in an era where models are trained on vast, uncurated datasets. What makes this case particularly fraught is its timing. As the Trump administration pushes for rapid AI deployment, the White House’s stance suggests a preference for rigid, enforcement-driven controls over the nuanced, risk-based approaches favored by many in the field. This could set a precedent where future model releases are contingent on meeting unrealistic security standards, potentially stifling innovation or pushing developers toward less transparent solutions. A deeper layer of the problem lies in the nature of AI itself. Guardrails in models like Fable 5 rely on probabilistic reasoning, meaning they are not hard-coded rules but approximations that can be probed and circumvented through clever input manipulation. Security experts often compare this to securing a building with windows that can be cracked if the right pressure is applied—no amount of reinforcement eliminates the possibility entirely. The broader context here is the broader race between adversarial actors and AI developers, where each new safeguard is met with increasingly sophisticated bypass techniques. This cat-and-mouse dynamic is why organizations like the National Institute of Standards and Technology (NIST) have emphasized layered defenses rather than absolute guarantees. The open question now is whether the White House’s ultimatum will lead to a stalemate with Anthropic or prompt a shift in regulatory strategy. If enforcement becomes punitive, companies may prioritize weaker, more controllable models over cutting-edge ones, skewing the AI landscape toward safer but less capable systems. Alternatively, this could accelerate investment in adversarial testing and real-time monitoring, turning the pressure into a catalyst for better safety practices. Either way, the episode highlights a fundamental truth about AI governance: absolute security is a myth, and the focus must shift to managing, not eliminating, risk.

usa ticker:GE

React:

Read original story at Wired

Sources

Wired

The White House Wants Anthropic to Block All Jailbreaks. That May Not Be Possible

Share this story

More to Read