Claude Fable 5: How to Use It, How It Differs From Opus, and the 19 Days It Vanished
Claude Fable 5: How to Use It, How It Differs From Opus, and the 19 Days It Vanished
Anthropic shipped its most capable public model, Claude Fable 5, on June 9, 2026 — a "safe-for-general-use" cut of its internal Mythos class. Three days later the U.S. government pulled the plug. Nineteen days after that, it came back. If you only skimmed the headlines, here's the part that matters to you as a builder: claude-fable-5 is a real model ID you can call today, it behaves differently from Opus in a couple of ways that will bite you if you're not paying attention, and the shutdown story is a genuinely useful lesson in model dependency.
Let's do the practical stuff first, then the drama.
What Fable 5 actually is
Fable 5 is Anthropic's public release of a Mythos-class model — the tier they'd previously kept internal because of misuse risk. It's state-of-the-art on nearly every capability benchmark Anthropic tested: software engineering, knowledge work, vision, scientific research. The headline capability is duration: it can sustain days-long, asynchronous, multi-step tasks that earlier models fell apart on.
It is not a replacement for Opus 4.8 for most workloads. It's a bigger, more expensive, more heavily-guarded hammer. Whether you need it depends entirely on whether your tasks are actually Mythos-hard.
How to use Claude Fable 5
The migration is deliberately boring — that's the point.
- Model ID:
claude-fable-5(compare: Opus isclaude-opus-4-8) - Context window: 1,000,000 tokens
- Max output: 128,000 tokens per request
- Availability: Claude API, Claude Code, Claude.ai, Amazon Bedrock, Google Cloud (Vertex), and Microsoft Foundry
If you're already on Opus, switching is one line — swap the model ID:
from anthropic import Anthropic
client = Anthropic()
msg = client.messages.create(
model="claude-fable-5", # was: claude-opus-4-8
max_tokens=4096,
messages=[{"role": "user", "content": "Refactor this service and write the migration plan."}],
)
print(msg.content[0].text)
The gotcha nobody reads until it breaks
On Opus 4.8, extended thinking is optional and off by default. On Fable 5, thinking is always on and you cannot turn it off. That has two consequences you need to budget for:
- Latency. Every call reasons before it answers. Great for hard problems, wasteful for "uppercase this string."
- Cost. Thinking tokens are output tokens, and output tokens are the expensive ones (more below). Fable 5 will spend tokens thinking whether your task needed it or not.
Rule of thumb: route only genuinely hard, long-horizon work to Fable 5. Keep your cheap, high-volume calls on Opus, Sonnet, or Haiku. Don't make Fable your default model — make it a deliberate escalation.
How Fable 5 differs from Opus 4.8
Three differences matter in practice: price, context, and the safety rerouting.
┏━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━┓
┃ Dimension ┃ Opus 4.8 ┃ Fable 5 ┃
┣━━━━━━━━━━━━━━━━━━╋━━━━━━━━━━━━━━━━━━╋━━━━━━━━━━━━━━━━━━┫
┃ Model ID ┃ claude-opus-4-8 ┃ claude-fable-5 ┃
┃ Context window ┃ 200K tokens ┃ 1M tokens ┃
┃ Max output ┃ (standard) ┃ 128K tokens ┃
┃ Thinking ┃ Optional, off ┃ Always on ┃
┃ Input / 1M tok ┃ $5 ┃ $10 ┃
┃ Output / 1M tok ┃ $25 ┃ $50 ┃
┃ Batch in/out ┃ — ┃ $5 / $25 ┃
┗━━━━━━━━━━━━━━━━━━┻━━━━━━━━━━━━━━━━━━┻━━━━━━━━━━━━━━━━━━┛
Pricing is exactly 2× Opus 4.8 on every dimension. That's not an accident — it makes the migration math trivial: whatever you spend on Opus for a given token volume, double it for Fable. One useful arbitrage: Fable's batch price ($5/$25) equals Opus 4.8's standard price. If your Fable workload tolerates async batch processing, you get Mythos-class output at Opus-standard cost.
Context is 5× larger — 1M vs 200K. This is the real reason to reach for Fable on genuinely large inputs (whole-repo reasoning, long document sets) rather than chunking.
The safety rerouting is a real API behavior, not a footnote
Because Mythos-class capability carries actual misuse risk, Fable 5 ships with classifiers covering cybersecurity, biology/chemistry, and model distillation. When a request trips one, the answer is silently handled by Claude Opus 4.8 instead, and the response tells the user it happened. Anthropic says this triggers in under 5% of sessions on average.
For most developers this is invisible. But if you're building in security tooling, bioinformatics, or anything that reads as dual-use, plan for the possibility that some fraction of your Fable calls will quietly downgrade to Opus. Log which model actually answered.
The conspiracy: released, then shut down, then back
Here's the part everyone actually wants to talk about — and unlike most "AI conspiracy" chatter, the spine of this one is documented fact.
The timeline:
- June 9, 2026 — Fable 5 launches publicly. Notably, this came days after Anthropic had publicly warned that AI was becoming too dangerous — releasing its most powerful public model in the same breath. That whiplash is where the theories start.
- June 12 — The U.S. Department of Commerce issues a national-security directive suspending all access to Fable 5 (and Mythos 5) for any foreign national — inside or outside the U.S., including Anthropic's own foreign-national employees. The trigger: Amazon researchers documented a jailbreak that got Fable to flag software vulnerabilities and, in one case, to write code showing how a flaw could be abused.
- ~19 days offline — Fable 5 effectively goes dark for a large share of the world.
- June 30 — Commerce lifts the controls.
- July 1 — Anthropic begins redeploying Fable 5 to global users.
Anthropic's counter-argument is the crux of the "was it ever really that dangerous?" theory: they say their own testing found that other widely available models — including OpenAI's GPT-5.5 — could produce the same outputs flagged in the Amazon report. In other words, the capability wasn't unique to Fable; the report just happened to test Fable. Anthropic then built a new classifier that neutralizes the documented bypass in over 99% of attempts, and is pushing an industry-wide rubric to score jailbreak severity on four axes: capability gain, breadth of gain, ease of weaponization, and discoverability.
So what are the actual theories? Reading the developer community, they cluster into three:
- "It was competitive/regulatory theater." A single Amazon-authored report triggers a government shutdown of an Anthropic model — from the company that runs Bedrock, where Fable also ships. Skeptics note the conflict of interest. Anthropic's "GPT-5.5 does it too" rebuttal fuels this reading.
- "The warning was the marketing." Anthropic loudly warns AI is too dangerous, then ships the most dangerous public model days later. The shutdown-and-return arc kept Fable in the headlines for three weeks. Cynics call it the most effective launch campaign of 2026.
- "It was exactly what it looked like." The most boring theory, and probably the most correct: a real dual-use capability got flagged, the government overreacted with a blunt instrument (banning foreign employees from a shipped product is extraordinary), Anthropic patched it, and cooler heads lifted the order. No grand plot — just AI policy being improvised in real time.
The honest read: there's no evidence of a manufactured stunt, but the episode exposed how one report can take a production model offline for 19 days by regulatory fiat.
The real lesson for builders
Strip the intrigue and Fable 5's first month is a case study in model dependency risk. Teams that had hard-coded claude-fable-5 as their only path spent 19 days broken. Teams that had a fallback — route to Opus 4.8 when Fable is unavailable — barely noticed.
That's the takeaway worth keeping:
- Never single-source a model for a production path. Fable's own safety layer already reroutes to Opus; your architecture should be able to do the same on purpose.
- Abstract the model ID behind config, not code. The people who migrated in "one line" are the same people who can fall back in one line.
- Log which model actually served each request — because with Fable, it isn't always the one you asked for.
Fable 5 is a genuinely powerful tool and, at batch pricing, a surprisingly reasonable one. Just don't let it become a single point of failure. The last three weeks proved it can be — through no fault of the code.
FAQ
What is the Claude Fable 5 model ID? claude-fable-5.
How much does Fable 5 cost? $10 per million input tokens and $50 per million output tokens — exactly 2× Opus 4.8. Batch pricing is $5/$25 (equal to Opus 4.8 standard).
Can I turn off thinking on Fable 5? No. Unlike Opus 4.8, extended thinking is always on and cannot be disabled.
How is Fable 5 different from Opus 4.8? Bigger context (1M vs 200K), always-on thinking, 2× price, Mythos-class capability, and safety classifiers that reroute cyber/bio-chem/distillation requests to Opus 4.8 in under 5% of sessions.
Why was Claude Fable 5 shut down? A June 12 U.S. Commerce directive suspended foreign-national access after Amazon researchers documented a jailbreak eliciting software-vulnerability exploitation. Controls were lifted June 30; Fable returned July 1.
Is Fable 5 available now? Yes — as of July 1, 2026, it's redeployed globally on the Claude API, Claude Code, Claude.ai, Bedrock, Vertex, and Foundry.
Sources: Anthropic — Claude Fable 5 & Mythos 5 · Anthropic — Statement on the US directive · Anthropic — Redeploying Fable 5 · Claude Platform Docs — Introducing Fable 5 & Mythos 5 · TechCrunch · CNBC — controls lifted · The Hacker News · Finout — pricing & benchmarks vs Opus 4.8 · Espressio — API access & what's different from Opus