Anthropic's strongest model has two versions. You get the weaker one.

Anthropic just released the most capable model it has ever built. Then it split that model in two, locked the powerful half behind a government partnership, and gave the public a version with training wheels. The headline benchmarks you'll see shared everywhere? Most of them belong to the model you cannot buy.

This is Claude Fable 5 and Claude Mythos 5, launched today (June 9, 2026). Same underlying architecture. Same weights. Two radically different levels of access. And a 30-day data retention policy that quietly rewrote Anthropic's privacy promises.

What Actually Happened

Anthropic released a single frontier model under two product names:

Claude Fable 5 ($10/1M input, $50/1M output): Generally available via API, claude.ai, GitHub Copilot, AWS, GCP, and Microsoft Foundry. Comes with safety classifiers that monitor every prompt for cybersecurity, biology, chemistry, and distillation content. When a classifier triggers, the query silently hands off to Claude Opus 4.8, a less capable model. Fewer than 5% of sessions hit this handoff, but for certain workloads (security research, bioinformatics), you'd be paying Fable 5 prices to get Opus 4.8 performance.
Claude Mythos 5: Same model, no safety classifiers on those domains. Not for sale. Access restricted to vetted partners through Project Glasswing (cyberdefenders, critical infrastructure providers, select biomedical researchers). Expanded to ~150 organizations across 15+ countries.

The catch that nobody's leading with: Anthropic retains all prompts and outputs for 30 days to run those safety classifiers. This data is not used for training, but it's stored. For GitHub Copilot users, this is a hard shift from zero data retention. Business and Enterprise admins must explicitly acknowledge this policy to enable Fable 5.

Benchmarks and the Asterisk Problem

Anthropic's benchmark table looks impressive at a glance. SWE-Bench Pro at 80.3% (coding). FrontierCode scores at medium effort that beat Opus 4.8 at full effort. A 50-million-line Ruby codebase migration completed in days rather than months. The model even beat published genomics research in Science while being 100x smaller.

But here's what the marketing won't emphasize: many of those benchmarks are marked with an asterisk, meaning they were tested on Mythos 5, not Fable 5. The safety classifiers pull Fable 5's performance closer to Opus 4.8 on certain tasks. On OSWorld-Verified (computer use), the older Mythos Preview actually edges out the new model (85.4% vs 85.0%). The model you can buy is measurably worse than the model Anthropic is showing off.

One impressive demo that applies to both versions: Fable 5 completed Pokemon FireRed using only raw screenshots, no external navigation aids. That kind of spatial reasoning and long-horizon planning is genuine.

The Pricing Math

At $10 per million input tokens and $50 per million output tokens, Fable 5 is exactly 2x the price of Opus 4.8 across the board. Prompt caching gives a 90% discount on input tokens, which softens the blow for repeat-heavy workflows.

The subscription angle is interesting: Fable 5 is currently included in Pro, Max, Team, and Enterprise plans at no extra cost. But this "included" window closes June 22, 2026. After that, usage draws from your credits. Anthropic says this is temporary until capacity catches up, but it reads more like a trial period to get teams hooked before the bill arrives.

Community Reaction

The r/ClaudeAI threads are split. Power users are excited about the raw capability. "It feels a bit magical," one user wrote about Fable 5. Others immediately flagged the data retention change as a dealbreaker for enterprise adoption. The r/ClaudeCode community noted it's "a neutered version of Mythos" and questioned whether the premium price is justified when the model silently downgrades on security and biology topics.

The privacy conversation is louder than the capability conversation. Anthropic previously positioned itself as the privacy-first AI company. Now every Fable 5 prompt is stored for a month. The company's explanation (safety classifiers need the data to detect novel jailbreak attempts) is technically sound but commercially inconvenient for teams with compliance requirements.

Don't Confuse It

The Claude model naming is getting confusing. Here's the current lineup:

Claude Fable 5: The new public flagship (this article)
Claude Mythos 5: Same model, no safety classifiers, restricted access
Claude Opus 4.8: Previous flagship, now the "fallback" model when Fable 5's classifiers trigger
Claude Mythos Preview: Earlier Mythos-class model, slightly better on computer use benchmarks than Fable 5
Claude Sonnet 4.5, Haiku 4.5: Lower-tier models, zero data retention

Fable 5 is NOT the same as Fable, which is NOT the same as Mythos Preview. The "5" in Fable 5 refers to the 5th model generation, not a version number relative to other Fable releases.

So What

The two-tier split is the real story here, not the benchmark numbers. Anthropic just showed the industry what controlled deployment looks like: ship your most capable model, but gate the dangerous parts behind government partnerships and let safety classifiers quietly route certain work to a weaker model. It's a playbook other frontier labs will copy.

The data retention policy is the piece that matters for anyone building on this. If you're a startup shipping products on Claude, your users' prompts are now stored for 30 days. That's a different compliance conversation than "zero data retention." Anthropic needed the retention to make the safety architecture work, but they chose to ship it anyway rather than delay the launch.

What surprised me: the Pokemon demo. Not because it's flashy, but because it demonstrates genuine long-horizon spatial reasoning without external scaffolding. That capability, available to the public (minus the safety-classified domains), is the kind of thing that actually moves real applications forward. The benchmark wars are marketing. The spatial reasoning is real.

Sources:

What Actually Happened

Benchmarks and the Asterisk Problem

The Pricing Math

Community Reaction

Don't Confuse It

So What

RELATED_ENTRIES

The Government Just Killed Anthropic's Best Models Over a Single Jailbreak

Your coding agent wastes tokens thinking. This one doesn't.

Your AI Agent Will Burn Your AWS Budget. Here's Proof.