// Wednesday · June 10, 2026

Fable 5 Raises the Bar for AI Ambition

Anthropic ships Fable 5 — the first Mythos-class model and, fairly undisputedly, the best AI we've ever used. But at this level, getting the most out of state-of-the-art isn't your old prompts with a new model: it's controversial guardrails, usage-based reality, and a new skill called task imagination.

Ad-free on Patreon
Today's sponsors — KPMG · Section · Zencoder · OutSystems · all offers →
The One Idea

Fable 5 doesn't just beat the benchmarks — it raises the bar for ambition.

Anthropic's first Mythos-class model is a genuine leap, especially on agentic coding, where it one-shots things that used to take teams months. But the real story is what it demands of us: in a usage-based world, we have to become token-efficiency optimizers who match models to use cases, and we have to develop "task imagination" — the ability to hand an agent days of responsibility, not minutes of tasks. The frontier has moved from answers to tasks to responsibilities, and most of us aren't dreaming big enough to use it.

// 01

By the Numbers

80.3%
Fable 5 on SWE-bench Pro vs GPT-5.5's 58.6% and Opus 4.8's 69.2%
29.3%
Fable 5 on the new Frontier Code benchmark — more than double Opus 4.8
91/100
Fable 5 on Every's senior-engineer benchmark vs 62-63 for rivals
78%
Fable/Mythos 5 on Exploit Bench vs GPT-5.5's 34%
50M lines
Ruby codebase Stripe migrated in a day with Fable — was a 2-month team job
95%
Of Fable sessions have no safety fallback to Opus, per Anthropic
30 days
Mandatory prompt/output retention with human review on Mythos-class models
100+ hrs
How long the new model class can run on a single goal
// 02

The Brief

ModelsExecEng00:00

Anthropic launches Fable 5, the first Mythos-class model

On Tuesday June 9th, Anthropic released Claude Fable 5 — NLW calls it fairly undisputedly the best AI model we've ever been able to use. It arrives just weeks after Opus 4.8, which still plays a big role in the Fable-led ecosystem.

AI Daily Brief
Models02:30

Mythos 5 is the unrestricted twin — and almost nobody gets it

Mythos 5 is effectively the same model as the public Fable 5, minus the controversial safeguards. It's available only through Project Glasswing, deployed in collaboration with the US government, with broader "trusted access" promised later.

AI Daily Brief
Models03:00

A new tier above Opus — and the first full base-number jump since GPT-5

Anthropic added "Fable" as a class above haiku, sonnet, and opus. It's also the first time since GPT-5's disastrous August rollout that any lab put a full new base number on a model — the turn-of-2026 jumps were all 4.5/4.6. The naming alone signals Anthropic isn't playing.

AI Daily Brief
ModelsEng04:15

The benchmark jump is big enough to actually mean something

NLW usually distrusts saturated benchmarks, but the leaps here are real: 80.3% on SWE-bench Pro, 88% on Terminal Bench, and a top blended ranking from Artificial Analysis. The model's clear purpose is agentic coding.

AI Daily Brief
ModelsEng06:30

Cognition's Frontier Code tests whether code is actually mergeable

The new benchmark combines unit tests with assessments of scope, discipline, style, and codebase standards — measuring not just whether code works, but whether it's good enough to merge into production. Fable 5 scored 29.3%, more than doubling Opus 4.8's 13.4%.

AI Daily Brief
ModelsEng07:15

More than half of SWE-bench results is unmergeable slop.

— Sean Wang, Cognition / Latent Space. Cognition's Sean Wang's framing for why Frontier Code is needed: even code that nominally solves a problem is often unusable by the organization running it.

The AI Daily Brief
BusinessFinanceExec07:45

Fable gets pulled from subscriptions — the token-scarcity era is here

API costs are double Opus, and Anthropic is positioning subscription access as an introductory offer: Fable will be removed from Pro-tier plans on June 23rd, after which access is pay-per-usage. More evidence we're in a firmly usage-based pricing paradigm.

AI Daily Brief
ModelsLegalProduct09:15

The biology guardrails are tripping over "mitochondria"

Users report Fable refusing or rerouting on basic biology — the word "cancer" flagged as a biosecurity risk, "tell me about mitochondria" pausing the chat. Anthropic admits it ratcheted up bio/chem filtering precisely because the model is more capable.

AI Daily Brief
ModelsProduct10:00

Sensitive requests silently fall back to Opus 4.8

When Fable's classifiers detect cybersecurity, biology/chemistry, or distillation requests, the response is handled by Opus 4.8 instead, with the user informed. Anthropic says 95% of sessions never hit a fallback, arguing a downgrade beats an outright refusal.

AI Daily Brief
ModelsEngLegal11:45

Buried on page 13: Fable is deliberately worse at frontier AI research

Anthropic added interventions limiting Claude's effectiveness on pre-training pipelines, distributed training, and accelerator design — aimed at actors who'd violate its terms (read: Chinese models distilling its work). NLW sees a dragnet catching legitimate researchers.

AI Daily Brief
ModelsEng13:15

It's the first publicly available model that I am explicitly not allowed to use for my work.

— Will Brown, Prime Intellect. Critics including Nathan Lambert and Dean Ball blasted not just the research limits but that they're invisible and undisclosed — "shockingly hostile," said Ball. SemiAnalysis claims models will secretly degrade output quality on interesting ML work.

The AI Daily Brief
ModelsExec13:30

You thought Anthropic was going to let Eli Lilly extract that and get the patent? The labs are going to do all of it.

— Tenebris, on X. The counter-camp says the pearl-clutching is naive — locking down capability and capturing the upside was always the plan. OpenAI staffer Adam GPT quipped, "Well, look at that. OpenAI ends up being the OpenAI lab."

The AI Daily Brief
EnterpriseLegalExec14:00

A 30-day retention requirement breaks the enterprise case

Mythos-class prompts and outputs are retained for 30 days with human review on every platform. Critics warn it could violate NDAs — especially with memory on, which pulls sensitive past chats into context. NLW expects this constraint won't last long.

AI Daily Brief
BusinessFinanceEng16:00

Actually solving the problem is token efficient, it turns out.

— John vs Malik, on X. Amid "I'll be out of usage in an hour" panic, others found Fable cheaper than Opus in practice: it costs more per token but one-shots far more often, so users burn less time re-prompting.

The AI Daily Brief
ModelsExecOps20:30

I kicked it off, went to a long lunch, and didn't have to do squat to steer it.

— Ali K. Miller. Ali K. Miller called Fable 5 an actual leap with high-performing models that can run 100-plus hours — reframing work away from a 9-to-5 of constant babysitting toward giving complex, goal-oriented prompts and aligning your org on what to kick off.

The AI Daily Brief
ModelsEngProduct21:30

One-shotting a Replit clone — and a Lovable clone in four prompts

Riley Brown reported Fable one-shotting "Replit Mobile," a Swift app that builds web apps, and rebuilding a working Lovable clone in two prompts. Skeptics noted a real company is more than an interface — but the speed was a genuine moment.

AI Daily Brief
ModelsEngProduct23:00

Custom 3D worlds, a humanoid robot, and the Boeing 747 benchmark

Matt Schumer said Fable "solved 3D world building" with custom Three.js in-browser. Jake Fitzgerald got a humanoid robot design after 2 hours and 1.4M tokens. Hugging Face's Victor said Fable did an "AGI-level job" on his Boeing 747 Three.js benchmark.

AI Daily Brief
EnterpriseEngExec25:30

Stripe: a 50-million-line migration in a day

Per Anthropic's launch post, Stripe reported Fable 5 compressing months of engineering into days — performing a codebase-wide migration on a 50-million-line Ruby base in a day that would have taken a team over two months by hand.

AI Daily Brief
EnterpriseSalesProductCS26:15

By the end of the call I showed a fully working product with the exact workflow they mentioned 15 minutes earlier.

— Todd Saunders. Todd Saunders had Claude transcribing a customer call in the background — and building the requested features in real time as the customer described them. Autonomous looped building triggered straight from a sales conversation.

The AI Daily Brief
Models27:15

Fable doesn't seem much better to me, but every 150-IQ person I know is like, the singularity came sooner than I thought.

— Citrini Research. Citrini Research captures the new dynamic: state-of-the-art no longer reveals itself across all tasks. The gains show up specifically in things that simply weren't possible before — visible to the people pushing hardest.

The AI Daily Brief
◆ The TakeExecOps27:45

The first model that disagrees — and updates without kowtowing

NLW's standout everyday win: in a strategic debate, Fable disagreed clearly, then updated its position on new information without collapsing into telling him what he wanted to hear. That alone is a massive upgrade for strategic ideation, a top real-world use case.

The AI Daily Brief
◆ The TakeProductEngExec29:15

NLW: Fable rebuilt three of his own products in single shots

In hours of unattended work, Fable rebuilt Superintelligent's audit input system on Whisper, the Agent Transformation Intensive site and platform, and turned AIDB's shareable-nuggets mockups into a real production pipeline. The takeaway: far less management, much bigger ambition.

The AI Daily Brief
ModelsProductEng34:00

Moving from giving AI tasks to giving it responsibilities.

— Felix Reisberg, Anthropic. Felix Reisberg, who leads Claude Code and CoWork, says a third era quietly began: from answers, to tasks, to loops. He no longer tells Claude to investigate a crash report — it watches every crash and its job is to keep the apps from crashing. He expects 2027 apps to look nothing like today's.

The AI Daily Brief
◆ The TakeExecOps36:30

The new skill: task imagination

With models that can run for days, NLW argues we'll all have to up-level ambition — and become token-efficiency optimizers who match models to use cases. Nate B. Jones's framing: most of us have nothing that's ever taken even an hour on AI, so the scarce skill is imagining tasks worth handing to a model that works for days.

The AI Daily Brief
Models38:45

Feeling pretty good about things.

— Thibault, OpenAI / Codex. Asked whether Anthropic had blown past OpenAI with three models in two months — and Fable isn't even its best — Codex product lead Thibault offered a confident, cryptic reply. NLW: we could be in for quite a week.

The AI Daily Brief
Machine-readable ▸Download .mdTranscript .md— feed it to your own agent

Got this from a colleague? Get the brief every day.