Fable is Back: Here's What You Should Try First

ComputeEngFinance▶ 00:00

OpenAI cut inference costs in half — sort of

The Information reports OpenAI found an optimization technique that halved inference requirements for existing models, letting it serve all signed-out ChatGPT users on just 100 GPUs. The technique wasn't disclosed; speculation ranges from quantization to query routing — and none of those improve larger models without quality trade-offs.

AI Daily Brief

◆ The TakeEng▶ 01:00

There's still no free lunch on inference

NLW's caution: this is likely a smaller breakthrough than the headline suggests, and it's being tested on OpenAI's least-engaged users — which could be a reasonable first step or a hint of quality-degradation risk. Don't treat it as a silver bullet for the compute crunch.

The AI Daily Brief

Compute▶ 02:00

“

This is a very important secret sauce for them that they don't even want to tell other OpenAI employees about.

— Stefanie Palazzolo, The Information. The Information's Stefanie Palazzolo argues OpenAI is holding the technique close because if it leaked, rival labs could quickly adopt it to lower their own costs.

The AI Daily Brief

Models▶ 03:00

“

There's nothing my mom actually asks of her AI products that needs to be done by the frontier.

— Everett Randall, Benchmark Ventures. Benchmark Ventures' Everett Randall's "AI mom test" captures why quality reductions may be tolerable for casual users — likely the exact segment OpenAI's new technique targets.

The AI Daily Brief

BusinessFinanceEng▶ 03:00

“

Five founders cut inference spend 75%+ with little effort, no performance change, better latency.

— Harry Stebbings, 20 Minute VC. 20 Minute VC's Harry Stebbings said founders from 10-person startups to a $200 billion public company all reported major inference savings in 24 hours: "The times they are a-changing."

The AI Daily Brief

ModelsEngProduct▶ 04:00

Base44 builds its own model to control costs

Vibe coding platform Base44 launched Base1, fine-tuning an open-source base model on hundreds of millions of user interactions — the same playbook as Cursor's Composer. The bet: a model that only needs to be good at building web apps can rival the frontier while giving Base44 control over cost, latency, and quality.

AI Daily Brief

BusinessExecProduct▶ 05:00

“

Owning more of that intelligence becomes just as important as owning the infrastructure around it.

— Shlomo, Base44. Base44's Shlomo frames the strategic logic: as AI becomes central to how software is created, controlling the model itself — paired with the harness — is a core moat.

The AI Daily Brief

EnterpriseSalesOps▶ 05:00

AWS drops $1B on forward-deployed engineers

AWS is investing a billion dollars in a new unit of forward-deployed engineers to help customers set up AI tools, joining OpenAI, Anthropic, Google, and Microsoft in the FTE race. It's upskilling salespeople into "solution architects" and focusing on healthcare, government, and financial services.

AI Daily Brief

EnterpriseFinanceExec▶ 06:00

“

Open weight models are definitely gaining traction — price performance, but also they service the task.

— Vasquez, AWS Frontier AI Engineering and Services. AWS's Vasquez noted the big enterprise shift toward AI budget optimization rather than just capability deployment, with open-source and open-weight models gaining ground.

The AI Daily Brief

EnterpriseOpsProduct▶ 06:00

Claude Tag is coming to Microsoft Teams

Anthropic told Microsoft it plans to bring Claude Tag to Teams. It's an organization-centric agent — not tied to any user — with persistent memory and tool access, and it actually calls the full Claude Code suite rather than plain Claude.

AI Daily Brief

EnterpriseExec▶ 07:00

“

The most exciting things in AI are plug-ins in Word or Excel... we have a structural position in knowledge work.

— Satya Nadella, Microsoft CEO. Satya Nadella framed third-party agents like Claude as reinforcing Microsoft's ecosystem — though NLW wonders whether the tune changes as Claude gets more endemic across knowledge work.

The AI Daily Brief

ComputeOpsExec▶ 08:00

SpaceX offers half-price Starlink to quell Memphis backlash

To ease controversy over its Colossus data centers — powered off-grid by 46 gas turbines running without a permit — SpaceX is offering greater-Memphis residents half-price Starlink and free hardware, and recommitted to a wastewater treatment plant. The US government recently intervened to shut down the turbines, calling Colossus a national security matter.

AI Daily Brief

◆ The TakeExec▶ 09:00

Applaud the direction, but go a lot harder than half-price

NLW is glad data center operators are starting to cut communities into the benefits — but says making residents become customers to get a discount is far too small a gesture. Still, encouraging signs that these relationships are being rethought.

The AI Daily Brief

PolicyExec▶ 13:00

Fable 5 is officially back after 19 days

Anthropic announced the Department of Commerce lifted export controls, restoring Fable 5 for all global paid subscribers starting July 1st. A short subsidy extension covers up to 50% of weekly usage limits until July 7th; after that, access requires purchasing usage credits.

AI Daily Brief

ModelsEngLegal▶ 15:00

Anthropic: the jailbreak exposed no unique dangerous capability

Anthropic said testing confirmed many less-capable models — including GPT-5.5 and Kimi K2.7 — could identify the same vulnerabilities Fable 5 flagged, and the reported technique only involved routine defensive cybersecurity work. It trained a new classifier claiming a 99% block rate, validated by Commerce's Center for AI Standards and Innovation as "extraordinarily strong."

AI Daily Brief

PolicyLegalExec▶ 17:00

“

This opacity will not lend itself well to a stable, investable, trustworthy industry over time.

— Dean Ball, policy advisor. Policy advisor Dean Ball welcomed Fable's return but noted no one knows what Anthropic actually did to make the model safe or how it applies to other models in the licensing queue. Still, he called a two-week review timeline "not insane" and real progress.

The AI Daily Brief

Policy▶ 18:00

“

The first rule of Fable Club is you do not ask too many questions.

— Miles Brundage. Miles Brundage captured the prevailing policy mood: even with lots of open questions about what exactly Anthropic agreed to, people are cautiously optimistic just to have the model back.

The AI Daily Brief

◆ The TakeExecLegal▶ 18:00

We're living with a framework that requires heavy judgment

NLW's read: there's a semblance of a practical framework now, but a lot of subjectivity goes into assessing risks. The best-case outcome is an efficient process with faster review for incremental updates — a bad outcome would be every capable release triggering the same weeks-long review and slowing the pace of breakthroughs.

The AI Daily Brief

ModelsEng▶ 20:00

Only a small fraction of coding tasks fall back to Opus

The announcement's line that "some routine tasks like coding and debugging will fall back to Opus 4.8" sparked backlash that the relaunch was "fake." The Claude Code team clarified the wording was loose: only a small fraction of routine coding and debugging tasks get flagged and reverted to Opus.

AI Daily Brief

ModelsEngProduct▶ 21:00

Anthropic ships Claude Sonnet 5 — its most agentic Sonnet

Pitched as nearly as good as Opus 4.8 for a fraction of the cost, Sonnet 5 sits a few points shy of Opus on major coding benchmarks but posted a big jump on GDPVal, hinting at strong end-to-end agentic follow-through. NLW suspects the timing means Anthropic didn't expect Fable 5 back so soon.

AI Daily Brief

ModelsFinanceEng▶ 23:00

Sonnet 5 can cost more than Opus — even more than Fable

Artificial Analysis found Sonnet 5 used ~40% more output tokens and ~3x the agentic turns of Sonnet 4.6; on the full benchmark it was more expensive than Opus 4.8 and even Fable. Without promotional pricing, it costs more per task than Opus — leaving many asking what it's actually for.

AI Daily Brief

ModelsEng▶ 24:00

“

You have to use it wildly differently... it's basically an automatic Ralph loop.

— Ben Davis. Ben Davis argued Sonnet 5 is good but inefficient and slow — spawning sub-agents, stacking PRs, reviewing itself adversarially. Others suggest the real framing is Fable 5 as the intelligent advisor and Sonnet 5 as the fast implementer running the sub-agents.

The AI Daily Brief

◆ The TakeExecMarketing▶ 26:00

Fable 5 blows the field away on strategic thinking

NLW's first-hand experience: unlike GPT-5.5 and Opus 4.8, which cave immediately under pushback and over-interpret instructions, Fable 5 accepts part of a critique while holding its ground on other parts — behavior he's never seen from another model. And it doesn't burn many tokens, so it won't chew through the 50% usage limit.

The AI Daily Brief

◆ The TakeMarketing▶ 27:00

Fable is far better at rubric-driven writing

Contrary to Every's vibe check, NLW found Fable 5 much better at instruction-following, with fewer AI-isms and less try-hard prose. His suspicion: when you have a clear rubric or examples of good writing, Fable meets that standard far better than prior models — even if it's not necessarily better at blank-page writing.

The AI Daily Brief

ModelsEngProduct▶ 25:00

A one-week playbook for using Fable without going bankrupt

Panjwani's advice: use Fable 5 for planning not implementation (delegate coding to GPT-5.5 via Codex), ask it for improvements on your most valuable projects, use other models to surface your hardest problems then have Fable propose solutions, and review Fable's output with GPT Pro. NLW agrees on the hard technical problems half but pushes hard on also using it for strategy and writing.

AI Daily Brief

Fable is Back: Here's What You Should Try First

By the Numbers

The Brief

OpenAI cut inference costs in half — sort of

There's still no free lunch on inference

This is a very important secret sauce for them that they don't even want to tell other OpenAI employees about.

There's nothing my mom actually asks of her AI products that needs to be done by the frontier.

Five founders cut inference spend 75%+ with little effort, no performance change, better latency.

Base44 builds its own model to control costs

Owning more of that intelligence becomes just as important as owning the infrastructure around it.

AWS drops $1B on forward-deployed engineers

Open weight models are definitely gaining traction — price performance, but also they service the task.

Claude Tag is coming to Microsoft Teams

The most exciting things in AI are plug-ins in Word or Excel... we have a structural position in knowledge work.

SpaceX offers half-price Starlink to quell Memphis backlash

Applaud the direction, but go a lot harder than half-price

Fable 5 is officially back after 19 days

Anthropic: the jailbreak exposed no unique dangerous capability

This opacity will not lend itself well to a stable, investable, trustworthy industry over time.

The first rule of Fable Club is you do not ask too many questions.

We're living with a framework that requires heavy judgment

Only a small fraction of coding tasks fall back to Opus

Anthropic ships Claude Sonnet 5 — its most agentic Sonnet

Sonnet 5 can cost more than Opus — even more than Fable

You have to use it wildly differently... it's basically an automatic Ralph loop.

Fable 5 blows the field away on strategic thinking

Fable is far better at rubric-driven writing

A one-week playbook for using Fable without going bankrupt

Fable is Back: Here's What You Should Try First

By the Numbers

The Brief

OpenAI cut inference costs in half — sort of

There's still no free lunch on inference

This is a very important secret sauce for them that they don't even want to tell other OpenAI employees about.

There's nothing my mom actually asks of her AI products that needs to be done by the frontier.

Five founders cut inference spend 75%+ with little effort, no performance change, better latency.

Base44 builds its own model to control costs

Owning more of that intelligence becomes just as important as owning the infrastructure around it.

AWS drops $1B on forward-deployed engineers

Open weight models are definitely gaining traction — price performance, but also they service the task.

Claude Tag is coming to Microsoft Teams

The most exciting things in AI are plug-ins in Word or Excel... we have a structural position in knowledge work.

SpaceX offers half-price Starlink to quell Memphis backlash

Applaud the direction, but go a lot harder than half-price

Fable 5 is officially back after 19 days

Anthropic: the jailbreak exposed no unique dangerous capability

This opacity will not lend itself well to a stable, investable, trustworthy industry over time.

The first rule of Fable Club is you do not ask too many questions.

We're living with a framework that requires heavy judgment

Only a small fraction of coding tasks fall back to Opus

Anthropic ships Claude Sonnet 5 — its most agentic Sonnet

Sonnet 5 can cost more than Opus — even more than Fable

You have to use it wildly differently... it's basically an automatic Ralph loop.

Fable 5 blows the field away on strategic thinking

Fable is far better at rubric-driven writing

A one-week playbook for using Fable without going bankrupt

Got this from a colleague? Get the brief every day.