# The AI Chart Everyone Is Getting Wrong — Transcript (2026-06-12)

https://aidailybrief.ai/e/2026-06-12 · Listen: https://pod.link/1680633614

---

[00:00:00] Today on the AI Daily Brief 

The shift from token maxing to token panic happened so quickly. I'm gonna explain why things or a lot different than a lot of the charts and analysis running around would make you think. Before that in the headlines, a preview of the upcoming SpaceX The AI Daily Brief is a daily podcast and video about the most important news and discussions in AI. All right, friends, quick announcements before we dive in

First of all, thank you to today's sponsors, KPMG, Section, Zencoder, and OutSystems. To get an ad-free version of the show, go to patreon.com/aidailybrief, or you can subscribe in Apple Podcasts. you wanna learn more about sponsoring the show, send us a note at sponsors@aidailybrief.ai, or you can check out the new aidailybrief.ai/sponsors By the way, one of the things that we have now on the new AI Daily Brief site, in addition to every episode having a whole page That organizes it into, easy to share chunks

Is a sponsors page

where you can go see all of the offers we've shared. like for [00:01:00] example, getting a free month of Bolt Pro

You can find all of that at aidailybrief.ai/sponsors

And while you're there, check out the rest. Send me ideas. We're gonna be adding a lot here. For now, though, let's talk about the first big AI IPO of the year 

Welcome back to the we have a bit of an exciting Friday today. After months of anticipation, SpaceX is conducting the largest IPO in history. Now, this has been one of the most hyped up events in markets for a very long time.

Investment banks have been battling it out for institutional sales, and the retail frenzy is already off the charts as as of close of trading on Thursday, Bloomberg reports that retail investors submitted more than $100 billion in orders. Yes, that is billion with a B.

Now, SpaceX was only selling $75 billion worth of stock and reportedly reduced the retail allocation from 30% to 20%. That means the retail allocation was almost 7X oversubscribed and would have been enough to fill the entire IPO by itself. The sale was priced at $135 per share, [00:02:00] a flat price set by SpaceX earlier in the process. That pricing implies a valuation just shy of $1.8 billion, meaning the company will debut as the seventh-largest company in the world ahead of Saudi Aramco, Tesla, and Meta Some anticipate the flat pricing will increase day one volatility as there was no price discovery mechanism in the IPO process.

And much of the commentary has already declared this a retail bloodbath waiting to happen, and possibly an obvious market top for the AI bull run. A rare opinion piece for Reuters declared, " There's a serious risk that investors piling into the world's largest IPO will get burned, especially the retail crowd."

The analysis focused on the relative lack of revenue for a company of this size. Their twenty twenty-five financials showed a five billion dollar loss on eighteen point seven billion in revenue. In contrast, Meta delivered two hundred billion in revenue last year, and even a Tesla that isn't at the top of its game managed ninety-five billion Even after SpaceX signed mega data center deals with Anthropic and Google over the past month They're well short of revenue numbers that put them up alongside those other companies.



there [00:03:00] also has been criticism around the way the company was marketed, with Goldman Sachs simultaneously conducting the IPO and providing wildly bullish research analysis. In a report last week, they forecast that SpaceX could hit $474 billion in revenue by 2030, with their AI division growing a hundredfold to some, this was less about plausibility and more about an analyst with a clear incentive, forecasting a bajillion dollars in revenue

There is also the sideshow of Elon Musk on the verge of becoming the world's first trillionaire. Based on Bloomberg's net worth calculations, Elon was worth just shy of seven hundred billion last month, with more than sixty percent of his wealth tied up in SpaceX. The IPO pricing would bring his net worth to nine hundred and seventy-one billion, so any significant pop would push him over the line

now many expect the IPO to be a bit of a circus. but one of the big questions is what will the implications be for the Anthropic and OpenAI IPOs to come? Some are seeing this as the first chance the US market has to price an AI model company

Meaning that if SpaceX does well, it could imply even greater valuations for the frontier labs. then again, there's also the [00:04:00] potential with that line of thinking that SpaceX puts in the market top, theoretically making it more difficult for OpenAI and Anthropic to get their IPOs out the door at a premium valuation.

d- I tend to disagree with this as the right way to look at things. Two big reasons why. The first is the simple one. You can't really apply anything around Elon Musk to anyone else love him or loathe him, he kind of operates in his own vortex.

and I don't necessarily think that people are going to read this as a referendum on AI models

as much as the pricing

on the Elon market halo

Secondly, though, I don't think that anyone is really looking at this as an AI model company. this-- I think that the 11th hour shift to SpaceX as a neo cloud totally changes the narrative equation. First of all, it adds a whole bunch of billions to their revenue stream that look a lot more durable and interesting.

And second, it makes their whole push to get data centers in space look a lot more aligned and frankly plausible

So yes, I do think that for some the IPO will be AI related, but it's not gonna be about models, it's gonna be about the infrastructure build out with a [00:05:00] very heavy dose of Elon on top. Now, as I record, it is still early, so markets are yet to open in New York

I will of course provide full coverage of whatever is important about the craziness to ensue on Monday's show

I think though, if you're trying to look for a single take that avoids the hyperbole from either side, economist Peter Atwater kinda nailed it when he said, " SpaceX has created an idiot moment for investors. Buy it and it goes down, you are an idiot. Don't buy it and it goes up, you were an idiot."

an idiot." now now speaking of very high net worth dudes, Jeff Bezos' AI startup Prometheus has closed their latest round of funding, valuing the company at a measly $41 billion. The The round raised $12 billion with participation from JP Morgan, Goldman Sachs, BlackRock, and Bezos himself.

Prometheus aims to build what they're calling an artificial general engineer, an AI system that can design and manufacturer, anything, including complex equipment such as jet engines

The company has already staffed up, hiring 150 people across offices in San Francisco, London, and Zurich In an [00:06:00] interview, Bezos said the goal was to, quote, " Empower engineers to make an invention

easier and faster so smaller teams can do much bigger things on much shorter time cycles. Asked about fears of an AI jobs apocalypse, Bezos dismissed the premise. He believes that AI will instead produce a labor shortage because, quote, " Even though you're shrinking the number of people needed by 10X, AI will create 10X more opportunities."

Bezos added, " There's going to be two-earner income households where one earner drops out of the labor pool because there's going to be so much productivity."

Alongside their plans to produce AI that can accelerate the entire pipeline for physical manufacturing, Prometheus is also looking at starting a fund for industrial buyouts Now, there is no new news on this front, but in March, The Wall Street Journal reported on talks to raise $100 billion 

dollars.

The fund would essentially take the private equity roll-up model and apply it to the manufacturing sector using Prometheus's proprietary technology to improve productivity

is, all in all, Bezos dismissed the growing pessimism around AI, claiming that view is the, quote, "opposite of reality."

all societal [00:07:00] wealth is driven by invention, he said. Six thousand years ago, somebody invented the plow, and we all got wealthier. Then, much later, somebody invented the steam engine, and we all got wealthier. What Prometheus seeks to do is to offer a set of tools that dramatically accelerates that invention loop

interested-- in-- Now for most people, what's interesting is the physical aspect of this. As Chubby points out, " The problem is that the physical economy can't be scraped. There's no internet of manufacturing data to train on, which is exactly why the reported hundred billion dollar vehicle tobuy up legacy industrial companies is interesting.

You don't find that data, you acquire the factories that generate it

Or as Dr. Singularity put it, " This is how the acceleration escapes the screen and enters atoms."

atoms." Next Next up, one brutal one. Meta has completed an operational split with Manus in compliance with orders from Chinese officials. Bloomberg reports that Meta has firewalled operations between the two companies. Manus staff are no longer able to access data systems, and Meta staff can no longer use Manus' tools for internal work.

Now, by way of recap, Manus was one of Meta's [00:08:00] flagship acquisitions as they reset their AI strategy coming into the year. they paid two billion dollars for the company just astwenty twenty, just as 2025 turned into 2026.

In March, however, the Chinese government opened an investigation into the deal and later barred Manus founders from leaving the country. Manus had attempted to circumvent Chinese tech export controls by first relocating operations to Singapore before courting the acquisition. In April, Beijing ordered the deal to be unwound despite the workaround.

This This now leaves Manus in a difficult situation, to say the least. Sources said the company is attempting to raise a billion dollars to fund a buyback, but it's unclear if there are any takers.

and while the product has seen updates since the separation was ordered, there is a heck of a lot less attention on Mana since the rise of open source harnesses like OpenClaw and Hermes, and just in general, the agentic push of the core harnesses like Claude Code and Codex

In China, the unwinding of the Manas deal has cast an absolutely chilling effect. Manas' strategy of decamping to Singapore before seeking foreign capital was a very common approach, which some even called the red chip corporate structure [00:09:00] With Beijing cracking down on Manas, the Chinese tech industry has received the message that those times are over.

The Financial Times reports that numerous prominent startups are looking to unwind their foreign corporate structures to reincorporate in China. Step Fun has completed the process in anticipation of a Hong Kong IPO, 



While Kimi creator Moonshot as well as Kling are considering doing the same.

Eugene Wang, an attorney at Shanghai-based Wintel & Co., said, whether to dismantle the red chip structure is no longer in question. The key is how to complete the restructuring as cheaply and efficiently as possible."

Unsurprisingly, the AI industry is a particular focus. Reports claim that Chinese officials are now seizing passports from key researchers and executives at private firms, which was previously beyond the pale. Both capital and talent are facing a major crackdown as Beijing seeks to secure the strategically important industry

industry Now over Now over in Chip Land, backlogs at TSMC are driving Google to consider Samsung for parts of their next-generation chips. The Information reports that Google is evaluating Samsung's two-nanometer process for some components of their tenth-generation TPUs, [00:10:00] codenamed Ice Fish. Until now, Google has exclusively used TSMC for the full manufacturing process.

However, the Taiwanese chipmaker has a years-long wait list, and expects they won't have the capacity to meet demand for quite some time. Customers then are beginning to look elsewhere. Earlier this week, it was reported that Google had placedorders with Intel for their twenty twenty-eight production run.

Google will still be using TSMC to produce the actual processors, but Intel will provide advanced packaging services to mate the processor to networking circuits

Sources said that Google could turn to Samsung for the memory input-output die, which marries the processor to memory chips. Basically, what's emerging is a complex supply chain where TSMC just produces the processor which requires the most advanced fabs. other companies, including Samsung and Intel, are increasingly producing less sensitive components

Now, similar to the Intel news from earlier in the week, this doesn't appear to be a case of people being dissatisfied with TSMC's quality. It is simply the case that long wait times are forcing chip makers to look elsewhere to keep up with demand

Meanwhile, private equity companies [00:11:00] continue to pile into data center investments as KKR KKR and and NVIDIA announce a ten billion dollar construction company. The new company is called Helix Digital Infrastructure and will feature private equity giant KKR and Kuwait Sovereign Wealth as capital partners.

NVIDIA will participate in the venture through the deployment of their chips and related infrastructure, and power company Vistra is attached to provide energy

Helix said they have 10 billion in committed capital and have disclosed that they will be a wholly owned subsidiary of KKR

Adam Selipsky, the former CEO of AWS, is attached to lead the new venture. In a LinkedIn post, he commented, " Data centers, power, and connectivity have all too often been built on separate tracks. That fragmentation hasbecome an industry-wide bottleneck. This is slowing down the benefits of AI worldwide."

Now, this is one of several similar deals in recent months, with Broadcom announcing a similar tie-up with Apollo and Blackstone earlier this week. At the same time, real estate firm JLL reports that almost half of data center projects around the country are being delayed

So is it the case that increasingly bringing chip makers, utilities, and capital providers together in a single vehicle will reveal itself to be the best way to ensure that all thecomponents come [00:12:00] together for a successful project? With Helix, we have another chance to see

see Finally Finally today, Goldman Sachs believes everyone is underestimating the AI infrastructure boom by a fairly wide margin.

Now, you wouldn't think most people would call the current forecast for AI CapEx spend conservative, but that's exactly what Goldman strategists led by Ryan Hammond have done this week. The median Wall Street analyst believes the AI industry will deploy nine hundred and twenty billion dollars to build AI data centers next year, rising from around eight hundred billion for this year.

According to Hammond's team, those are rookie numbers. In a research note they wrote, " Consensus 2027 hyperscaler CapEx estimates are too conservative." Their team now expects one point one trillion in AI spending for 2027 as a baseline scenario and one point four trillion in a bullish scenario. Now, their key assumption is that AI demand is still in the opening innings They expect to see token consumption increase twenty-four X through twenty thirty driven by the widespread deployment of agents.

Analysts wrote, " Higher input costs also put upward pressure on the nominal dollars [00:13:00] of CapEx required to support a given amount of token consumption." In other words, excessive demand will keep the pressure on supply chains, driving build-out costs even higher

Now you might be thinking that's a pretty bold claim in a week where many on Wall Street are focused on a corporate push to rein in token budgets. But Goldman believes that's just noise, with the signal being the expanding order books for the hyperscalers

Now, as you will see very soon

I am firmly in the Coleman camp on this one 

And there is in fact one specific new narrative on Wall Street

that I would very much like to take on now One of the most important AI questions right now isn't who's using ai, it's who's using it? Well,

KPMG and the University of Texas at Austin. Just to analyzed 1.4 million real workplace AI interactions and found something surprising. The highest impact users aren't better prompt engineers. They treat AI like a reasoning partner.

They frame problems, guide thinking, iterate, and push for better answers. and the good news, these behaviors are [00:14:00] teachable at scale.

If you're trying to move from AI access to real capability, KPMG's research on sophisticated AI collaboration is worth your time. Learn more at kpmg.com/us/slash sophisticated. That's kpmg.com/us/sophisticated. 

Here's a harsh truth. Your company is probably spending thousands or millions of dollars on AI tools that are being massively underutilized. Half of companies have AI tools, but only 12% use them for business value. Most employees arestill using ai. To summarize meeting notes, if you're the one responsible for AI adoption at your company, you need section.

Section is a platform that helps you manage AI transformation across your entire organization.

It coaches, employees on real use cases

tracks who's using AI for business impact and shows you exactly where AI is and isn't creating value.

The result, You go from rolling out tools to driving measurable AI value. Your employees move from meeting summaries to solving actual business problems, and you can prove the ROI. Stop guessing if your AI investment is working. [00:15:00] Check out section@sectionai.com.

That's S-E-C-T-I-O-N ai com. 

Coding agents are basically solved at this point. They're incredible at writing code. But here's the thing nobody talks about. Coding is maybe a quarter of an engineer's actual day. The rest is standups. Stakeholder updates, meeting prep, chasing context across six different tools, and it's not just engineers.

Sales spends more time assembling proposals than selling finances, manually chasing subscription requests. Marketing finds out what shipped two weeks after it merged. Zen Coder just launched Zen Flow work. It takes their orchestration engine, the same one already powering coding agents connects it to your daily tools.

Jira Gmail, Google Docs, linear calendar notion. It runs goal-driven workflows that actually finish

your standup brief is written before you sit down, review cycle. Coming up, it pulls six months of tickets and writes the Prep Doc.

Now you might be thinking, didn't open Claw try to do this?

It did, but it has come with a whole host of security and functional issues. which can take a huge amount of time to resolve. Zen Coder took a different approach. [00:16:00] SOC two. Type two certified. Curated integrations titer. Security perimeter. Enterprise grade from day one.



Model agnostic and works from Slack or Telegram. Try it at Zen. Flow free. 

This episode of the AI Daily Brief is brought to you by OutSystems, a leading agentic systems platform built for the enterprise. Organizations all over the world are building, orchestrating, and governing agentic systems on the OutSystems platform and with good reason.

OutSystems' open and unified platform allows teams to architect, deliver, and scale governed agentic systems with agility. Teams of any size and technical depth can use OutSystems to build, deploy, and manage AI apps and agents quickly and cost effectively without compromising reliability and security.

Without systems, you can rapidly launch ideas from concept to completion. It's the leading agentic systems platform that is unified, agile, and enterprise-proven, allowing you to accelerate growth, reduce operational friction, and deliver real enterprise impact with AI OutSystems.

Build your agentic future 

[00:17:00] Welcome back to the AI Daily Brief. Oh, friends, it's my favorite time of year It's that time when some new set of numbers, or in this case chart

Inflame everyone on Wall Street to go into an absolute frenzy with their AI counter narratives

proving to themselves finally that this time they're right and the bubble is about to burst

yes, the speed at which the investors have gone from token maxing to token panic is head-spinning And yet The chart in question, this chart 

the Silicon Data LLM Token Expenditure Index, as shared by Citadel Securities, shockingly, and I just mean shockingly, doesn't say what everyone on social media is saying it says



In this episode, I'm going to explain why this chart, which shows a big, scary downward line on something called the token expenditure index, has nothing to do with token demand, nothing to do with token volume, and nothing to do with actual token expenditure

which is not to say that there is an interesting signal there. It's just not the signal that [00:18:00] Wall Street is trying to look for. And why this matters to you, even if you are not an investor

is that the story that it is telling is part of the shift from the token subsidy era to the token scarcity era that we've been tracking, and does have some interesting implications for how we all build

all, all right, so let's talk about where this chart started to come online

It is obviously not coming into a vacuum. if you've been listening closely over the last couple of weeks, 

you've seen the professional investor class Really start to take notice of headlines like this one about Walmart capping usage of their internal AI tool because there was too much demand Or even more of Uber setting spending caps 

after it blew through its token budget in the first four months of the year

This is the natural follow-up to that 

And in this context, Citadel just published a research note called Tokenomics

chart that, the primary chart that it shares

is the one that I just mentioned before, the Silicon Data LLM Token Expenditure Index with that big scary downward line

This of course led, to the perhaps expected onslaught of social media commentators implying that this was [00:19:00] somehow some very scary and big deal

Failed crypto founder Mo Shaikh writes, 

Citadel is one of the most significant hedge funds, and they just dropped tokenomics, and it's not what you would've expected." 

that, that scary sentiment of course went viral, getting over a half million views

There were also endless AI slot posts like this one From Thierry from Arvy, who wrote, "Citadel Securities just put institutional weight behind what the AI bulls won't say out loud

When one of the most sophisticated trading firms on Earth starts writing about AI in the language of cost curves and rationing instead of limitless demand, the conversation has quietly changed. The hype was about what AI could do. The reckoning is about what it costs

Now, by the way, if you're asking me why I say that's an AI slot post, I would point you over to 

Another Twitter post, this time from Nicholas Mugali, talking about a related topic, OpenAI's plan to cut token prices, that, oh, weirdly ends with the same line. " The hype was what AI could do.

The reckoning is what it costs."

And then of course we have Zero Hedge

who have been nearly quivering with [00:20:00] excitement over a new Doom narrative to peddle Token prices down six days in a row, longest streak since January

Make that seven days, Token Price Index slide back to mid-January levels, fading much of the agentic frenzy of the past three months

of course they made their point explicit with a blog post called Tokenomics Equals Panic

and unfortunately it's not just the zero hedges. Real Vision's Andreas Steno Larsen writes, "This is the chart that everyone should be watching. If the token pricing rolls over, everything from the memory trade to the broader hardware and data center trade is over for this cycle, in my humble opinion.

The whole setup depends on this."

Now, as you might be able to tell by now, I am absolutely allergic to this sort of pattern of discourse When I see the whole setup depends on this 

dot, dot, 

dot

or someone loudly proclaiming something that is so obviously counter to the experience that everyone is having. 

suggesting that somehow the agentic frenzy of the past three months has now returned to some 

pre-agentic state.

I [00:21:00] immediately start to ask, what's actually going on here? What is the chart actually trying to say? Did Citadel Securities, who have been held up as evidence of all of this, actually even make any of the arguments that people are crediting to them

So let's talk about this chart first As I think it's clear from those posts, the implication that people are trying to suggest

is some combination of demand for tokens going down, volume of tokens going down, or, and this I guess is reasonable given that it's called the token expenditure index, the total expenditure on tokens going down. But that, it turns out, is not what this chart is actually trying to measure

And I can prove that to you by going to Silicon Data themselves, 

who took to Twitter to clarify

They write, "Our LLM token expenditure index should really have been named the token expenditure price index because it's an expenditure or usage-weighted average token price index. It tells you how much currently the entire market AI is paying for a million LLM tokens [00:22:00] irrespective of models. The naming might have led to some misinterpretations, as some seem to have interpreted the index as either the total volume of tokens used or the average price of tokens.

In reality, the index captures something much more subtle than either interpretation. It tells us the marginal willingness to pay for LLM models." Now, much credit to them for trying to clarify, even though the more dramatic assumptions would get their name out there more

Although I even disagree with what they say their chart is saying, as I'll come to in a moment

But the specific and important note is that what this chart is measuring has nothing to do with demand, nor total volume, nor total expenditure. What this measures is the average amount the market is paying right now for a million tokens So what this chart is actually saying, and this line decrease, is that in mid-June, the average price that the market was paying in practice for a million tokens had gone down from the peak at the beginning of June and was back at the level that it was paying around the beginning of [00:23:00] May

Let me say that again. The average cost of a million tokens that buyers were paying in mid-June, had gone down from a peak of what they were paying for a million tokens at the beginning of June, and was around what they were paying for a million tokens at the beginning of May

Now, as we'll discuss, there is some interesting signal there

But what it's not is signal, again, on anything related to total demand for tokens, total volume of tokens consumed, or total expenditure on tokens. It's just about the average price paid for a million tokens



now here's where I disagree with their assessment. They say it tells us the marginal willingness to pay for LLM models The idea being that if you see the average price paid for a million tokens go down, it means that some portion of the market is necessarily 

shifting their buying behavior from the most expensive basket of tokens to lower cost options

And I think that that's partially true, or at least that could be one interpretation of what this data is saying It could also be saying, [00:24:00] however

that the cost for tokens on offer

from the frontier have gone down. Now we know 

that 

that's not the case in this period, but as we'll discuss, that might be something that happens coming up soon

And more importantly Certainly for any listeners of the AI Daily Brief

It will come as no surprise

that companies appear to be looking for lower cost token options

so-- as I have loudly said on this show and elsewhere, I think every AI company is now in the token efficiency business. The equation is really simple. The shift from assisted to agentic use cases radically increases the amount of AI that companies use because of the real constraints of the physical world There are only so many tokens to go around.

As demand starts to outpace supply, the prices that people pay get high And companies which never had to think about token efficiency

or mixed basket models of some types of tokens for one use case and other types of tokens for other use cases now all of a sudden do That process is exactly what we've been tracking closely for the last couple of weeks.

And so in many ways, this chart just reflects [00:25:00] exactly what we've been talking about

The reason, however, that I think it's even a little bit less impactful than they think

even when interpreted correctly, and why I don't believe that at least in full, they are right to say that it tells us the marginal willingness to pay for LLM models has to do with the sources of their data

Or specifically the data they don't have. The Silicon Data LLM Token Expenditure Index does not measure anything about the average prices that people are paying directly to the major labs themselves. They have no insight, in other words, into the direct customer to OpenAI relationship or the direct customer to Anthropic relationship

Now, obviously on a percentage basis, 

the 

vast, 

vast majority of token expenditure is going to be direct to those companies. So what source of data could they possibly have?



the Silicon Data Index draws only from third-party token routers

" Wait," you might be sitting there saying, " You mean the token routers that people explicitly go to [00:26:00] use to get access to cheaper tokens?" Yes. The token routers whose entire purpose in the market is to route different use cases more efficiently to lower cost and better models for their particular need, bringing costs down

So this chart, which is neither about total demand nor about total volume, nor about total expenditure, but just the weighted average price of a million tokens is based on data from companies whose entire purpose in the market is to provide lower cost alternatives

My argument then is that this is going to greatly exaggerate

actual shifts in behavior away from high-cost frontier models and towards lower-cost alternatives

Which again, is not to say that there isn't signal. I think that one could view this as a really good leading indicator of where advanced AI users are oriented

In fact, I would argue

that we're likely to see some follow-on behavior over the course of [00:27:00] the next six to 12 months that looks fairly similar even from the companies that have their direct relationships with OpenAI and Anthropic. But what this certainly doesn't reflect is the average experience of buyers in the market right now.

It just simply doesn't

and to be clear, 

the points that Citadel is even making in this,

are much less bombastic than those who are screenshotting it all over X

I w- in fact, I would bet that if you go read the note



You'll probably agree with a lot of it, and a lot of it'll remind you of what we've been talking about here

The simplest version of the point that they're trying to make is that not all AI demand looks the same anymore and increasingly will be separated into different categories

put it as, they put it as a bifurcation in frontier versus everyday AI usage which sounds not dissimilar 

the discussion here a couple of days ago About whether consumer and normal ChatGPT usage should even be considered as the same thing as work AI



at no point does Citadel argue that the implications are some cratering of demand for the most expensive tokens. They write

We do [00:28:00] not think this implies that the frontier of inference-intensive AI will be abandoned, only that it is likely to be concentrated among a narrower set of firms with the balance sheets to absorb the compute costs, the research depth to deploy it effectively, and most important, the operating domain to scale the rewards from solving genuinely hard problems

Another way to put that is that they're arguing

that most of the most expensive AI is increasingly going to flow to the firms that can use it best



of, but in a world of token scarcity, where there's already not enough of the best AI to go around, doesn't that just sound like the market efficiently allocating the most expensive AI to the people who can use it the most effectively?

That's not an AI bubble popping, that's an AI market rationalizing

constr- by the way, despite the construction of that statement that was absolutely not written by an LLM. It just came out of my mouth. But maybe I'm spending a little bit too much time with the LLMs if now I'm talking like that

Now there's another really important part of this discussion. 

as people have been talking about all of these caps being set One [00:29:00] of the things that gets lost in the discourse is that those are the very most advanced firms

who have already consumed sufficient AI to get to the level of agentic usage where they would have to start putting caps. The vast majority of companies aren't even close to that

And here's one example

finance company Ramp tracks how their customers 

spend money on AI. They've been tracking it for quite some time now

And they note that as the share of businesses using AI approaches one hundred percent, the focus at their economics lab has started to shift to tracking the intensity of adoption. They are now tracking things like spend per employee

And that spend per employee is a really important number Right now, the top 1% of firms

Those who are fully AI pilled are spending about seven and a half thousand dollars per employee on AI

That's a lot, right? Those are the type of numbers where you're gonna see firms start to really ask what the ROI of that is or start to consider caps and more efficient [00:30:00] options. But again, that's just the top 1%. When you move down to the top 10%, that number comes all the way down to $610 per month

Which you will note 

is less than half of $1,500 monthly per employee cap. and the median firm in their index Right firmly in the middle of AI usage across all of Ramp's customers, who by the way are going to be more tech-savvy than the average business, AI spend sits at $11.38.

Not $1,138, $11.38

If we are looking at the market implications 

of a shift in the basket of token consumption towards lower cost options. We have to contrast that against total growth in token demand, and total growth in token volume. In other words, actual token expenditure

When the median company is still only spending eleven bucks a person on AI

the sheer amount of growth in total AI that will be consumed It is very [00:31:00] hard for me to imagine a scenario, at least in anything in the short or medium term, where the growth in the total amount of AI consumed does not massively, and 

I mean 

massively, outweigh Any shift in the balance away from the most expensive tokens to less expensive tokens

Put differently, if every firm followed Uber's example and set the cap at $1,500 per month The total increase in the market size for AI as firms go from $11.38 

per employee per month to $1,500 per month, is gonna dwarf any lost revenue on the other side because companies start to get more efficient

And what about these reports that OpenAI is considering drastic price cuts?

as a preemptive strike against Anthropic, who they think might also cut costs in a vicious price war for customers Will that tank revenue for the whole industry?

Maybe. But then you have to ask about token margins

Analyst Max Weinbach wrote, " If OpenAI does drop token pricing, this is likely because they've heard from customers they can't adopt AI at volume at the current pricing. [00:32:00] Margin is high now for served tokens. They could cut prices by 

like 

60% and still be profitable in my opinion."

Now Max isn't coming out of nowhere. While no one knows for sure the margins except the labs themselves Weinbach has done a lot of work to get to the unit economics of API tokens, and his estimates are pretty similar to a lot of the other estimates that I've seen which tend to guess something like 70% margins on API pricing 

for the most inference-intensive tokens



so summing up, The argument is not

that this token expenditure price index isn't a useful signal. It's telling a similar story to the one that we've been exploring here, and that I think will shape the next period of especially enterprise AI

But at the end of the day, all of these shifts look a lot more to me like markets doing what markets are meant to do and figuring out how to allocate scarce resources at the right price to different types of customers

Look, if you Look, if you take nothing else away from this, if you ever see someone end a tweet with an ellipsis, run in the other direction

AI-- For now, that's gonna do it for today's AI Daily Brief. Appreciate you listening or watching as always, and [00:33:00] until next time, peace 

​
