# The Next Wave of Enterprise AI — Transcript (2026-06-03)

https://aidailybrief.ai/e/2026-06-03 · Listen: https://pod.link/1680633614 · (ad-free transcript; timestamps may run earlier than the with-ads edition)

---

[00:00:00] Today on the AI Daily Brief, the next wave of enterprise AI is upon us. Before that in the headlines, the very confusing and weird process around the latest AI executive order. The AI Daily Brief is a daily podcast and video about the most important news and discussions in AI.

All right, friends, all, thank you to today's sponsors Welcome back to the AI Daily Brief headlines edition, all the daily AI news you need in around five minutes. today we begin with the latest in the saga of this Trump AI executive order



This is just one of the absolute strangest policy processes I've seen

So what's going on? How did we get here, and what was actually signed?

First of all, by way of context, the reason that this is coming up at all is a couple parts



firstly, there are some very, very different and contentious groups when it comes to AI, including in Trump's own coalition

Republicans like Governor DeSantis in Florida



as well as very loudly former presidential advisor Steve Bannon



have been squawking quite [00:01:00] loudly about AI and more broadly decrying Trump's close alliance with the technology industry for some time now



and yet the specific catalyst for this new round of policy discussion

was the cyber capabilities of Anthropic's Mythos model



So the executive order we started hearing about a few weeks ago seemingly had something to do with labs needing to give the government access to their most advanced models before actually releasing them Indeed, that was the core policy of the draft that was circulated two weeks ago

That seemed at the time like a done deal. A signing ceremony had been scheduled. A who's who of tech CEOs had been invited to attend. However, hours before the event, President Trump pulled the order, stating, "I didn't like certain aspects of it," and adding that he thought that it would get in the way of the US lead over China in the AI race Now, it later surfaced that former AI czar David Sacks had intervened at the 11th hour, placing a call to the president to talk him out of signing the policy, at least for now



the order that was signed this week is substantially the same as the draft order that was scrapped a couple of weeks ago. Both versions of the order made safety testing [00:02:00] voluntary, although in the current climate, that's not all that meaningful a distinction.

All major AI labs have agreed to submit advanced models for testing And while some White House personnel were reportedly pushing for compulsory testing, it appears that that position never made it into a draft indeed it seems like the only significant change is that companies are encouraged to make their models available 30 days prior to public release, as opposed to the draft order, which had asked for a 90-day period.

It was It was that 90-day period more than anything else that triggered industry backlash for its potential to significantly slow down the release cycle

Neither version of the order provided any mechanism for the government to block a model's release. fact, one subtle change in the new version is aninclusion of a disclaimer which reads, " "Nothing Nothing in this section shall be construed to authorize the creation of a mandatory government licensing, pre-clearance, or permitting requirement for the development of new AI models."

This This sounds like a direct response to the critique that many had had that what the White House was doing with this executive order was a de facto licensing regime

Functionally, however, Functionally, however, the policy just allows the government to assess new capabilities before they're available to the public

[00:03:00] The NSA has been assigned primary responsibility for model testing with support from various cyber technology and defense agencies. In addition to safety testing, the order establishes a cybersecurity clearinghouse run by the Treasury in consultation with the NSA, the Department of Homeland Security, and the Cybersecurity and Infrastructure Security Agency.

There's also provisions instructing civilian and military agencies to harden systems against AI-driven cybersecurity risk

Outside 

Outside of the 90 to 30-day switch, the other biggest difference with this version of the order is the way that it was presented to the public. Rather than a Rather than a high-profile signing ceremony, the order was signed in private with zero fanfare I don't think it's an unreasonable interpretation

To see this as the administration treating this sort of AI safety regulation as some version of eating its vegetables



rather than the Big Mac or the well-done steak that it would prefer



the the order does contain some language reaffirming the administration's commitment to AI acceleration

Talking about a commitment to the United States AI global dominance

But ultimately this is the sort of Rorschach test policy that gives everyone something to comment on, everyone something to claim victory around, while ultimately doing very [00:04:00] little



The The New York Times reported the order as in their words, "signaling a shift from the hands-off approach the White House had previously taken toward AI



The The White House Office of Science and Technology Policy, however, called this lazy and inaccurate reporting from The New York Times



trying to draw the line in the sand



around the difference between oversight and voluntary sharing. the EO they wrote creates a process for frontier labs to voluntarily share cutting-edge cyber models in order to secure critical infrastructure and strengthen the government's own cyber defenses.

We are not conducting oversight of all new models as that level of government overreach would have chilling effects on free speech and innovation



David David Sacks himself chimed in to explain that the policy is intended to only cover models that, in his words, represent, quote, "Meaningful step change in cyber capabilities, i.e. Mythos, not incremental changes to existing models like Opus 4.8."



He He also took the chance to comment on the slippery slope argument, writing, " "I understand I understand the concerns of many that this could morph into an FDA for AI. Of course, bureaucratic mission creep is always a danger, and this should be closely monitored, but the EO expressly forbids the creation of a new licensing pre-clearance or [00:05:00] permitting regime."

From From the labs, all of the commentary was pretty similar. clearly talking points were circulated, s- suggesting they all use the language of this being a, quote, "important step

Former White House advisor Dean Ball was concerned about the implications of this first step, calling it a, quote, "Fairly major win for the safety contingent within the administration and a significant loss for the SACs accelerationist wing."

Now Dean, despite Zach's assurances to the contrary, thinks this heads in exactly one way He writes, " This is clearly teeing up the infrastructure for a model licensing regime, and the fact that the administration is classifying the details of how this voluntary system will work is egregious."

The public and the employees of the labs have a right to know how this works. Most lab staff don't have clearances, but if the literal regulatory thresholds that trigger pre-deployment review are classified, researchers themselves won't know whether what they are training is regulated by 

the CEO. All for a benefit that is barely articulable? what exactly is the intelligence community going to do in 30 days to make the model safer? It's not a huge mistake, but a small to medium-sized one. But I am fairly confident this is a mistake nonetheless

And again, while Saxe says this isn't a [00:06:00] step to more

The more safety-minded certainly see this as a crack in the Overton window that they can pry open

Right-wing pundit Steve Bannon said, " "For the For the first time, it's on a piece of paper, a structure and a process. That process is still pretty ill-defined. It doesn't meet our requirements. But as I tell people, we're going to eat the elephant one bite at a time. I strongly believe we're heading towards mandatory within the next couple of We intend to ramp up the pressure campaign."



and showing how weird AI makes different political bedfellows



Bernie Sanders seems to want the same thing as Steve Bannon, saying, " After calling efforts to regulate AI foolish, Trump finally acknowledged AI poses a real threat. That's the good news. The bad news? His executive order is voluntary and does almost nothing to protect Americans.

Congress must act."



ultimately, the reason that everyone is speculating around what the implications are

is that the order itself doesn't do all that much the AI labs already have agreements in place to share new models with the government ahead of release David Remler from the Center for a New American Security said that the order, quote, "effectively formalizes what has already been happening between the US government and the leading AI companies."

and so speculation about what comes next [00:07:00] is the natural next place to go

Certainly there will be a lot to watch here, But for now we actually do still have a few more headlines

One of them from the very project which got this whole ball rolling, which is Project Glasswing. The specific way, of course, that Anthropic is releasing their Mythos model. Anthropic has just expanded access to Mythos, adding 150 new partners

With this new announcement, Anthropic is rolling Mythos out to firms across fifteen countries, with the expanded group of partners including new sectors that weren't covered by the initial project, including energy, water, communications, healthcare, and computer hardware

Writes Anthropic, " What each partner has in common is that a successful attack on their code base could be catastrophic. For most partners, we estimate that a major attack could affect more than 100 million people with important ramifications for both global and national security."

Now, Now, the announcement also included some further discussion of a public release. You might remember that during the rollout of Opus 4.8 last week, Anthropic said that they expect to have a Mythos-level model ready in the coming weeks. In Tuesday's Glasswing update, they wrote, " We're working as quickly as we can to safely release Mythos-level capabilities and general access.

To do so, we'll need highly robust safeguards that prevent the model [00:08:00] cyber capabilities from being misused, safeguards that we, and to our knowledge, all other AI developers have yet to develop. Because Because cybersecurity has both helpful and destructive uses, making safeguards that are both strong and precise enough is a major challenge."

Which to me honestly kind of feels like walking back the language that they had used In the Opus 4.8 announcement All the way back then last Thursday? They said it was coming in the next couple of weeks



but now they're saying they need infrastructure that doesn't exist. Honestly, the messaging is about as confusing as the government's executive order

Meanwhile, the information checked in with some of the teams working with Mythos Finding that although the model is powerful, it is also eye-wateringly expensive

Most of the testers are finding themselves running through millions of dollars worth of tokens very quickly. And what's more, for now, Anthropic is subsidizing use, so firms aren't even paying the full cost

At the same time, it also appears valuable enough to justify that cost with many of the firms that the Information talked to saying that they're basically aligning their budget so that they can build their strategy around Mythos when it becomes more broadly available

And And lastly today on that, same theme of the token shortage, SK Hynix now plans to double their manufacturing capacity for memory chips

[00:09:00] to help address the global shortage

This year's rapid growth in token use has led to shortages throughout the AI supply chain, with one of the more prominent shortages being in memory chips. The cost of high bandwidth memory for AI servers has more than doubled so far this year. And up until now, memory manufacturers have been reluctant to build new plants to boost supply.

In the past, cyclicality in chip pricing has punished long-term investment, with new plants often missing the window of peak demand. SK Hynix's new plan suggests that they now view AI-driven demand as a structural change, and they're deploying capital to take advantage of it

Now, to Now, to be clear, they're talking about doubling capacity by the end of the decade, so this is unlikely to do much for the chip crunch in the short term

Indeed, Chairman She Te-yuan told reporters that the shortage could last until 2030

And yet still the deeper investment is the right policy With Chairman Hsieh arguing, " " The The whole AI industry needs to be more sustainable. We have to continue to grow, but sudden jumps in price can become a problem and actually hurt sustainability."



So lots of big movement today, but that is going to do it for the headlines. Next up, the main episode 

Welcome back to the AI Daily Brief



[00:10:00] yesterday we had two dueling events, both focused on enterprise AI

One was from OpenAI and one from Microsoft



and they each provided in their own way

some indications of where enterprise AI is currently and where it's headed

Now the context for this, of course, Is the broader shift we've been discussing on this show of moving from the subsidy era of AI to the scarcity era of AI or the token shortage era of AI

The basic idea is that as we move from assisted to agentic workloads



the sheer quantity of the AI tokens we use goes up

And we're now running into the limits of what the available compute and physical infrastructure can produce

Meaning that business models are realigning

Costs are going up and everyone's scrambling to figure out how to adapt to all of that. in the in the meantime though

Part of what makes this challenging is thatit's not at all clear to most organizations

how to best use this new set of tools



In other words, the question of enterprise AI adoption is not just a question of costs



but also one of tool and use case fluency

And And increasingly, [00:11:00] enterprise users are living inside the power tools like Claude Code and CoWork and OpenAI's Codex

Interest in Codex has been surging for a while

With Google searches for Codex actually spiking past Claude Code for the first time in May



The The Information wrote about how the quote, "vibe shift on Codex hasbeen palpable

And OpenAI's event yesterday centered on a set of new updates for Codex that are all about it moving out of the strict realm of the developer into the broader world of knowledge work Now, alongside the event, a, OpenAI released a report called The Next Era of Knowledge Work

And the TLDR on the Was not only that Codex was growing, hitting 5 million weekly active users But that the biggest source of its growth was not developers, but non-technical knowledge workers who are now adopting Codex at a three times faster pace than developers are

And one of the things that's interesting about the report



is that it's not just a bunch of reported stats, but actually shows quite a bit of the design philosophy and the first principles understanding that's going into how OpenAI is thinking about Codex

One of the central themes is what OpenAI calls a strange [00:12:00] abundance " Modern workers," they write, "can produce documents, messages, dashboards, models, and presentations faster than ever, yet they spend a remarkable share of their time looking for context, reconciling conflicting versions, waiting for responses, and moving information across systems."



they point to a McKinsey that found that the average knowledge worker right now spends more than a quarter of their workweek managing email And almost a fifth of it looking for internal information or trying to find people who can help within their company around some specific task

Overall, they say three frictions define the daily cost of knowledge work

The first is the cost of finding relevant inputs

Across, as they put it, sprawling, untransparent systems

Second is information coordination costs

and third are approvals and verifications

In fact, they argue that these frictions are what accounts for the delays between a new technology being introduced and it actually showing up in the productivity statistics

Knowledge work, they write, is still waiting for its factory redesign. Previous generations of workplace software lowered the cost of producing artifacts, but did not reduce the attention required to consume them. Email made correspondence cheap, then multiplied [00:13:00] correspondence. Docs made drafting cheap, then multiplied drafts and review cycles.

The result is an excess of documents and tools and even scarcer time and attention



and you might be seeing where they're going with this. Codex, they write, is that factory redesign

So what are they seeing in how people are actually using Codex? First of all, everyone is producing artifacts 72% of knowledge workers using Codex are producing some sort of artifact, be it a PDF or a spreadsheet or something else, on a weekly basis

outside of coding and software engineering-related They're also doing research 41%, data analysis 27%



as well as implementing what they call business function workflows at 15%.



importantly though, people are doing a lot of these at the same time. " The most consequential shift in behavior," they write, " is towards parallel tasks. Roughly 50% of users now have more than one Codex task running simultaneously at some point during the up from less than one-third in mid-April."

The shift, they write, from sequential to parallel use is what lets a single knowledge worker operate at the scale of a small team. One turn to inspect a dataset, another to draft a script, another to assemble a report, another to check an [00:14:00] application. The user becomes the orchestrator of work streams rather than executing a single task at a time



So what goodies did we actually get?

The three highlights are annotations, plugins, and sites



Annotations are effectively a more precise way to interact with context Within Codex, when you're looking at a specific document or artifact You can highlight rather than having to explain with words the specific part of the document that you wanna discuss or query about or change.



you can use the annotations feature to select just that part of the document for the model to reason over within Codex

Simon Smith from Click Health wrote, " "You can You can already use annotations to give feedback on websites, but now it looks like that interaction model is expanding across outputs."

I love working with Codex by selecting things in the preview pane, adding them into chat context, and then talking to Codex about them. This makes that way of working more powerful



Next up was an expansion of Codex plugins

Now previously, plugins were a way to connect specific software into the Codex ecosystem

But with this latest update, Codex is adding a set of role-specific plugins for common functions including sales, data analytics, [00:15:00] creative production, product design, public equity investing, and investment banking

Now, Now, given the IPO horse race dynamics and competitive storyline between Anthropic and OpenAI, this is the update that a lot of mainstream media focused on as it resembled to them Anthropic's strategy of releasing a set of tools for specific functions and industries as well



in in their announcement, OpenAI writes, " writes, Each role-specific plugin bundles the relevant apps, skills, instructions, and workflows." Across these six new function plugins

they include access to 62 apps and 110 skills. Basically about 10 apps and 20 skills per role

You can almost think about the role-specific plugins

as organized bundles of features that were already available but presented in a way that requires much less setup

Another way Another way to think about it is that this goes a long way to productizing best practices You can You can think about it kind of like this if you took the best user across each of these six functions from a wide variety of companies, and you looked at the app integrations and skills they most often drew from



and then turned those average set of skills and app plugins into a bundle, That would [00:16:00] effectively be what Codex is releasing here

And interestingly, this becomes sort of product-led education

Where the salesperson, for example, who now has accessto the plugins and skills that are used by the salespeople who are best at getting the most value out of Codex



can start to imitate those best practices byvirtue of what's being presented in this functional plugin

Simon Smith again notes, " plugins seem to follow what Anthropic is doing with plugins focused on different business domains. But what's interesting is that OpenAI plugins seem like they can do more than provide instructions and connectors.

They can add interactivity inside the Codex preview pane, like buttons and guided actions that make powerful workflows more clickable."

Still, the update that I'm most excited about and one which at some point I might do an entire operator episode on, is the new sites feature

My guess is that a lot of you have had the experience at this point



of realizing as you're going about your normal work, that something that you might previously have as some sort of static document



might now be better suited



To presenting as some sort of small website



for example, instead of some PDF presentation, maybe you just send them a URL



It's easier to share because they don't have to download [00:17:00] anything. Plus you can update it as makes sense

Codex Sites productizes that type of behavior



it allows you to turn any sort of artifact that you've built inside of Codex into a full website or web app that's shareable with your team

They They give the example of a revenue forecast planner

That represents a sort of much more interactive way to look at budget planning than a traditional spreadsheet or presentation might have been An event operations dashboard

And a product launch hub. With both the event operations dashboard and the product launch hub

representing a way to keep track of operational progress With a highly customizable set of inputs

Rounding out his analysis of these three, Simon Smith again writes, " Sites are kind of like Claude artifacts, but on steroids, but onthis puts vibe coding even more directly in the hands of everyone in an organization. You can build stuff, share it, deploy it, and importantly, do it in a more secure way, which has been a real issue with some internal vibe coded tools."

now I think that Simon's analysis is right, but I think this is where we have a terminology problem. Part Part of why vibe coding never really fit for this type of use

is that these sort of sites are effectively disposable software and web apps. They're meant for a [00:18:00] specific purpose for a specific set of time

And the only thing that they have in common with software engineering is that they use code to deliver an output. But this isn't non-coders all of a sudden becoming product designers and engineers and trying to get in on the product building game

This is people using code in websites

to improve, how they share things and collaborate with colleagues



my argument would basically be that in the same way that building a slide deck or writing a document or interacting with a spreadsheet is a core knowledge work primitive, building websites and disposable web apps is also going to be a core knowledge work primitive going forward.

Codex Sites is a hyper-simple version of that experience that's going to make that primitive much more accessible to a large number of people

Like Like I said, I actually think that sites might be deserving of an entire operator's episode to dig into different types of things that people might be able to do with it



but if you had just one thing to play around with In the short term, that's where I'd be looking



it's very clear that OpenAI and the Codex team see the Codex app as the new interface for knowledge work, and are going to continue pushing to figure out all the implications of what you can do in this new type of environment

But as I said at the beginning



the question of the [00:19:00] next phase of enterprise is both one of interface, which we've been discussing with Codex, but it's also one of efficiency and cost management

Uber, a company that has somehow found itself in the headlines as exhibit A in the changing tides of agentic AI, has now put a $1,500 monthly cap across token spending for all employees

Now, I have a lot more to say about what I think does and doesn't work about that strategy, but we'll save that for another episode. The point for us today is that cost management isthat another vector of the next wave of enterprise AI is going to be cost management. 

And interestingly, that seemed to be at the core of of the announcements from Microsoft Build

Nominally, the big announcement was seven new Microsoft AI models. Image 2.5, Image 2.5 Flash, Transcribe 1.5, Thinking One, Voice Two, Voice Two Flash, and Code One Flash



a family of models that were optimized around different sets of use cases



and certainly just like any other time that we get model releases

There was a bunch of discussion of the benchmarks



the headliner was Mai Thinking One, a one trillion parameter model using a mixture of experts architecture for inference optimization



that [00:20:00] Microsoft tried to place as a model somewhere in the Sonnet 4.6 to Opus 4.6 type of range

Now Now to some, the discussion was just about Microsoft making progress in the model training game at all.



wrote Sean Wang, " "You have You have to give Microsoft props for training all these in-house models from scratch and getting all of them to near state-of-the-art. Mustafa Suleyman built a full-fledged Neo lab inside Microsoft in two years. That Microsoft now fully controls from chip to model to harness Absurdly impressive

Prime Intellect's Elliot Bakoush

Writes that Thinking One uses zero synthetic data or distillation from previous models Quote, " This means reasoning, agentic behavior, tool use are all learned fully during post-training with no cold start. Bold choice that makes it harder and requires more iterations to reach state-of-the-art, but you get full control over your model series, and it proves they are serious about being a frontier lab."

Ethan Mollick lamented the fact that no one really has gotten their hands on these things

so we're just left to squint at the benchmarks, which themselves are confusing. He writes, " It's difficult to know how good MAI Thinking 1 is from the scores alone, like weirdly low GPQA in Terminal Bench 2.0. [00:21:00] But Microsoft makes it really hard to try its models upon release, so I don't know



others like Leaker I Rule the World

poo-pooed the releases saying, " In case it's unclear, the Microsoft model isn't competitive, particularly not for anything agentic

And indeed thinking one scores on the agentic coding tests like Terminal Bench 2.0 and SuiteBench Pro were meaningfully lower than competitors even one generation ago from Anthropic and OpenAI

But But go one step deeper and it's quite clear p- that Microsoft is playing a different game

I believe that they have very clearly identified cost optimization as an issue



and believe that their approach can be part of the answer. In his announcement post, Mustafa Suleyman wrote, " All of this is the foundation for Microsoft Frontier Tuning. it lets you customize our models to create custom company-specific agents that only you control Early adopters are already seeing a difference.

When we tuned our models for McKinsey's tasks, MAI delivered the highest win rate, outperforming GPT-55 on quality while being 10x lower on cost

On stage, Microsoft CEO Satya Nadella called this a pretty significant shift. He said, " We believe the [00:22:00] time has come for every company to just move from consuming a frontier model to fully participating at the frontier in the frontier ecosystem."

In In other words, I don't think that we should be looking at this series of models

completely in raw terms as something where one of us as listeners is going to decide to fire up MAI Thinking 1 instead of GPT-5.5 or Opus 4.8

Instead, they are very self-consciously being positioned as part of an overall strategy to not only get state-of-the-art performance, but to do so at a lower cost. And given that Microsoft already has the strongest distribution inside the enterprise of any company

Their play here is worth taking seriously

If If you wanna simplify it, when it comes to enterprise AI, the second half of 2026 is going to be about wrestling into a workable cost-effective approach all of the opportunities that the first half of 2026 unlocked In different ways, both OpenAI and Microsoft



showed off big plays yesterday to those ends

And I certainly don't anticipate that's the last we'll be hearing about And it's very clear that the race for the next wave of enterprise AI [00:23:00] adoption is fully on. For now, that's gonna do it for today's AI Daily Brief. Appreciate you listening or watching, as always. And until next time, peace 

​
