NewsOpinion

The problem with AI demand

CNBCMay 29, 2026

The AI industry is facing a broken demand signal, where token usage is being inflated by unsustainable practices like employee leaderboard gaming, runaway agent costs, and flat-rate pricing models that don't reflect real compute costs. Anthropic is the only major AI lab visibly adjusting its pricing model to reflect actual usage economics. This raises serious questions about whether the massive AI infrastructure buildout is sized for real, sustainable demand.

Summary

The video argues that the AI industry's demand signal is fundamentally broken, with only Anthropic actively pricing its services to reflect what real consumption actually looks like. The CEO of Anthropic, speaking on the Dark Cast podcast, suggested that competitors don't fully understand the financial risks they're taking, implying they're building products based on excitement rather than rigorous economic modeling.

At the core of the problem is token consumption — the basic unit of AI usage. While simple chat interactions cost only a few hundred tokens, the new generation of autonomous AI agents can run for hours, burning through millions of tokens unattended. This has led to major companies like Uber blowing through their entire annual AI budget by April, and Goldman Sachs Research reporting that companies are overrunning their inference budgets by orders of magnitude, with AI costs on track to rival full engineering headcount.

A secondary problem is the 'token maxing' phenomenon, where companies like Meta and Shopify are incentivizing employees with leaderboards based on AI usage volume rather than productivity outcomes. Nvidia CEO Jensen Huang exemplified this mindset by suggesting a $500,000 engineer should be consuming at least $250,000 in tokens. Critics note that when usage — not output — is the metric, employees will simply game the system by burning tokens without productive purpose.

Anthropick's response to the unsustainable flat-rate pricing model has been to cut off third-party tools exploiting unlimited subscriptions and to move enterprise customers to per-token billing. One estimate suggested a single $200 Claude Max plan could be costing $2,000–$5,000 in actual compute. This pricing correction, while economically rational, risks triggering demand pullback from consumers and companies that budgeted for fixed costs.

The video concludes with a stark warning: the entire AI infrastructure investment cycle — data centers, chips, and networking — is predicated on the assumption that token demand will keep growing. If a significant portion of current usage is artificial (leaderboard gaming, looping agents, or unsustainable budget burns), then the infrastructure being built may be drastically oversized for the real, sustainable demand that will eventually emerge.

Key Insights

Anthropic's CEO claims that competing AI companies don't truly understand the financial risks they're taking, suggesting they're building products because it 'sounds cool' rather than from disciplined economic analysis.
Goldman Sachs Research found that companies are overrunning their initial AI inference budgets by orders of magnitude, with AI costs projected to rival full engineering headcount in 2024.
Jensen Huang argued that a $500,000 engineer who does not consume at least $250,000 worth of tokens is cause for alarm — framing token consumption as a direct proxy for employee productivity and value.
Anthropic found that a single $200 Claude Max unlimited subscription could be costing between $2,000 and $5,000 in actual compute, leading the company to cut off third-party tools exploiting the model and shift enterprises to per-token billing.
The video argues that if a meaningful portion of current AI token demand is driven by leaderboard gaming, looping agents, or budgets companies can't sustain, then the massive infrastructure being built to serve that demand may be sized for an inflated number that won't hold.

Topics

AI token economics and pricing modelsUnsustainable AI usage and budget overrunsAI infrastructure investment riskFlat-rate vs. per-token billingEmployee AI usage gaming and leaderboards

Transcript

[0:00] The AI demand signal is broken and only one company is acting like it. >> I kind of get the impression that, you know, some of the other companies have not written down the spreadsheet that they don't really understand the risk they're taking. They're just kind of doing stuff cuz it sounds cool. >> That's the CEO of Anthropic on the Dark Cash podcast talking about the biggest spending cycle in tech history. His company, it's the only major AI lab pricing for what demand actually looks like. So, what's happening? Every time you use AI, it costs tokens. They're the basic unit of AI consumption. Every [0:32] question that you ask, every answer it gives back, every…

Full transcript available for MurmurCast members

View original source →

More from CNBC

Get AI summaries like this delivered to your inbox daily

The problem with AI demand

Summary

Key Insights

Topics

Transcript

More from CNBC

How China came to dominate the global EV factory boom

Americans keep getting priced out of fun. Here's how it impacts us at home

Will The New Housing Bill Lower Housing Costs?

AI’s Next Race: Cost, Control, and Compute

Why Ford is betting $5 billion on EVs

Get AI summaries delivered to your inbox