Research

09 Feb 2026

5 minute read

Navigate the economics of AI

How tokenomics is reshaping AI costs and ROI

AI computing demand is surging along with AI costs. But spend is no longer linear or predictable. It’s volatile and complex. The economics of AI require a new currency: tokens. Learn what they are, how different agentic models influence pricing, and detailed strategies to optimise your token use for maximum competitiveness.

What’s behind today’s AI token economy?

Recently, NVIDIA’s CEO said that AI computing demand will surge by a billion times.¹ Google now processes 1.3 quadrillion tokens per month a 130-fold leap in just over a year.²

AI has become the single fastest-growing line item in corporate technology budgets, consuming a quarter to one-half of IT spend³ at some firms. An increasing number of CIOs in commercial enterprises has already seen this trend as their cloud computing bills shot up 19% in 2025,⁴ with Generative AI taking center stage. At the same time, geopolitical uncertainties are intensifying calls for data sovereignty and technology infrastructure independence, leading enterprises to consider alternate structures.

This is not merely an operational choice for chief information officers. It is a strategic reckoning for chief financial officers, boards, and investors.

Get the full report

Explore token pricing benchmarks, model efficiency strategies, and infrastructure approaches to optimize your AI costs. Learn about AI factories—and the truth about when it makes sense for an organization to build its own.

The new AI currency: Tokens, models, and value

What are AI tokens and why do they matter?

A token is the fundamental unit of AI work. It’s a small chunk of data: characters of text, pieces of an image, or a slice of audio that an AI model processes. Applications run on tokens. Every interaction—whether training, inference, or reasoning—is measured in tokens.

Unlike prior waves of technology investment, AI spend is volatile and nonlinear. AI costs rise not just with adoption, but with:

Workload type
Model complexity
Infrastructure intensity.

This new reality is less about total cost of AI ownership and more about navigating AI economics with precision: tracking, predicting and optimising token use. Tokens are the true unit of value. In the AI economy, they are the currency that translates opaque infrastructure decisions into tangible economic terms: what it costs in AI consumption to generate a dollar of revenue, margin or productivity.

Organisations that address these economics will differentiate themselves not just by adopting AI, but by efficiently converting tokens into enterprise value. Those that don’t risk AI costs drifting into endless experiments—with the very real downside of locking in tech debt with no predictable return.

How to navigate the new economics of AI tokens

AI cannot be governed by the same cost models that guided past technology waves. Traditional frameworks—total cost of ownership, per-user licensing, or static virtual machine pricing—were designed for predictable workloads and stable consumption patterns. AI is different. Workloads scale in nonlinear ways, consuming resources at unpredictable rates, and AI costs are ultimately measured not in licenses or cores but in tokens.

Tokens have become the true unit of AI economics, the common denominator that reveals:

What organisations are paying for
How efficiently they are consuming it
Where value is (or isn’t) being created

This requires a paradigm shift—where a CIO may need to think like a CFO, while a CFO may need to think like a CIO.

The implications for C-suite leaders are profound. Without discipline, AI costs drift quietly upward, hidden in SaaS renewals, spiking in unpredictable API bills, or locked into infrastructure commitments that can’t be unwound. And unlike prior technology cycles, where budget overruns were frustrating but manageable, AI overspend directly erodes competitiveness.

How do AI agent adoption models affect token pricing?

Packaged AI solutions (e.g., Agentforce, ServiceNow Copilot, Workday AI) abstract tokens entirely. Leaders see a predictable subscription or per-seat fee, but little transparency into token consumption efficiency. The risk is overpaying for simplicity.
API and model consumption (e.g., Anthropic, OpenAI, Together AI) make tokens explicit. Every query is metered, billed and exposed. This brings transparency, but also volatility: costs rise based on workload design, prompt length, and hidden choices of infrastructure providers. AI costs go up due to a token meter running in real time.
Self-hosted AI factory (internalised token costs). Tokens are the output of decisions about open-source model choices (e.g., Mistral, Llama, etc.), GPU architecture, network interconnects, storage tiers and energy contracts. This path demands high capital expense and skilled teams but grants unmatched control over long-term unit economics and data sovereignty. Large-scale adoption of this archetype is typically referred to as an "AI factory."

How are AI tokens priced?

There are three major factors to token pricing:

The underlying AI tech stack
How it is hosted and consumed
What type of model and level of customisation is required to power the solution

Every token processed by a model reflects a cascade of infrastructure decisions. For package buyers, these costs are hidden. Costs are abstracted, bundled into familiar enterprise contracts and vendor managed across every layer of the tech stack, which make unpacking your total cost of ownership challenging.

End notes

Steven Rosenbush, "AI Economics Are Brutal. Demand Is the Variable to Watch.", The Wall Street Journal, October 14, 2025, accessed November 18, 2025.
Rosenbush, "AI Economics Are Brutal. Demand Is the Variable to Watch."
Tim Smith, Gregory Dost, Garima Dhasmana, Parth Patwari, Diana Kearns-Manolatos and Iram Parveen, "AI is capturing the digital dollar. What's left for the rest of the tech estate?", Deloitte Center for Integrated Research, October 16, 2025, accessed November 18, 2025. [Report regarding Deloitte's 2025 Tech Trends survey.]
WindowsForum.com, “2025 Cloud Pricing Trends: Big Three Providers Shake Up Costs for Enterprises,” July 5, 2025, accessed November 4, 2025.

Yes

No

Navigate the economics of AI

How tokenomics is reshaping AI costs and ROI

What’s behind today’s AI token economy?

Get the full report

The new AI currency: Tokens, models, and value

What are AI tokens and why do they matter?

How to navigate the new economics of AI tokens

How do AI agent adoption models affect token pricing?

How are AI tokens priced?

End notes

Thanks for your feedback

Follow us

Race and Culture

Race and Culture

Race and Culture

Race and Culture

Race and Culture

Race and Culture

Race and Culture

Climate

Cyber

Climate

Cyber

Climate

Cyber

Climate

Cyber

Climate

Cyber

Climate

Cyber

Path to Scale 2.0

Path to Scale 2.0

Path to Scale 2.0

Path to Scale 2.0

Discover your future in Canberra

Student & Graduate opportunities

Discover your future in Canberra

Student & Graduate opportunities

Discover your future in Canberra

Student & Graduate opportunities

Discover your future in Canberra

Student & Graduate opportunities

Discover your future in Canberra

Student & Graduate opportunities

Navigate the economics of AI

How tokenomics is reshaping AI costs and ROI

What’s behind today’s AI token economy?

Get the full report

The new AI currency: Tokens, models, and value

What are AI tokens and why do they matter?

How to navigate the new economics of AI tokens

How do AI agent adoption models affect token pricing?

How are AI tokens priced?

End notes

Thanks for your feedback

Contact us

​David Alonso​

Stuart Scotis

Dr. Elea Wurth

Recommendations

2026 Global Human Capital Trends

Building Resilience

State of AI in the Enterprise 2022

2022 Deloitte CxO Sustainability Report | Deloitte Global

Follow us

David Alonso