AI is now the fastest-growing expense in corporate technology budgets, with some firms reporting that it consumes up to half of their IT spend. Cloud computing bills are rising sharply—up 19% in 20251 for many enterprises—as generative AI becomes central to operations. Yet, as costs mount, returns can remain elusive. According to Deloitte’s 2025 US Tech Value survey, nearly half of leaders expect it will take up to three years to see ROI from basic AI automation, and only 28% of global finance leaders report clear, measurable value from their AI investments.
This disconnect is not just a financial headache—it’s a strategic reckoning. For many organizations, the imperative to adopt AI is less about immediate returns and more about staving off existential threats or maintaining competitive parity. In these cases, the focus must shift from whether AI delivers value to how its economics are measured and managed for organizations to thrive in a structurally different environment. As such, enterprise technology, business leaders face a new economic reality, defined not necessarily by traditional metrics but rather the volatile, nonlinear dynamics of token-based AI consumption.
Unlike previous technology waves where costs were tied to subscriptions or virtual machines, AI economics now revolve around tokens—the fundamental unit of AI work. Every interaction, from model training to inference, is measured in tokens, or small chunks of data that models process, making costs inherently variable and often unpredictable. Key drivers of this volatility include:
Token costs, meanwhile, are shaped by a cascade of technical decisions:
Leaders who understand these dynamics can more effectively manage AI as a true economic system, aligning infrastructure and model choices with business priorities, optimizing spend while delivering high-quality outcomes.
While the unit price of AI tokens is falling, overall enterprise spending on and scaling of AI systems is rising. The number of users, complexity of models, and intensity of workloads will likely drive greater token consumption and, consequently, higher costs.
As AI workloads scale, the underlying mechanics of a new AI economy emerge, with spending likely falling into different buying patterns depending on how organizations consume intelligence:
A Deloitte simulation set up to isolate how hosting choices, AI model selection, and usage maturity interact to drive token consumption and total cost based on 8 GPU scaled increments found:
To prevent runaway costs, leaders should strive to optimize what they use. The following approaches can be helpful:
AI cannot be managed with outdated cost models. Business leaders should treat AI economics with the same rigor as energy or capital allocation, recognizing tokens as the new currency. Hybrid infrastructure and FinOps are key to sustainable AI adoption, enabling organizations to deploy workloads where they can be most economically and strategically advantageous. Fluency in token economics will increasingly distinguish organizations that can scale AI confidently and convert consumption into measurable enterprise value.
Read the full report: The pivot to tokenomics: Navigating AI’s new spend dynamics.
This article originally appeared in Deloitte Executive Perspectives in CIO Journal from The Wall Street Journal on Jan. 14, 2026. The Wall Street Journal News Department was not involved in the creation of this content.