Enterprise AI cost and ROI — pillar

Verifying, tracking, and challenging the ROI claims vendors and analysts make about enterprise agentic AI.

The actual numbers. Procurement-ready cost analysis with workload-pinned math and dated pricing-page citations.

Enterprise AI cost analysis is dominated by two failure modes: vendor-supplied ROI calculators that assume their own success, and analyst headlines that quote a single percentage without naming the underlying survey question. This pillar exists because procurement teams need neither.

The McKinsey 17% EBIT-attribution figure, the CMU 30.3% agent capability gap, the GPT-5 Pro $200/month tier, the Salesforce Agentforce $800M ARR run-rate — every cost claim that gets cited at a CIO budget meeting belongs on this pillar with its underlying methodology audited. When the audit passes (Holding), the piece stays. When it doesn't (Partial / Not holding), the piece stays anyway with the correction appended; nothing is quietly removed.

What "cost" actually covers in 2026 enterprise agentic AI: per-token and per-call API economics across model providers, with workload profiles (RAG-heavy versus agent-loop-heavy) pinned in volume terms. Subscription tier value analysis — is the Pro / Business / Enterprise upcharge supported by the published rate-limit + feature delta or not? TCO comparators — model API + vector DB + observability + governance tooling stack versus cloud-native (Bedrock, Vertex AI, Foundry) integrated stack.

Failure-mode cost — what does an agent loop costing 1000× a normal completion actually look like, and what governance prevents it. Effort accounting — the realistic engineering hours per agent shipped, drawn from named-company case studies where the case study survives source verification.

Pricing data ages fast. Every cost-pillar piece declares a dated source-pull and runs on a 30-day review cadence rather than the standard 30–90.

Pillar last refreshed 2026-05-01

What survives review

Microsoft 365 E7 and the new shape of AI licensing — Holding · AM-202
Agentic AI FinOps: the cost-governance discipline most enterprises skipped — Holding · AM-194
The AI Layoff Dividend That Has Not Arrived — Holding · AM-189
Security-platform agentic AI: evaluating TCO and ROI for the buying committee — Holding · AM-174
AI water use in context: comparing the 500 ml claim to coffee, beef, and cotton — Holding · AM-173
Agentic IAM TCO at the 2,000-employee scale: a structural cost model for the 3-year horizon — Holding · AM-180
Agentforce vs Microsoft Copilot pricing in 2026: the deep-dive for the buying decision — Holding · AM-182
Why AI productivity gains create workforce reduction pressure: the demand ceiling and the competitive trap — Holding · AM-166
You're Scoring This on the Wrong Axis — Holding · AM-162
97 percent invest, 5 percent are ready: why enterprise AI data readiness is a budget allocation problem — Holding · AM-165
AI and jobs: why the task-level frame is the one CIOs need — Holding · AM-161
The Energy Bill Nobody Budgeted For — Holding · AM-154
The agent fan-out problem: when one prompt becomes 400 LLM calls — Holding · AM-149
The CIO's playbook: what the named-success agentic AI deployments actually share — Holding · AM-010
The 56% AI-skill wage premium: what the Atlanta Fed data measures, and who actually captures it — Holding · AM-006
The MIT 95% GenAI-pilot-failure claim: what the State of AI in Business 2025 report actually measured — Holding · AM-128
AI in IT operations: what is actually shipping in 2026, and what the savings really look like — Holding · AM-121
Agentic-AI vs human workers: the 2026 cost economics CIOs should actually model — Holding · AM-106
The McKinsey 17% EBIT claim: what the survey actually measured — Holding · AM-033
Why 88% of agentic AI deployments fail — Holding · AM-213
The McKinsey 23%: the agentic AI scaling gap — Holding · AM-030
The CFO's agentic AI business case: TCO and ROI — Holding · AM-027
Build vs buy vs partner for enterprise agentic AI in 2026 — Holding · AM-028
Google AI Mode restaurant booking: the template for every partner-aggregation vertical — Holding · AM-023
The hidden costs of agentic AI: a CFO's guide to true TCO and ROI modeling — Holding · AM-020
Production agentic AI cost: the layered optimisation playbook for enterprise CFOs — Holding · AM-061

What has broken

Enterprise AI cost and ROI in 2026: what the evidence actually shows — Partial · AM-201
Mid-market agentic AI ROI in 90 days: what the cited data actually supports vs the vendor pitch — Partial · AM-129
The CMU 30.3%: the enterprise agent capability gap — Partial · AM-031
GPT-5 Pro at $200 a month: what the pricing tier signals to enterprise IT — Partial · AM-003
The two-cohort split in enterprise agentic AI outcomes: why the high-performing tail is structurally distinct — Partial · AM-132

Spoke articles

Microsoft 365 E7 and the new shape of AI licensing
Microsoft's $99 E7 Frontier Suite and the 1 Jul base-price rises move the enterprise AI-licensing decision onto the renewal table, where even the customers who decline Copilot end up paying more.
Enterprise AI cost and ROI in 2026: what the evidence actually shows
The enterprise AI cost question in 2026 is not the seat price on the order form; it is total cost of ownership measured against realised ROI. Across four independent datasets the high-return minority is separated from the majority by measurement discipline and operational preconditions, not by model capability or vendor choice.
Agentic AI FinOps: the cost-governance discipline most enterprises skipped
Enterprises that scale agentic AI without a dedicated FinOps discipline for inference, covering workload-level cost allocation, spend-cap tooling, and model-routing policy, repeatedly under-budget production spend. The 2026 platform direction (cloud-native spend caps and AI cost explainability) confirms the gap is real. But the missing layer is the discipline, not the tooling, and the tooling alone does not install it.
The AI Layoff Dividend That Has Not Arrived
The thesis driving 2026's restructuring is that agentic AI plus fewer people equals higher margin. Gartner's survey of 350 executives at billion-dollar firms found the companies that cut deepest earned returns close to identical to those that cut least. The return on AI is real, but it is not falling out of the headcount line, and the distinction changes how a CIO should frame the next budget.
Security-platform agentic AI: evaluating TCO and ROI for the buying committee
Security-platform agentic AI sits in a different TCO category than the general-purpose agentic AI the CFO playbook covers. The unit of analysis is the alert and the analyst hour, not the seat or the token. The 2026 evaluation that survives audit walks the buying committee through five cost components and three discount factors against vendor-supplied ROI numbers, and gates the procurement on a 90-day in-environment baseline, not a vendor demo.
AI water use in context: comparing the 500 ml claim to coffee, beef, and cotton
The 500 ml-per-prompt claim about generative AI, compared honestly to the water footprint of coffee, beef, cotton, and rice. The aggregate is small. The local concentration is the real story. What CIOs should defend when sustainability committees raise this.
Agentic IAM TCO at the 2,000-employee scale: a structural cost model for the 3-year horizon
The IAM TCO conversation at the 2,000-employee scale answers the CFO question that the Okta-vs-NHI-specialists matrix at AM-176 raises. The 3-year horizon prices five cost components (license, integration, operations, migration, exit) across three identity classes (human workforce, managed service accounts, agent-runtime), and reveals that the agent-runtime class is the line item growing fastest in the 2025-2026 cycle and the line item most often unpriced in the year-one budget.
Agentforce vs Microsoft Copilot pricing in 2026: the deep-dive for the buying decision
The feature comparison of Agentforce against Microsoft Copilot lives at the /compare/ page; the pricing comparison is a separate conversation because pricing models in this category change faster than features. The 2026 pricing structure resolves on per-conversation versus per-user-seat, the publicly disclosed unit rates, the buying-committee discount expectations at enterprise scale, and the year-two renewal pattern that the order-form headline does not predict; the 30-day review cadence on this piece is calibrated to the pricing-page change frequency.
Why AI productivity gains create workforce reduction pressure: the demand ceiling and the competitive trap
The argument that AI-driven productivity lets companies keep all their workers and simply produce more runs into two hard limits: consumer demand and competitive dynamics. Both constraints are structural, operating regardless of management intent, and both resolve in the same direction: fewer workers for the same revenue.
You're Scoring This on the Wrong Axis
The coverage of Karpathy joining Anthropic's pre-training team read it as a talent-war coup. It is also misreading which seat has the leverage. That axis error is one enterprise IT makes with its own best engineers every day.
97 percent invest, 5 percent are ready: why enterprise AI data readiness is a budget allocation problem
Dun and Bradstreet's 2026 AI Momentum Survey of 10,000 businesses across 32 countries found that 97 percent of organisations report active AI initiatives, but only 5 percent say their data is adequately ready to support them. That gap is not primarily a technology problem. Most enterprise data environments were built for human workflows, not for autonomous AI systems operating continuously across mission-critical processes. The gap between initiative volume and data readiness is a budget-allocation failure: enterprises that treat data infrastructure as the prerequisite spend rather than a parallel track are the ones that reach scale. Enterprises that treat it as a follow-on investment do not.
AI and jobs: why the task-level frame is the one CIOs need
The job-level question every CIO is fielding from employees — 'will AI replace my role?' — keeps missing what is actually happening at the task level. The frame mismatch is the visible mechanism behind the retraining-budget gap.
The Energy Bill Nobody Budgeted For
Nvidia says agentic AI may need up to a thousand times the compute of a chatbot. The credible enterprise range is 10x to 100x by 2030. Even the floor of that range absorbs the renewable headroom the energy transition depends on, and almost no enterprise AI roadmap is pricing it.
The agent fan-out problem: when one prompt becomes 400 LLM calls
Production agentic systems amplify a single user request into dozens or hundreds of internal LLM calls. Most enterprise unit-economics, latency budgets, and observability setups are still priced for 1:1.
The CIO's playbook: what the named-success agentic AI deployments actually share
Four named enterprise deployments (JPMorgan, Toshiba, Wipro, Aberdeen City Council) cleared the McKinsey scaling threshold; the documented cohort that did not, RAND's 2024 study of 65 senior data scientists, identified an 80% pilot-to-production failure rate. The five operational characteristics shared by the named-success cases are observational, citable, and distinct from the proprietary acronym frameworks that crowd the procurement deck. CIO-level visibility on per-deployment ROI is the one most often missing in the failed cohort.
The 56% AI-skill wage premium: what the Atlanta Fed data measures, and who actually captures it
The Federal Reserve Bank of Atlanta's May 2025 'By Degrees' analysis (Lightcast job-posting data through 2024) reports a 56% wage premium for AI-skilled workers and AI-skill demand surfacing in 1.62% of all job postings. The headline number is real; the typical mid-career worker reading it should not expect to capture it from a generic AI-literacy course. Boston Consulting Group's October 2024 study (n=11,000+ employees, 50+ countries) reports a 14% frontline vs 44% leader gap in AI upskilling access. That gap, not the 56% itself, is the operational variable for who captures the premium and who sees credential inflation without the wage signal.
The MIT 95% GenAI-pilot-failure claim: what the State of AI in Business 2025 report actually measured
MIT NANDA's GenAI Divide report (August 2025) is the source of the 2026's most-cited bear-case statistic: 95% of generative AI pilots fail. The number is a self-reported survey result with a specific methodology, and the way it gets read in procurement decks materially overstates what the underlying data supports. The structural findings underneath the headline are more useful than the headline itself.
Mid-market agentic AI ROI in 90 days: what the cited data actually supports vs the vendor pitch
The 240% ROI in 90 days framing is the most common mid-market agentic AI vendor pitch in 2026, and the most-cited stat that no audited mid-market deployment has actually produced. Read against the McKinsey 17%, MIT NANDA 95%, and IDC/Lenovo pilot-graduation data, the realistic 90-day mid-market ROI band is much narrower and much more useful for procurement than the pitch suggests.
AI in IT operations: what is actually shipping in 2026, and what the savings really look like
Deep dive into the AI-in-IT-ops market in mid-2026: ServiceNow Now Assist, Microsoft Copilot, AIOps platforms, and the gap between vendor pitch and audited reality. What is actually shipping, what is failing, and what the staff-reduction numbers honestly look like when you trace them to primary sources.
Agentic-AI vs human workers: the 2026 cost economics CIOs should actually model
Loaded FTE cost vs total agent operational cost does not favour replacement at parity in 2026 for most roles. The math works for narrow, high-volume task categories and breaks for judgment-laden ones.
The McKinsey 17% EBIT claim: what the survey actually measured
The McKinsey 17% EBIT-attribution figure is the most-cited single statistic in 2026 enterprise agentic AI procurement. The way it is typically read materially overstates what the underlying survey supports.
Why 88% of agentic AI deployments fail
The famous failure figure is real, but it is not an ROI distribution. IDC research commissioned by Lenovo found 88% of AI proof-of-concepts never reach production; about 4 of 33 graduate. This article was restated on 10 Jun 2026 after its original Stanford attribution failed verification.
The McKinsey 23%: the agentic AI scaling gap
McKinsey 2025: 23% scaling, 39% experimenting. The pilot-to-production chasm is not about model readiness. It is about operational preconditions.
The CMU 30.3%: the enterprise agent capability gap
Carnegie Mellon 2026: 30.3% task completion for best frontier models. The deployments that work operate within the 30.3%, not around it.
The CFO's agentic AI business case: TCO and ROI
Most agentic AI business cases fail audit. Three documents survive: TCO with named components, ROI with pre-deployment baseline, scenario-weighted NPV.
Build vs buy vs partner for enterprise agentic AI in 2026
Most enterprises frame agentic AI as build vs buy. It's a binary on a three-body problem. Partner — the third path — is systematically under-chosen.
Google AI Mode restaurant booking: the template for every partner-aggregation vertical
Google shipped agentic restaurant booking to eight countries on 10 April 2026. The restaurant vertical is not the story. The story is that eight named.
GPT-5 Pro at $200 a month: what the pricing tier signals to enterprise IT
OpenAI's GPT-5 Pro tier launched in August 2025 with no benchmarks and a $200/month subscription. The pricing decision is more interpretable than the capability claim. What the tier signals for enterprise procurement and how the McKinsey 17% EBIT-attribution figure cited around the launch should actually be read.
The two-cohort split in enterprise agentic AI outcomes: why the high-performing tail is structurally distinct
Enterprise agentic AI outcomes split into a small high-performing tail and a much larger struggling body. Gartner, McKinsey State of AI, and MIT NANDA outcome data converge on that shape, and IDC pilot-graduation data shows most deployments stall before production. What separates the two cohorts is operational discipline, not model selection — and the 73%/27% framing in the slug captures that pattern more cleanly than the original AI-slop body did.
The hidden costs of agentic AI: a CFO's guide to true TCO and ROI modeling
Enterprise TCO models underestimate agentic-AI programmes by 40-60%. The surprise is not that the costs are hidden. It is that they are distributed.
Production agentic AI cost: the layered optimisation playbook for enterprise CFOs
Production agentic-AI bills routinely run several times the POC forecast. The mechanism is structural: token economics, orchestration overhead, context drift, observability. So is the optimisation.

What we're watching next

Frontier-model inference pricing dropping another order of magnitude before end of Q3 2026.The LLMflation curve has held since 2022. If it continues, the cost-economics math shifts the boundary between replaceable and augmentable categories. If it breaks, the augmentation-over-replacement frame strengthens further.
Published Fortune 500 / FTSE 100 case data showing residual-team productivity recovery curves materially shorter than the 6-12 month band.The retraining-gap claim is built on the published 2024-2025 cohort. The next round of named-company audits in the back half of 2026 either confirms or contracts the dip estimate.
McKinsey and BCG publishing late-2026 enterprise gen-AI cost-multiplier studies.The current pilot-to-production cost multiplier is ~2-5×. If aggregate enterprise data shows compression toward 1.5×, the structural-multiplier framing across the cost cluster needs revising. If it widens, the procurement-playbook intervention becomes more urgent.
GPT-5 Pro and Claude Opus 4 reasoning-mode adoption metrics becoming public.Reasoning models are 5-20× more expensive per call. The procurement question is whether enterprise deployments are routing-by-step (cheap most of the time) or routing-by-deployment (always expensive). Adoption data settles which pattern actually holds.

Primary sources we trust for this topic

A curated list of primary research, regulator guidance, and vendor documentation for enterprise ai cost and roi. Populated on the quarterly refresh — not a link dump, not competitors.

This pillar page is refreshed quarterly. Last refresh: 19 Apr 2026. Next refresh: 18 Jul 2026.

Topic · Enterprise AI cost and ROI

What survives review

What has broken

Spoke articles

What we're watching next

Primary sources we trust for this topic