Topic · Enterprise AI cost and ROI
Verifying, tracking, and challenging the ROI claims vendors and analysts make about enterprise agentic AI.
The actual numbers. Procurement-ready cost analysis with workload-pinned math and dated pricing-page citations.
Enterprise AI cost analysis is dominated by two failure modes: vendor-supplied ROI calculators that assume their own success, and analyst headlines that quote a single percentage without naming the underlying survey question. This pillar exists because procurement teams need neither.
The McKinsey 17% EBIT-attribution figure, the CMU 30.3% agent capability gap, the GPT-5 Pro $200/month tier, the Salesforce Agentforce $800M ARR run-rate — every cost claim that gets cited at a CIO budget meeting belongs on this pillar with its underlying methodology audited. When the audit passes (Holding), the piece stays. When it doesn't (Partial / Not holding), the piece stays anyway with the correction appended; nothing is quietly removed.
What "cost" actually covers in 2026 enterprise agentic AI: per-token and per-call API economics across model providers, with workload profiles (RAG-heavy versus agent-loop-heavy) pinned in volume terms. Subscription tier value analysis — is the Pro / Business / Enterprise upcharge supported by the published rate-limit + feature delta or not? TCO comparators — model API + vector DB + observability + governance tooling stack versus cloud-native (Bedrock, Vertex AI, Foundry) integrated stack.
Failure-mode cost — what does an agent loop costing 1000× a normal completion actually look like, and what governance prevents it. Effort accounting — the realistic engineering hours per agent shipped, drawn from named-company case studies where the case study survives source verification.
Pricing data ages fast. Every cost-pillar piece declares a dated source-pull and runs on a 30-day review cadence rather than the standard 30–90.
Pillar last refreshed 2026-05-01
What survives review
- Agentic AI FinOps: the cost-governance discipline most enterprises skipped — Holding · AM-194
- Security-platform agentic AI: evaluating TCO and ROI for the buying committee — Holding · AM-174
- Agentic IAM TCO at the 2,000-employee scale: a structural cost model for the 3-year horizon — Holding · AM-180
- Agentforce vs Microsoft Copilot pricing in 2026: the deep-dive for the buying decision — Holding · AM-182
- 97 percent invest, 5 percent are ready: why enterprise AI data readiness is a budget allocation problem — Holding · AM-165
- The Energy Bill Nobody Budgeted For — Holding · AM-154
- The agent fan-out problem: when one prompt becomes 400 LLM calls — Holding · AM-149
- The CIO's playbook: what the named-success agentic AI deployments actually share — Holding · AM-010
- The MIT 95% GenAI-pilot-failure claim: what the State of AI in Business 2025 report actually measured — Holding · AM-128
- Mid-market agentic AI ROI in 90 days: what the cited data actually supports vs the vendor pitch — Holding · AM-129
- AI in IT operations: what is actually shipping in 2026, and what the savings really look like — Holding · AM-121
- Agentic-AI vs human workers: the 2026 cost economics CIOs should actually model — Holding · AM-106
- The McKinsey 17% EBIT claim: what the survey actually measured — Holding · AM-033
- Why 88% of agentic AI deployments fail — Holding · AM-029
- The McKinsey 23%: the agentic AI scaling gap — Holding · AM-030
- The CMU 30.3%: the enterprise agent capability gap — Holding · AM-031
- The CFO's agentic AI business case: TCO and ROI — Holding · AM-027
- Build vs buy vs partner for enterprise agentic AI in 2026 — Holding · AM-028
- Google AI Mode restaurant booking: the template for every partner-aggregation vertical — Holding · AM-023
- The bimodal ROI distribution in enterprise agentic AI: why the high-performing cohort is structurally distinct — Holding · AM-132
- The hidden costs of agentic AI: a CFO's guide to true TCO and ROI modeling — Holding · AM-020
- Production agentic AI cost: the layered optimisation playbook for enterprise CFOs — Holding · AM-061
What has broken
Spoke articles
- Agentic AI FinOps: the cost-governance discipline most enterprises skipped
Enterprises that scale agentic AI without a dedicated FinOps discipline for inference, covering workload-level cost allocation, spend-cap tooling, and model-routing policy, repeatedly under-budget production spend. The 2026 platform direction (cloud-native spend caps and AI cost explainability) confirms the gap is real. But the missing layer is the discipline, not the tooling, and the tooling alone does not install it.
- Security-platform agentic AI: evaluating TCO and ROI for the buying committee
Security-platform agentic AI sits in a different TCO category than the general-purpose agentic AI the CFO playbook covers. The unit of analysis is the alert and the analyst hour, not the seat or the token. The 2026 evaluation that survives audit walks the buying committee through five cost components and three discount factors against vendor-supplied ROI numbers, and gates the procurement on a 90-day in-environment baseline, not a vendor demo.
- Agentic IAM TCO at the 2,000-employee scale: a structural cost model for the 3-year horizon
The IAM TCO conversation at the 2,000-employee scale answers the CFO question that the Okta-vs-NHI-specialists matrix at AM-176 raises. The 3-year horizon prices five cost components (license, integration, operations, migration, exit) across three identity classes (human workforce, managed service accounts, agent-runtime), and reveals that the agent-runtime class is the line item growing fastest in the 2025-2026 cycle and the line item most often unpriced in the year-one budget.
- Agentforce vs Microsoft Copilot pricing in 2026: the deep-dive for the buying decision
The feature comparison of Agentforce against Microsoft Copilot lives at the /compare/ page; the pricing comparison is a separate conversation because pricing models in this category change faster than features. The 2026 pricing structure resolves on per-conversation versus per-user-seat, the publicly disclosed unit rates, the buying-committee discount expectations at enterprise scale, and the year-two renewal pattern that the order-form headline does not predict; the 30-day review cadence on this piece is calibrated to the pricing-page change frequency.
- 97 percent invest, 5 percent are ready: why enterprise AI data readiness is a budget allocation problem
Dun and Bradstreet's 2026 AI Momentum Survey of 10,000 businesses across 32 countries found that 97 percent of organisations report active AI initiatives, but only 5 percent say their data is adequately ready to support them. That gap is not primarily a technology problem. Most enterprise data environments were built for human workflows, not for autonomous AI systems operating continuously across mission-critical processes. The gap between initiative volume and data readiness is a budget-allocation failure: enterprises that treat data infrastructure as the prerequisite spend rather than a parallel track are the ones that reach scale. Enterprises that treat it as a follow-on investment do not.
- The Energy Bill Nobody Budgeted For
Nvidia says agentic AI may need up to a thousand times the compute of a chatbot. The credible enterprise range is 10x to 100x by 2030. Even the floor of that range absorbs the renewable headroom the energy transition depends on, and almost no enterprise AI roadmap is pricing it.
- The agent fan-out problem: when one prompt becomes 400 LLM calls
Production agentic systems amplify a single user request into dozens or hundreds of internal LLM calls. Most enterprise unit-economics, latency budgets, and observability setups are still priced for 1:1.
- The CIO's playbook: what the named-success agentic AI deployments actually share
Four named enterprise deployments (JPMorgan, Toshiba, Wipro, Aberdeen City Council) cleared the McKinsey scaling threshold; the documented cohort that did not, RAND's 2024 study of 65 senior data scientists, identified an 80% pilot-to-production failure rate. The five operational characteristics shared by the named-success cases are observational, citable, and distinct from the proprietary acronym frameworks that crowd the procurement deck. CIO-level visibility on per-deployment ROI is the one most often missing in the failed cohort.
- The MIT 95% GenAI-pilot-failure claim: what the State of AI in Business 2025 report actually measured
MIT NANDA's GenAI Divide report (August 2025) is the source of the 2026's most-cited bear-case statistic: 95% of generative AI pilots fail. The number is a self-reported survey result with a specific methodology, and the way it gets read in procurement decks materially overstates what the underlying data supports. The structural findings underneath the headline are more useful than the headline itself.
- Mid-market agentic AI ROI in 90 days: what the cited data actually supports vs the vendor pitch
The 240% ROI in 90 days framing is the most common mid-market agentic AI vendor pitch in 2026, and the most-cited stat that no audited mid-market deployment has actually produced. Read against the McKinsey 17%, MIT NANDA 95%, and Stanford 12/88 data, the realistic 90-day mid-market ROI band is much narrower and much more useful for procurement than the pitch suggests.
- AI in IT operations: what is actually shipping in 2026, and what the savings really look like
Deep dive into the AI-in-IT-ops market in mid-2026: ServiceNow Now Assist, Microsoft Copilot, AIOps platforms, and the gap between vendor pitch and audited reality. What is actually shipping, what is failing, and what the staff-reduction numbers honestly look like when you trace them to primary sources.
- Agentic-AI vs human workers: the 2026 cost economics CIOs should actually model
Loaded FTE cost vs total agent operational cost does not favour replacement at parity in 2026 for most roles. The math works for narrow, high-volume task categories and breaks for judgment-laden ones.
- The McKinsey 17% EBIT claim: what the survey actually measured
The McKinsey 17% EBIT-attribution figure is the most-cited single statistic in 2026 enterprise agentic AI procurement. The way it is typically read materially overstates what the underlying survey supports.
- Why 88% of agentic AI deployments fail
Stanford 2026 data: 12% of agentic AI deployments clear 300%+ ROI; 88% miss. The distribution is not a capability problem. It is a governance gap.
- The McKinsey 23%: the agentic AI scaling gap
McKinsey 2025: 23% scaling, 39% experimenting. The pilot-to-production chasm is not about model readiness. It is about operational preconditions.
- The CMU 30.3%: the enterprise agent capability gap
Carnegie Mellon 2026: 30.3% task completion for best frontier models. The deployments that work operate within the 30.3%, not around it.
- The CFO's agentic AI business case: TCO and ROI
Most agentic AI business cases fail audit. Three documents survive: TCO with named components, ROI with pre-deployment baseline, scenario-weighted NPV.
- Build vs buy vs partner for enterprise agentic AI in 2026
Most enterprises frame agentic AI as build vs buy. It's a binary on a three-body problem. Partner — the third path — is systematically under-chosen.
- Google AI Mode restaurant booking: the template for every partner-aggregation vertical
Google shipped agentic restaurant booking to eight countries on 10 April 2026. The restaurant vertical is not the story. The story is that eight named.
- GPT-5 Pro at $200 a month: what the pricing tier signals to enterprise IT
OpenAI's GPT-5 Pro tier launched in August 2025 with no benchmarks and a $200/month subscription. The pricing decision is more interpretable than the capability claim. What the tier signals for enterprise procurement and how the McKinsey 17% EBIT-attribution figure cited around the launch should actually be read.
- The bimodal ROI distribution in enterprise agentic AI: why the high-performing cohort is structurally distinct
Enterprise agentic AI ROI is bimodal, not normally distributed. Stanford DEL, Gartner, McKinsey State of AI, and MIT NANDA data converge on the same shape: a small high-performing tail and a much larger struggling body. What separates the two is operational discipline, not model selection — and the 73%/27% framing in the slug captures that pattern more cleanly than the original AI-slop body did.
- The hidden costs of agentic AI: a CFO's guide to true TCO and ROI modeling
Enterprise TCO models underestimate agentic-AI programmes by 40-60%. The surprise is not that the costs are hidden. It is that they are distributed.
- Production agentic AI cost: the layered optimisation playbook for enterprise CFOs
Production agentic-AI bills routinely run several times the POC forecast. The mechanism is structural: token economics, orchestration overhead, context drift, observability. So is the optimisation.
What we're watching next
- Frontier-model inference pricing dropping another order of magnitude before end of Q3 2026.The LLMflation curve has held since 2022. If it continues, the cost-economics math shifts the boundary between replaceable and augmentable categories. If it breaks, the augmentation-over-replacement frame strengthens further.
- Published Fortune 500 / FTSE 100 case data showing residual-team productivity recovery curves materially shorter than the 6-12 month band.The retraining-gap claim is built on the published 2024-2025 cohort. The next round of named-company audits in the back half of 2026 either confirms or contracts the dip estimate.
- McKinsey and BCG publishing late-2026 enterprise gen-AI cost-multiplier studies.The current pilot-to-production cost multiplier is ~2-5×. If aggregate enterprise data shows compression toward 1.5×, the structural-multiplier framing across the cost cluster needs revising. If it widens, the procurement-playbook intervention becomes more urgent.
- GPT-5 Pro and Claude Opus 4 reasoning-mode adoption metrics becoming public.Reasoning models are 5-20× more expensive per call. The procurement question is whether enterprise deployments are routing-by-step (cheap most of the time) or routing-by-deployment (always expensive). Adoption data settles which pattern actually holds.
Primary sources we trust for this topic
A curated list of primary research, regulator guidance, and vendor documentation for enterprise ai cost and roi. Populated on the quarterly refresh — not a link dump, not competitors.
This pillar page is refreshed quarterly. Last refresh: 19 Apr 2026. Next refresh: 18 Jul 2026.