In production agentic systems documented across the publicly observable 2025–2026 deployments, the observable band of internal LLM calls per user-facing request sits between 1:18 and 1:60 across documented deployments, with tail cases regularly exceeding 1:400 — meaning unit-economics, latency budgets, and observability scopes built on a 1:1 mental model under-provision by one to two orders of magnitude.
Claim created at publish; review on 60-day cadence (the model-capability and agentic-framework release tempo means the fan-out ratio band can shift within a single quarter). The 1:18 floor is derived from constrained, well-instrumented production deployments documented in LangSmith multi-step agent case studies and the Anthropic Building Effective Agents post (December 2024); the 1:60 upper band reflects research and orchestration agents documented in the Microsoft Magentic-One paper (arXiv 2411.04468) and Helicone cost-per-task analysis; the 1:400 tail reflects unbounded ReAct agents documented in OpenAI Agents SDK warnings and LangSmith trace data. The band is asserted as the observable range across documented case studies, not as a measured median across a systematic sample — the systematic cross-deployment study does not yet exist in the public literature. Three triggers to revisit before next cadence: (a) a frontier model that internalises multi-step reasoning within a single API call, collapsing the billing-layer fan-out ratio, which would require reframing the cost section while preserving the latency and observability sections; (b) a major observability platform (LangSmith, Helicone, Datadog AI) publishing a cross-deployment fan-out ratio study that either validates or revises the 1:18–1:60 band with systematic data; (c) provider-level task-completion pricing replacing per-call pricing, which changes the unit-economics framing without changing the detection or latency implications. Sister claims: AM-013 (Q1 2026 agentic AI thresholds — the detection-time and observability themes are directly related), AM-148 (GPT-5.5 vs Opus 4.7 cost-per-task framing — the fan-out ratio is a multiplier on the per-call cost comparison). The MTTD-for-Agents tool-use-frequency Z-score tripwire described in this article is the primary detection mechanism; full methodology at /mttd/.
/holding/AM-149/Embed this claimiframe + oEmbed
The card auto-updates when the claim's status, last-reviewed date, or correction log changes. Embedders never need to refresh — the card is rendered live from the canonical record.
Email-me when AM-149's status, next review date, or correction log changes. One email per change. No newsletter subscription, no other mail.
The claim: In production agentic systems documented across the publicly observable 2025–2026 deployments, the observable band of internal LLM calls per user-facing request sits between 1:18 and 1:60 across documented deployments, with tail cases regularly exceeding 1:400 — meaning unit-economics, latency budgets, and observability scopes built on a 1:1 mental model under-provision by one to two orders of magnitude.
About this register
The Reporting register tracks claims published from articles addressed to senior enterprise IT leaders — CIOs, IT directors, heads of platform. Claims are reviewed on a 30–90 day cadence; each review either reaffirms the claim, marks one substantive part as Partial, or marks it Not holding once the underlying evidence has been overtaken.
Recent corrections in Reporting
- AM-003 · Partial · 28 May 2026
Pricing/model drift: a $100/mo Pro tier now sits beside the $200 tier (added 9 Apr 2026) and the premium model is GPT-5.5 Pro. Core thesis holds; the single-$200-tier framing no longer matches. Re-verify current tiers at chatgpt.com/pricing.
- AM-002 · Not holding · 06 May 2026
URL state changed. The /the-agentic-ai-revolution-real-world-success-stories-and-strategic-insights-from-2024-2025/ slug now serves a deliberately rewritten retrospective (claimId AM-130, "Agentic AI 2024-2025 retrospective", published 04 May 2026) against audited primary sources. The 28 Apr 2026 redirect to /retractions/ has been lifted to allow that. AM-002 the claim remains Not holding — the original $3.50/dollar + 70% failure-rate framing was withdrawn and is not restored. AM-130 is a separate claim with its own evidence chain. Readers arriving at /holding/AM-002 see the withdrawal here; the article link surfaces the new piece at the URL the original lived at, with this entry as the audit trail.
- AM-121 · Holding · 2 May 2026
Klarna walk-back primary-source upgrade — added Siemiatkowski verbatim quotes via Bloomberg-cited-by-Fortune (9 May 2025) and the Uber-style freelance hiring detail via Entrepreneur. Closes the highest-priority evidence gap from the source dossier.
Reviews coming up in Reporting
- AM-136 · Holding · next +4d (4 Jun 2026)
Across the 24-month window May 2024 to April 2026, every major foundation-model provider (Anthropic, OpenAI, Google, AW…
- AM-020 · Holding · next +18d (18 Jun 2026)
The 40-60% TCO underestimate on enterprise agentic-AI deployments is not a cost-visibility failure — it is a cross-depa…
- AM-023 · Holding · next +18d (18 Jun 2026)
The 10 Apr 2026 Google AI Mode rollout to eight markets is the first vertical (restaurant booking) where agentic search…