{"generated":"2026-05-06T22:44:27.829Z","publication":{"name":"Agent Mode AI","url":"https://agentmodeai.com","editorial_model":"AI-written by Claude, reviewed and signed off by Peter.","review_cadence":"30 to 90 days per claim; see next_review field."},"holding_up":{"description":"Claims published by Agent Mode AI itself. One claim per article.","methodology_url":"https://agentmodeai.com/standards/","index_url":"https://agentmodeai.com/holding/","claims":[{"id":"AM-001","claim":"70% of AI-implementation failure is people and process, not technology — cultural transformation is the strongest predictor of AI ROI at the 2024-2025 maturity stage.","article_url":"https://agentmodeai.com/ai-readiness-in-organizations-the-2024-2025-landscape/","topic":"agentic-ai-governance","pub_date":"2026-04-19","last_reviewed":"2026-04-19","next_review":"2026-06-18","verdict":"holding","verdict_history":[{"date":"2026-04-19","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-002","claim":"Agentic AI's $3.50-per-dollar average return masks a 70% task-failure rate on the Carnegie Mellon benchmark; only narrowly-scoped deployments clear the reality bar.","article_url":"https://agentmodeai.com/the-agentic-ai-revolution-real-world-success-stories-and-strategic-insights-from-2024-2025/","topic":"agent-procurement","pub_date":"2026-04-19","last_reviewed":"2026-04-28","next_review":null,"verdict":"not_holding","verdict_history":[{"date":"2026-04-19","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-28","verdict":"partial","note":"Article retracted 28 Apr 2026. Slug language ('revolution', 'real-world success stories') carries hype register the publication explicitly avoids; survey-of-surveys structure does not stand up to source-verification at the level the publication now demands. Google has rejected the URL despite an active claim status. URL now redirects to /retractions/?retired=the-agentic-ai-revolution-real-world-success-stories-and-strategic-insights-from-2024-2025. Claim withdrawn — status moves to Not holding, no further reviews scheduled."},{"date":"2026-05-06","verdict":"partial","note":"URL state changed. The /the-agentic-ai-revolution-real-world-success-stories-and-strategic-insights-from-2024-2025/ slug now serves a deliberately rewritten retrospective (claimId AM-130, \"Agentic AI 2024-2025 retrospective\", published 04 May 2026) against audited primary sources. The 28 Apr 2026 redirect to /retractions/ has been lifted to allow that. AM-002 the claim remains Not holding — the original $3.50/dollar + 70% failure-rate framing was withdrawn and is not restored. AM-130 is a separate claim with its own evidence chain. Readers arriving at /holding/AM-002 see the withdrawal here; the article link surfaces the new piece at the URL the original lived at, with this entry as the audit trail."}],"primary_sources":[]},{"id":"AM-003","claim":"GPT-5 Pro's tiered-subscription model forces enterprises to classify problems by computational difficulty — $200/month premium routing only repays for the top decile of 'very hard' queries.","article_url":"https://agentmodeai.com/gpt-5-pro-vs-enterprise-ai-agents-what-very-hard-problems-means-for-your-business/","topic":"enterprise-ai-cost","pub_date":"2026-04-19","last_reviewed":"2026-04-19","next_review":"2026-05-19","verdict":"holding","verdict_history":[{"date":"2026-04-19","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-013","claim":"Q1 2026 is the quarter enterprise agentic-AI crossed three thresholds simultaneously — the first at-scale in-the-wild exploits, the first vendor-shipped governance infrastructure, and the first hard ROI data — and programmes designed around only one will not make the 28% that pay off.","article_url":"https://agentmodeai.com/agentic-ai-got-real-q1-2026/","topic":"agentic-ai-governance","pub_date":"2026-04-19","last_reviewed":"2026-04-19","next_review":"2026-06-18","verdict":"holding","verdict_history":[{"date":"2026-04-19","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-014","claim":"The ~73% of enterprise agentic-AI projects that fail share three structural gaps — no named owner, scope drift, and missing agent-level MTTD — and the 27% that succeed cluster around the inverse.","article_url":"https://agentmodeai.com/why-73-of-agentic-ai-projects-fail-and-how-the-27-generate-312-roi/","topic":"enterprise-ai-cost","pub_date":"2025-08-03","last_reviewed":"2026-04-28","next_review":null,"verdict":"not_holding","verdict_history":[{"date":"2025-08-03","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-19","verdict":"partial","note":"Article predates the Holding-up standard. Retroactive claim assigned on 19 Apr 2026. Initial verdict 'Partial' — spine is defensible, per-claim numeric verification deferred to +60d review. Body not rewritten per AGENTMODE_PHASE2_BRIEF §114."},{"date":"2026-04-28","verdict":"partial","note":"Article retracted 28 Apr 2026. Both stats in the slug ('73% fail', '312% ROI') were backfilled with status: partial on 19 Apr 2026 noting the article predates editorial standard. Body never rewritten. Google's quality algorithm has independently flagged the URL nine days later. The slug carries the structural problem. URL now redirects to /retractions/?retired=why-73-of-agentic-ai-projects-fail-and-how-the-27-generate-312-roi. Claim withdrawn — status moves to Not holding, no further reviews scheduled."}],"primary_sources":[]},{"id":"AM-015","claim":"An agentic-AI Center of Excellence justifies its overhead only after the organisation has three production agents running; before that, it over-governs an experimental footprint.","article_url":"https://agentmodeai.com/building-a-center-of-excellence-for-agentic-ai-in-it-operations-complete-enterprise-guide/","topic":"agentic-ai-governance","pub_date":"2025-08-01","last_reviewed":"2026-04-19","next_review":"2026-06-18","verdict":"partial","verdict_history":[{"date":"2025-08-01","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-19","verdict":"partial","note":"Article predates the Holding-up standard. Retroactive claim assigned on 19 Apr 2026. Initial verdict 'Partial' — spine is defensible, per-claim numeric verification deferred to +60d review. Body not rewritten per AGENTMODE_PHASE2_BRIEF §114."}],"primary_sources":[]},{"id":"AM-016","claim":"Agent-mediated network management reduces unplanned firewall-change incident costs only when the agent's action log feeds into the same change-management audit trail human changes use — not as a parallel system.","article_url":"https://agentmodeai.com/the-7-2m-firewall-change-that-transformed-network-management-how-agentic-ai-prevents-it-disasters/","topic":null,"pub_date":"2025-07-27","last_reviewed":"2026-04-28","next_review":null,"verdict":"not_holding","verdict_history":[{"date":"2025-07-27","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-19","verdict":"partial","note":"Article predates the Holding-up standard. Retroactive claim assigned on 19 Apr 2026. Initial verdict 'Partial' — spine is defensible, per-claim numeric verification deferred to +60d review. Body not rewritten per AGENTMODE_PHASE2_BRIEF §114."},{"date":"2026-04-28","verdict":"partial","note":"Article retracted 28 Apr 2026. '$7.2M' figure in the slug cannot be traced to any disclosed firewall-change incident. Body never rewritten past 19 Apr 2026 backfill. The dollar specificity in the URL is the structural problem and Google's quality algorithm has independently flagged the URL. URL now redirects to /retractions/?retired=the-7-2m-firewall-change-that-transformed-network-management-how-agentic-ai-prevents-it-disasters. Claim withdrawn — status moves to Not holding, no further reviews scheduled."}],"primary_sources":[]},{"id":"AM-017","claim":"Agentic AI's durable enterprise pattern is redeployment-first, not replacement-first. The Salesforce Agentforce sequence — announce redeployment paths before automation ships, fund retraining from the automation budget, co-locate accountability — is the working template most enterprises are copying. Replacement-first announcements produce measurably worse adoption + sales-cycle outcomes.","article_url":"https://agentmodeai.com/the-day-9000-people-asked-to-be-replaced/","topic":null,"pub_date":"2025-07-19","last_reviewed":"2026-04-28","next_review":null,"verdict":"not_holding","verdict_history":[{"date":"2025-07-19","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-19","verdict":"partial","note":"Article predates the Holding-up standard. Retroactive claim assigned on 19 Apr 2026. Initial verdict 'Partial' — spine is defensible, per-claim numeric verification deferred to +60d review. Body not rewritten per AGENTMODE_PHASE2_BRIEF §114."},{"date":"2026-04-19","verdict":"partial","note":"Anchor verification complete (see audit/ANCHOR_VERIFICATION_2026-04-19.md). The Salesforce Agentforce redeployment of ~9,000 support engineers is a real, widely-reported Benioff-era story, but the specific text-message transcript in the article is a fabricated dramatisation. Spine (opt-in beats mandate) is defensible at principle level, but the Salesforce story is not the right case for it — that transition was management-directed. Rewrite flagged for before 18 Jun 2026 review."},{"date":"2026-04-19","verdict":"partial","note":"Body rewritten. Fabricated text-message transcript removed. Claim spine retargeted from 'workforce opt-in beats mandate' (Salesforce is not that case) to 'redeployment-first beats replacement-first' (the pattern Salesforce actually executed). Status moves from Partial to Up. Next review 60 days out (18 Jun 2026) to check for counter-evidence — see Holding-up note in the rewritten body."},{"date":"2026-04-28","verdict":"partial","note":"Article retracted 28 Apr 2026. Slug premise ('asked to be replaced') is the dramatized framing the body had to remove on 19 Apr 2026. The Salesforce 9,000-person redeployment is a real, defensible event but the slug attaches an invented framing to it. Body preserved in archived/. Google has independently rejected the URL. URL now redirects to /retractions/?retired=the-day-9000-people-asked-to-be-replaced. Claim withdrawn — status moves to Not holding, no further reviews scheduled."}],"primary_sources":[]},{"id":"AM-018","claim":"Agentic AI's compounding economics show up in back-office operations (AP, IT ticket triage, HR onboarding, procurement, close-cycle reconciliation), not in front-office customer-facing workflows. The 12% of deployments that clear 300%+ ROI cluster there for structural reasons: per-action savings × action frequency × task-specification tightness × existing process instrumentation.","article_url":"https://agentmodeai.com/the-executive-who-discovered-her-competitors-secret-weapon/","topic":null,"pub_date":"2025-07-19","last_reviewed":"2026-04-28","next_review":null,"verdict":"not_holding","verdict_history":[{"date":"2025-07-19","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-19","verdict":"partial","note":"Article predates the Holding-up standard. Retroactive claim assigned on 19 Apr 2026. Initial verdict 'Partial' — spine is defensible, per-claim numeric verification deferred to +60d review. Body not rewritten per AGENTMODE_PHASE2_BRIEF §114."},{"date":"2026-04-19","verdict":"partial","note":"Anchor verification complete (see audit/ANCHOR_VERIFICATION_2026-04-19.md). 'Sarah Chen' and the 2 AM Munich-hotel scenario are fully fabricated — the article's narrative protagonist does not correspond to any real executive. The underlying framework (back-office cost compounding faster than front-office wins; per-action delta × frequency) IS defensible against McKinsey + Futurum operational-AI-ROI data. Rewrite required before the article can move to Holding."},{"date":"2026-04-19","verdict":"partial","note":"Body rewritten. Fabricated 'Sarah Chen' narrative frame removed entirely. Claim spine sharpened: original was 'back-office cost compounding faster than front-office'; new version adds the structural explanation (per-action × frequency × task-specification × measurement instrumentation) and specific 2026 benchmark anchors (Stanford DEL 12%/88%, Gartner 28%, Futurum 71% vs 40%). Status moves from Partial to Up. Cross-links to AM-020 (TCO), AM-021 (measurement discipline), AM-022 (bimodal ROI) explicitly drawn in the body. Next review 18 Jun 2026."},{"date":"2026-04-28","verdict":"partial","note":"Article retracted 28 Apr 2026. Slug structure (fictional protagonist, 'discovered her competitors' secret weapon') is the fabricated narrative frame the body had to remove on 19 Apr 2026. Body rewritten with Stanford DEL / McKinsey / Futurum sourcing (preserved in archived/) but the slug is the structural problem. URL now redirects to /retractions/?retired=the-executive-who-discovered-her-competitors-secret-weapon. Claim withdrawn — status moves to Not holding, no further reviews scheduled."}],"primary_sources":[]},{"id":"AM-019","claim":"Manufacturing deployments hitting the 30% unplanned-downtime-reduction benchmark share one architectural pattern — the agent writes its actions into the plant's existing MES/CMMS audit trail rather than a parallel log. Parallel-log deployments underperform by a factor of 2-3.","article_url":"https://agentmodeai.com/manufacturing-4-0-how-multi-agent-systems-reduce-downtime-by-30/","topic":"agentic-ai-governance","pub_date":"2025-08-01","last_reviewed":"2026-04-19","next_review":"2026-06-18","verdict":"holding","verdict_history":[{"date":"2025-08-01","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-19","verdict":"partial","note":"Body rewritten. Original headline number (30% downtime reduction) survives against current case-study data. New analytical spine: the audit-trail architecture separates wins from stalls. Status moved from rewrite-in-progress Partial placeholder to Up. Next review 60 days out because architectural claims age slower than pricing claims."}],"primary_sources":[]},{"id":"AM-020","claim":"The 40-60% TCO underestimate on enterprise agentic-AI deployments is not a cost-visibility failure — it is a cross-departmental cost-attribution failure. Integration, tokens, maintenance, supervision, and compliance costs land on IT, HR, and Legal budgets that do not reconcile in most organisations, so the CFO sees the bill late and partial.","article_url":"https://agentmodeai.com/the-hidden-costs-of-agentic-ai-a-cfos-guide-to-true-tco-and-roi-modeling/","topic":"enterprise-ai-cost","pub_date":"2025-07-31","last_reviewed":"2026-04-19","next_review":"2026-06-18","verdict":"holding","verdict_history":[{"date":"2025-07-31","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-19","verdict":"partial","note":"Body rewritten from WP-era slop. Status moves from rewrite-in-progress placeholder to Up. New analytical spine: the TCO underestimate is cross-departmental cost-attribution failure, not hidden costs. Five cost categories named with budget owners. 60-day review cadence."}],"primary_sources":[]},{"id":"AM-021","claim":"The 87% vs 27% success-rate gap between Six-Sigma and non-Six-Sigma organisations on agentic-AI deployments reflects pre-existing measurement discipline, not the DMAIC methodology itself. Agents require a clean baseline, defect definition, documented root-cause analysis, and a change-management gate — four conditions that ISO 9001, ITIL, SRE, or HACCP practices produce just as reliably.","article_url":"https://agentmodeai.com/dmaic-for-agentic-ai-deployment/","topic":"agentic-ai-governance","pub_date":"2025-08-16","last_reviewed":"2026-04-19","next_review":"2026-06-18","verdict":"holding","verdict_history":[{"date":"2025-08-16","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-19","verdict":"partial","note":"Body rewritten from WP-era slop. Status moves from rewrite-in-progress placeholder to Up. New thesis: the causation runs the opposite direction from the vendor narrative — the measurement discipline was the prerequisite, the methodology name doesn't matter. 60-day review."},{"date":"2026-04-28","verdict":"partial","note":"Slug migration to §6a-compliant URL: from-dmaic-to-ai-agents-how-traditional-optimization-methods-accelerate-agentic-ai-success → dmaic-for-agentic-ai-deployment. Body unchanged from 19 Apr rewrite, only the URL changed. Old slug 308-redirects to new. Reason: the long descriptive slug carried §6a-grade friction (88+ chars, vendor-cliche framing) and Google's quality algorithm had flagged the original URL as low-quality (per the 28 Apr 2026 GSC drilldown showing it in the 'Crawled - currently not indexed' bucket). The clean slug preserves the analytical content while removing the URL-level quality penalty."}],"primary_sources":[]},{"id":"AM-022","claim":"The 171% average ROI on enterprise agentic-AI deployments is the mean of a bimodal distribution — roughly 12% of deployments clear 300%+ and 88% sit at or below break-even. The single factor distinguishing the clusters is not a multi-pattern framework; it is whether business-line (not IT) ownership held the kill-switch and accountability before the deployment shipped.","article_url":"https://agentmodeai.com/the-agentic-ai-success-formula-7-proven-patterns-driving-171-roi-in-enterprise-deployments/","topic":null,"pub_date":"2025-08-06","last_reviewed":"2026-04-28","next_review":null,"verdict":"not_holding","verdict_history":[{"date":"2025-08-06","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-19","verdict":"partial","note":"Body rewritten from WP-era slop (7-patterns vendor framework with fabricated case studies). New thesis: bimodal distribution, not normal — the 171% average describes no specific deployment. Business-line kill-switch ownership is the single distinguishing factor. Cross-links to AM-020 + AM-021 on the shared organisational-precondition thread."},{"date":"2026-04-28","verdict":"partial","note":"Article retracted 28 Apr 2026. Slug carries '171% ROI' as a category average and a '7 proven patterns' framework that the body had to disown — the rewritten body explicitly argues 171% is the mean of a bimodal distribution, not a benchmark. Body rewritten 19 Apr 2026 (preserved in archived/) but the slug contradicts the rewritten thesis and Google has rejected the URL. URL now redirects to /retractions/?retired=the-agentic-ai-success-formula-7-proven-patterns-driving-171-roi-in-enterprise-deployments. Claim withdrawn — status moves to Not holding, no further reviews scheduled."}],"primary_sources":[]},{"id":"AM-023","claim":"The 10 Apr 2026 Google AI Mode rollout to eight markets is the first vertical (restaurant booking) where agentic search reduces named SaaS aggregators (OpenTable, TheFork, ResDiary and five others) to API backends rather than destinations. The template applies to every enterprise-relevant aggregation vertical — business travel, expense management, procurement, ATS, HR service delivery — and incumbents in those verticals have 18-24 months to pick API-backend or destination positioning before agentic search forces the choice.","article_url":"https://agentmodeai.com/google-ai-mode-restaurant-booking-the-50-billion-business-revolution-every-ceo-must-understand-2025/","topic":"enterprise-ai-cost","pub_date":"2025-08-23","last_reviewed":"2026-04-19","next_review":"2026-06-18","verdict":"holding","verdict_history":[{"date":"2025-08-23","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-19","verdict":"partial","note":"Body rewritten from WP-era slop (the '$50 Billion Revolution' headline and 'act within 90 days' crisis-FOMO framing were both fabrications). New thesis: restaurant booking is a template, not the story. Named 5 enterprise-relevant aggregation verticals (business travel, expense, procurement, ATS, HR service) and the API-backend-vs-destination choice incumbents face. Next review in 60 days."}],"primary_sources":[]},{"id":"AM-024","claim":"Enterprise-AI decisions in 2026 are made on a citation chain nobody in the chain verifies. The infrastructure gap CIOs face is a verification layer for the claims their procurement runs on — not an information gap. The 88% failure rate in enterprise agentic AI is the predictable output of decision-making on unverified citations, not a capability problem.","article_url":"https://agentmodeai.com/the-unverified-citation-chain-where-enterprise-ai-decisions-actually-come-from/","topic":"agentic-ai-governance","pub_date":"2026-04-20","last_reviewed":"2026-04-20","next_review":"2026-06-19","verdict":"holding","verdict_history":[{"date":"2026-04-20","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-025","claim":"Enterprise agentic AI governance in 2026 fails at the operational layer even when it passes at the compliance layer. Boards receive EU-AI-Act-mapped compliance decks while the agentic deployments actually shipping out of IT ops have no measurable overlap with that deck. Durability requires six instrumented dimensions scored 0–100 (GAUGE framework) with a 90-day setup cadence and a 12-month trajectory target — not a compliance matrix.","article_url":"https://agentmodeai.com/the-enterprise-agentic-ai-governance-playbook-2026/","topic":"agentic-ai-governance","pub_date":"2026-04-24","last_reviewed":"2026-04-24","next_review":"2026-06-23","verdict":"holding","verdict_history":[{"date":"2026-04-24","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-026","claim":"Generic enterprise SaaS RFPs systematically underweight six agent-specific governance dimensions (governance maturity, threat model, ROI evidence, change management, vendor lock-in, compliance posture). A 60-question RFP layer mapped to the GAUGE framework materially changes vendor selection outcomes by disqualifying vendors whose operational governance will not survive the 18-month enterprise review cycle.","article_url":"https://agentmodeai.com/the-enterprise-agentic-ai-rfp-60-questions/","topic":"agent-procurement","pub_date":"2026-04-24","last_reviewed":"2026-04-24","next_review":"2026-06-23","verdict":"holding","verdict_history":[{"date":"2026-04-24","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-027","claim":"A durable enterprise agentic AI business case requires three specific documents — a TCO model with ten named cost categories (not vendor-supplied line items), an ROI model with a pre-deployment measured baseline and an independent validation round, and a three-scenario risk-adjusted NPV. The single-scenario vendor-framed business cases that dominate 2026 enterprise AI investment committees are the predictable root of the 40%+ projected agentic AI project cancellation rate.","article_url":"https://agentmodeai.com/the-cfos-agentic-ai-business-case-tco-and-roi/","topic":"enterprise-ai-cost","pub_date":"2026-04-24","last_reviewed":"2026-04-24","next_review":"2026-06-23","verdict":"holding","verdict_history":[{"date":"2026-04-24","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-028","claim":"Partner — co-development with a vendor on a structured non-standard engagement — is structurally under-chosen in enterprise agentic AI procurement in 2026. Procurement committees have templates for build and buy but none for partner, so the third path does not get evaluated on an equal footing. The vendor-lock-in and change-management dimensions of the GAUGE framework usually favour partner when it is honestly evaluated, not buy or build.","article_url":"https://agentmodeai.com/build-vs-buy-vs-partner-for-enterprise-agentic-ai-2026/","topic":"enterprise-ai-cost","pub_date":"2026-04-24","last_reviewed":"2026-04-24","next_review":"2026-06-23","verdict":"holding","verdict_history":[{"date":"2026-04-24","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-029","claim":"The 12/88 bimodal distribution in enterprise agentic AI ROI realisation (Stanford DEL 2026 + cross-validated by Gartner, McKinsey, CMU) is a governance-discipline outcome, not a model-capability outcome. The 12% instrument the six GAUGE dimensions on a 90-day review rhythm; the 88% treat governance as a deliverable to the audit committee. Capability gap (CMU's 30.3% best-in-class task completion) constrains what is possible, not what separates the 12% from the 88%.","article_url":"https://agentmodeai.com/why-88-percent-of-agentic-ai-deployments-fail/","topic":"enterprise-ai-cost","pub_date":"2026-04-24","last_reviewed":"2026-04-24","next_review":"2026-06-23","verdict":"holding","verdict_history":[{"date":"2026-04-24","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-030","claim":"The McKinsey State of AI 2025 figure (23% of enterprises scaling an agentic AI system, 39% still experimenting) is an operational-preconditions outcome, not a technical-readiness outcome. Four preconditions (agent registry, measured pre-deployment baseline, differentiated change-management playbook for adjacent units, cross-agent threat model at scale) separate pilots that cross into production from pilots that stall. The 6% AI-high-performer segment is the subset of the 23% scaling with additional measurement discipline that makes ROI audit-survivable.","article_url":"https://agentmodeai.com/the-mckinsey-23-percent-agentic-ai-scaling-gap/","topic":"enterprise-ai-cost","pub_date":"2026-04-24","last_reviewed":"2026-04-24","next_review":"2026-06-23","verdict":"holding","verdict_history":[{"date":"2026-04-24","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-031","claim":"The CMU TheAgentCompany 2026 benchmark figure (30.3% task completion for best-in-class frontier model, up from 24% in 2024) is the current capability constraint for enterprise agentic AI. Capability trajectory projects to ~40% by late 2027, which does not cross the 95% production-readiness threshold within the 3-year TCO horizon enterprise business cases operate against. The Stanford DEL 12% durable cohort operates within the 30.3% (narrow scope + human-in-the-loop + GAUGE-dimensional governance discipline), not around it. Capability is not the variable that separates the 12% from the 88%.","article_url":"https://agentmodeai.com/the-cmu-30-percent-agent-capability-gap/","topic":"enterprise-ai-cost","pub_date":"2026-04-24","last_reviewed":"2026-04-24","next_review":"2026-06-23","verdict":"holding","verdict_history":[{"date":"2026-04-24","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-032","claim":"EU financial-services agentic AI deployments operate under a compounded five-framework obligation surface (DORA, NIS2, MiFID II, EU AI Act, GDPR) that sits on top of general AI governance. Liability does not transfer to the vendor contractually regardless of SLA language — MiFID II conduct rules, EU AI Act deployer obligations, and DORA third-party-risk provisions place customer-facing and regulator-facing liability on the deploying financial institution. Compliance-posture and vendor-lock-in are the dominant GAUGE dimensions for the sector, scoring 15-25 points lower than cross-industry averages on first pass.","article_url":"https://agentmodeai.com/agentic-ai-in-financial-services-compliance-and-liability/","topic":"agentic-ai-governance","pub_date":"2026-04-24","last_reviewed":"2026-04-24","next_review":"2026-06-23","verdict":"holding","verdict_history":[{"date":"2026-04-24","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-033","claim":"The McKinsey 17%-EBIT-attributable-to-genAI figure, the most-cited single statistic in 2026 enterprise agentic AI procurement decisions, is a self-reported attribution from McKinsey's State of AI 2025 survey of approximately 1,491 respondents. The way it is typically read in CIO decks, as evidence that 17% of enterprises have produced 5% or more of EBIT from genAI, materially overstates what the survey supports. The figure documents 17% of survey respondents asserting that level of attribution, not 17% of enterprises producing it under audited measurement.","article_url":"https://agentmodeai.com/the-mckinsey-17-percent-ebit-claim/","topic":"enterprise-ai-cost","pub_date":"2026-04-25","last_reviewed":"2026-04-25","next_review":"2026-06-24","verdict":"holding","verdict_history":[{"date":"2026-04-25","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-034","claim":"AI assistants and AI agents are not the same product class. An AI assistant is a productivity-augmentation tool that suggests; an AI agent is an automation-execution system that acts on a downstream surface (tools, APIs, write-paths). Conflating them in 2026 enterprise procurement produces the most common single category mistake — buying an assistant under the assumption it is an agent, or buying an agent and governing it as if it were an assistant. The risk profile, contract structure, audit obligation, and TCO model differ categorically.","article_url":"https://agentmodeai.com/ai-assistant-vs-ai-agent/","topic":"agent-procurement","pub_date":"2026-04-25","last_reviewed":"2026-04-25","next_review":"2026-06-24","verdict":"holding","verdict_history":[{"date":"2026-04-25","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-035","claim":"The EU AI Act enforcement deadline of 2 August 2026 applies high-risk-system obligations under Articles 9 through 49 to most enterprise agentic AI deployments operating in EU jurisdiction or providing services to EU nationals — not only to deployments explicitly classified within the Annex III high-risk categories. The compliance gap most enterprises face is structural: the Act requires evidence-of-action production (logs, oversight records, post-market monitoring, incident reports) that most agentic deployments do not generate by default. Building the evidence layer post-hoc, after a regulator request, is the failure mode.","article_url":"https://agentmodeai.com/eu-ai-act-agentic-ai-compliance/","topic":"agentic-ai-governance","pub_date":"2026-04-25","last_reviewed":"2026-04-25","next_review":"2026-06-24","verdict":"holding","verdict_history":[{"date":"2026-04-25","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-036","claim":"Enterprise shadow AI in 2026 is structurally different from enterprise shadow AI in 2024. The 2024 framing assumed unsanctioned tool adoption — workers pasting confidential data into consumer ChatGPT or installing browser extensions outside IT review. The 2026 reality is that the larger blast radius is agentic capability silently activating inside already-approved tools, often through configuration changes (Custom GPT actions, Copilot custom agents, MCP server connections from approved IDEs) that the original procurement approval did not anticipate. Discovery has to look at capability state, not vendor identity. Most enterprise shadow-AI inventories built against the 2024 framing miss 50 to 80% of the actual exposure surface.","article_url":"https://agentmodeai.com/shadow-ai-discovery-playbook/","topic":"shadow-ai-discovery","pub_date":"2026-04-25","last_reviewed":"2026-04-25","next_review":"2026-06-24","verdict":"holding","verdict_history":[{"date":"2026-04-25","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-037","claim":"AI agents are structurally different from earlier classes of non-human identity (service accounts, API keys, machine certificates, bot identities), and the IAM platforms most enterprises run in 2026 cannot represent them adequately because those platforms authorise on principal identity rather than on per-action behavioural context. The 92% of enterprises that report low IAM confidence for agentic AI are not configured wrong; they are running an identity model with one structural axis where the agentic deployment requires four (identity, behaviour, context, revocation). The remediation is a four-layer extension on top of existing IAM, not a rip-and-replace migration. Most enterprises can ship the augmentation in 8 to 12 weeks of engineering.","article_url":"https://agentmodeai.com/non-human-identity-ai-agents/","topic":"non-human-identity","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-038","claim":"Model Context Protocol (MCP) reached enterprise procurement gravity in 18 months, faster than typical interoperability standards. The 10,000+ active public MCP servers, adoption by ChatGPT, Cursor, Gemini, Microsoft Copilot, and VS Code, and the December 2025 Linux Foundation donation made MCP a tooling-layer choice that ripples through every adjacent agentic-AI procurement decision: which agents connect to which enterprise systems, which audit boundaries hold, which vendor lock-in patterns activate. The actual procurement decision enterprise IT faces is not whether to adopt MCP (the question is moot once any approved tool ships MCP support); it is the scope-and-governance decision: which MCP servers the enterprise allows agents to connect to, what scopes those connections grant, and how cross-agent delegation through MCP is monitored. Treating MCP as a binary adoption question rather than a scope-and-governance question is the most common enterprise procurement mistake on this surface in 2026.","article_url":"https://agentmodeai.com/mcp-enterprise-agent-tooling/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-039","claim":"The 2026 enterprise agentic AI vendor comparison reduces to four credible platform plays (Anthropic, OpenAI, Google, Microsoft), and the procurement decision between them is no longer primarily about model capability. The model layer has converged to comparable parity for most enterprise use cases. The procurement decision in 2026 is on three other axes: pricing model (Anthropic Managed Agents at 8 cents per session-hour plus tokens versus OpenAI Agents SDK at no first-party runtime fee versus Microsoft and Google's vertically-integrated platform pricing), governance and BAA posture (Anthropic's three-cloud BAA position is structurally distinct), and ecosystem distribution (Microsoft's Office plus Azure footprint has no near peer; Google's vertical integration on Workspace and Cloud is second). Treating this as a model-quality bake-off is the most common 2026 procurement mistake and produces decisions that age badly within the first 12 months.","article_url":"https://agentmodeai.com/enterprise-ai-agent-vendor-comparison/","topic":"agent-procurement","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-040","claim":"Enterprise agentic AI in 2026 is in its first year of operational consequence rather than its first year of capability. The deployment record across multiple independent datasets shows a stable bimodal distribution (a small high-performing tail clearing 300%+ ROI and a much larger struggling body at or below break-even), four credible platform plays converging at the vendor layer, a structurally inadequate IAM posture across 92% of enterprises, and a 14-week runway to the EU AI Act August 2026 enforcement window. The aggregate signal is that the year's defining variable is deployment discipline, not model capability or vendor selection. The 6% AI-high-performer segment and the 12% Stanford DEL high-ROI cohort instrument six specific governance dimensions on a 90-day review cadence; the remaining 88-94% mostly do not.","article_url":"https://agentmodeai.com/state-of-enterprise-agentic-ai/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-041","claim":"The 2026 enterprise agentic AI procurement playbook resolves to a six-stage sequence that integrates the build-vs-buy-vs-partner decision, the 60-question agentic AI RFP, the GAUGE governance scoring, the four-vendor comparison, and the EU AI Act compliance scaffolding into one operational track. Most enterprises in 2026 run these as separate work streams owned by separate functions, which produces structurally inconsistent procurement records and substantial duplicate effort. The integrated six-stage track ships in 8 to 10 weeks for standard environments and produces an audit-defensible per-deployment procurement artifact that satisfies the EU AI Act Article 9 risk-management system requirement by construction.","article_url":"https://agentmodeai.com/enterprise-agentic-ai-procurement-playbook/","topic":"agent-procurement","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-042","claim":"The 6% AI-high-performer cohort identified by McKinsey and the 12% high-ROI cohort identified by the Stanford Digital Economy Lab share ten measurable governance practices that an enterprise can audit in under 60 minutes. An enterprise answering YES to 8 or more of the 10 diagnostic questions has the operating profile of the high-performing segment. An enterprise answering YES to 4 or fewer has the operating profile of the 88-94% struggling cohort and is unlikely to clear break-even on agentic AI deployment without a posture rebuild. The diagnostic audits posture, not outcomes; it identifies where governance investment is needed before the next deployment commitment, not whether a specific deployment will succeed.","article_url":"https://agentmodeai.com/agentic-ai-readiness-diagnostic/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-043","claim":"The OWASP Agentic Security Initiative's threat taxonomy for agentic AI (memory poisoning, tool misuse, privilege compromise, resource overload, cascading hallucination, intent breaking, misaligned and deceptive behaviour, repudiation and untraceability, identity spoofing, overwhelming human-in-the-loop) maps cleanly onto seven specific enterprise controls: scoped non-human identity, action-class approval gates, decision audit logging at Article 12 evidence quality, MTTD-for-Agents layered detection, deployment-tier resource quotas, behavioural drift monitoring, and HITL throughput limits. An enterprise that operates these seven controls covers all ten OWASP threat classes; an enterprise missing more than two of the controls has structural exposure to at least four of the threat classes.","article_url":"https://agentmodeai.com/owasp-agentic-ai-top-10-walkthrough/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-07-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-044","claim":"Six well-documented public agentic AI deployment failures from 2024-2025 (Air Canada bereavement-refund chatbot, NYC MyCity small-business chatbot, Replit production-database wipe, Cursor unauthorised code deletion, Klarna customer-service reversal, DPD chatbot escalation incident) cluster into three structural failure modes: (1) the agent acts as a binding agent of the enterprise without disclosure or approval, (2) the agent operates with permissions the deployment never authorised, (3) the agent's economic case requires a service quality the deployment cannot sustain. Each failure mode maps to a specific control from the seven-control surface; all six failures would have been mitigated by controls already specified in the OWASP Agentic AI Top 10 enterprise walkthrough. The pattern is consistent enough that an enterprise can use the cases as a procurement filter: any vendor unable to point to its specific control posture against each of the three failure modes is not procurement-ready.","article_url":"https://agentmodeai.com/agentic-ai-failure-case-studies/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-07-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-045","claim":"EchoLeak (CVE-2025-32711, disclosed by Aim Security in June 2025 against Microsoft 365 Copilot) is the canonical example of a class of attacks rather than a single vulnerability: cross-agent prompt injection in which a malicious payload travels through ordinary content channels (an email, a shared document, a calendar invite, a tool response) into one or more agents' context windows, where it manipulates the agents into actions the deploying enterprise did not authorise, with no user interaction required. The attack class is structurally inherent to any architecture in which an LLM-based agent ingests untrusted content and has tool surfaces capable of exfiltration or action; closing the class requires architectural separation between content-ingest and tool-execution privileges, not point-fixes against specific exploit chains. Enterprises in 2026 operating multiple agents that share context, share memory, or hand off tasks to each other are structurally exposed to the EchoLeak class until the architectural separation is implemented.","article_url":"https://agentmodeai.com/echoleak-cross-agent-prompt-injection/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-046","claim":"EU AI Act Article 12 (record-keeping for high-risk AI systems) and Article 19 (record retention by providers) are operationalised for agentic AI by a 14-field audit-evidence template that captures every agent decision in a regulator-queryable form: deployment ID, agent identity, session ID, ISO timestamp, user prompt, retrieved context with provenance, model output, planned action, action class, approval reference, executed action, tool-call audit chain, output disclosure surface, and policy version. Logs retained for the regulatory minimum (typically 6 months for the EU AI Act baseline, 5 to 7 years for sector-specific overlays like HIPAA and SOX) in a queryable format that supports under-4-business-hour evidence assembly. An enterprise that captures the 14 fields, retains them for the maximum applicable period, and instruments the queryable export has substantially completed Article 12 compliance for the agent layer; the residual work is integrating the agent log stream with the broader audit substrate.","article_url":"https://agentmodeai.com/eu-ai-act-article-12-audit-evidence/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-047","claim":"The Head of AI Governance role (variant titles: Chief AI Officer, VP AI Strategy, Director of Responsible AI) is now a named operating role in 60% of Fortune 100 enterprises per Forrester's 2026 Enterprise AI Predictions, and is the strongest single predictor of an enterprise's score on Q10 of the readiness diagnostic. The role's effective shape converges on six accountabilities: cross-functional governance ownership, EU AI Act compliance posture, vendor procurement gate-keeping, deployment kill-criterion enforcement, audit-evidence substrate ownership, and internal upskilling. The role reports to the executive committee (CEO direct or CFO/COO) rather than to IT, security, or legal, because matrixed reporting into existing functions reproduces the matrixed-shared-accountability failure pattern. Compensation in 2026 ranges from $250-450K base for the Director tier, $400-700K for VP tier, and $600K-$1.2M total comp at the C-level, with significant equity components in growth-stage and tech enterprises.","article_url":"https://agentmodeai.com/head-of-ai-governance-role/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-048","claim":"The NIST AI Risk Management Framework (AI RMF 1.0, published January 2023, with the Generative AI Profile published July 2024) maps onto enterprise agentic AI deployment work across its four functions (Govern, Map, Measure, Manage) using the same artefacts an enterprise produces for EU AI Act Article 9. Specifically: NIST Govern maps to the Head of AI Governance role and the AI governance committee; NIST Map maps to the deployment inventory and the OWASP Agentic Top 10 walkthrough; NIST Measure maps to the 14-field Article 12 audit substrate plus the GAUGE governance dimensions; NIST Manage maps to the kill-criterion enforcement and the seven-control surface. An enterprise that has the EU AI Act preparation track running has substantially completed NIST AI RMF coverage and can document the mapping as a single cross-reference matrix. The reverse mapping (NIST → EU AI Act) requires more work because NIST is voluntary in posture and the EU AI Act is operational; an enterprise that started with NIST as the framework needs to extend audit substrate granularity and add the Article 73 incident-reporting workflow.","article_url":"https://agentmodeai.com/nist-ai-rmf-agentic-ai-mapping/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-07-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-049","claim":"Enterprise multi-agent architectures resolve to three orchestration patterns (hierarchical, peer-to-peer, broker-mediated) with materially different governance properties: hierarchical concentrates accountability at the orchestrator and is the easiest to audit but the most exposed to orchestrator-compromise; peer-to-peer distributes accountability and is the most resilient to single-agent failure but the hardest to audit; broker-mediated centralises the inter-agent communication path and is the most defensible against the cross-agent prompt-injection class. The choice of pattern is not a free architectural decision in 2026 because the EU AI Act's Article 9 risk-management requirements and the OWASP Agentic AI threat surface impose specific control obligations on each pattern. An enterprise should default to broker-mediated for new deployments above the high-risk threshold; hierarchical is acceptable for low-risk and contained deployments; peer-to-peer should be avoided in production agentic AI in 2026 unless the audit substrate is materially stronger than vendor-native baseline.","article_url":"https://agentmodeai.com/multi-agent-architecture-playbook/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-07-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-050","claim":"The A2A (Agent2Agent) protocol announced by Google Cloud in April 2025 is the most credible 2026 candidate for an open standard for cross-vendor agent-to-agent interoperability, with backing from 50+ partners across the enterprise software ecosystem (Salesforce, SAP, ServiceNow, MongoDB, Atlassian, and others). The protocol layer covers what MCP (Model Context Protocol) does not: MCP is for agent-to-tool communication, A2A is for agent-to-agent communication. The two protocols are designed to be complementary rather than competing. A2A's adoption trajectory through 2026 will determine whether broker-mediated multi-agent patterns become the cross-vendor default; current trajectory points to deployment-grade stability in the second half of 2026, with widespread enterprise adoption following in 2027. Enterprises selecting agent platforms in 2026 should require A2A roadmap commitments from any vendor whose product will participate in cross-vendor agent workflows.","article_url":"https://agentmodeai.com/a2a-agent-to-agent-protocol/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-051","claim":"Enterprise AI governance organisational design resolves to three operating models in 2026: centralised (a single AI governance function owns policy, procurement, audit, and kill-criterion enforcement enterprise-wide), federated (each business unit owns its AI deployments with cross-unit coordination through a small central function), and hybrid (a central function owns regulatory and procurement; business units own deployment operations and ROI accountability). The dominant 2026 pattern in Fortune 500 enterprises is hybrid, because purely centralised models do not scale past 50-100 deployments and purely federated models cannot satisfy EU AI Act Article 9 risk-management documentation consistency. The right model for a given enterprise depends on three variables: deployment count, regulatory exposure, and the maturity of the existing risk-management organisation. The hybrid model is structurally superior to the alternatives once an enterprise crosses approximately 30 production deployments or operates in two or more EU AI Act high-risk Annex III categories.","article_url":"https://agentmodeai.com/centralized-vs-federated-ai-governance/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-07-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-052","claim":"Enterprise agentic AI vendor contracts in 2026 require eight specific exit-clause provisions that standard SaaS contract templates do not adequately cover: (1) full audit-log export with retention, (2) trained-state extraction or destruction guarantee, (3) prompt and configuration portability, (4) tool-and-MCP-connector reconfiguration support during transition, (5) named-individual handoff for in-flight deployments, (6) regulatory-evidence preservation through transition, (7) data-residency continuity, (8) liability-tail coverage for agent actions taken before the transition completes. An enterprise that signs an agentic AI contract without these eight provisions has effectively created a one-way procurement decision; the realistic cost of a forced transition without the provisions is materially higher than the contract value, which inverts the procurement leverage. The provisions add typically modest contract complexity but materially change the enterprise's negotiating posture and the vendor's incentive structure during the relationship.","article_url":"https://agentmodeai.com/ai-agent-contract-exit-clauses/","topic":"agent-procurement","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-07-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-053","claim":"HIPAA-compliant agentic AI deployment in U.S. healthcare in 2026 requires four conditions that materially constrain vendor selection and architectural design: (1) the vendor offers a BAA covering the specific agent workflow including any subprocessors and any tools the agent calls, (2) the agent's audit log structure satisfies HIPAA 164.312(b) audit controls AND the EU AI Act Article 12 14-field structure simultaneously, (3) PHI flows through agent tool calls are explicitly mapped and authorised under the HIPAA Privacy Rule's minimum necessary standard, (4) the agent's behavioural drift monitoring includes correctness against clinical-decision benchmarks, not just engagement or business-metric benchmarks. Anthropic's three-cloud BAA position (covering AWS, GCP, and Azure deployment surfaces) is structurally distinct in the 2026 vendor landscape and materially expands healthcare deployment options. The OCR's 340% spike in AI-related discrimination complaints (logged in 2025) makes audit-substrate readiness the highest-priority preparatory work for any healthcare AI deployment going into production in 2026.","article_url":"https://agentmodeai.com/hipaa-compliant-agentic-ai-healthcare/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-054","claim":"Public-sector agentic AI deployment in 2026 operates under five constraints that materially narrow the vendor and architectural options compared to private-sector deployment: (1) FedRAMP authorisation (Moderate or High depending on data sensitivity) is required for federal deployments and increasingly for state, (2) sovereign data residency requirements (data and model inference must remain within national or sub-national boundaries), (3) procurement transparency obligations (the deployment, the vendor, and the decision logic typically must be publicly disclosed), (4) explicit accountability under administrative law (decisions affecting individuals are subject to due-process and appeal frameworks that the agent must support), (5) FOIA-equivalent disclosure of audit logs to the public on request. Public-sector deployments cannot reasonably use peer-to-peer multi-agent patterns and cannot accept vendors without published government cloud SKUs; the realistic 2026 options are Microsoft Azure Government, AWS GovCloud-deployed Anthropic, Google Cloud Public Sector, and a small number of specialist government-AI vendors. The NYC MyCity case (claim AM-044) is the canonical 2026 public-sector failure illustrating what happens when the constraints are inadequately addressed.","article_url":"https://agentmodeai.com/public-sector-agentic-ai/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-07-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-055","claim":"Retail and logistics agentic AI deployments in 2026 cluster around five workflow patterns with substantially different governance properties: customer-service agents (the Klarna failure case applies directly, claim AM-044), inventory and demand-forecasting agents (operationally lower-risk but with material accuracy requirements), dynamic-pricing agents (carry antitrust exposure that is structurally distinct from other AI risks), supply-chain orchestration agents (multi-party data flows that complicate audit substrate ownership), and returns-and-fraud-detection agents (consumer-protection law exposure including disparate-impact claims). The dominant 2026 production pattern is augmentation rather than replacement of human operators; deployments framed as headcount-replacement have produced reversals at material rates (the Klarna pattern). Retailers and 3PLs (third-party logistics providers) operating across multiple jurisdictions face an additional layer of consumer-protection law fragmentation that the EU AI Act does not pre-empt and that materially affects the deployment scope.","article_url":"https://agentmodeai.com/retail-logistics-ai-agents/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-07-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-056","claim":"Enterprise AI agent ROI calculation in 2026 requires a structured eight-input model that captures the costs and benefits the standard SaaS-style ROI calculator misses: (1) per-session-hour or per-task model cost at the deployment's actual usage profile, (2) human-in-the-loop labour cost including approval-gate review time, (3) deployment-layer instrumentation cost (audit substrate, drift monitoring, MTTD detection), (4) regulatory compliance cost amortised across the deployment's revenue, (5) productivity uplift on existing human staff (the augmentation case), (6) avoided cost from reduced incident rate and reduced kill-criterion losses, (7) revenue impact net of service-quality regression risk, (8) the strategic-option value of the deployment's underlying capability. The calculation produces a 90-day ROI checkpoint figure, a 12-month payoff figure, and a kill-criterion threshold. The calculation also produces a sensitivity table showing which inputs drive the ROI most heavily; cost-side sensitivity is typically dominated by inputs 2 and 3, revenue-side by inputs 5 and 7. Most 2026 enterprise AI deployments evaluated against this model break even between months 9 and 18; deployments outside that range are either materially under-investing in instrumentation (faster apparent ROI) or are operating in unfavourable cost structures (longer payoff).","article_url":"https://agentmodeai.com/ai-agent-roi-calculator/","topic":"agent-procurement","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-07-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-057","claim":"The enterprise AI agent risk register for 2026 resolves to a 12-column template that captures every risk an enterprise must document under EU AI Act Article 9 and NIST AI RMF Manage function: risk ID, deployment ID, threat class (per OWASP Agentic AI Top 10), likelihood, impact, inherent risk score, control mapping (against the seven-control surface), residual risk score, named accountable individual, review cadence, status, last-reviewed date. The register is operated by the Head of AI Governance, reviewed monthly in the AI governance committee, and queryable in the under-4-business-hour Article 73 incident-response window. The 12-column template integrates the threat surface (OWASP Agentic AI Top 10, claim AM-043), the controls (seven-control surface, claim AM-043), the audit substrate (claim AM-046), and the kill-criterion enforcement (claim AM-047), into a single living artefact. An enterprise that operates the register seriously has substantially completed the Article 9 risk-management system documentation requirement; the register is the single artefact that resolves the cross-reference matrix between operational reality and regulatory framework.","article_url":"https://agentmodeai.com/ai-agent-risk-register-template/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-061","claim":"Production agentic-AI costs at scale routinely run multiples of POC projections, and a layered optimisation programme covering model tiering, vendor prompt caching, batch APIs, context-window discipline, and observability budgeting closes most of the gap.","article_url":"https://agentmodeai.com/the-2m-ai-bill-that-became-200k-the-enterprise-cost-optimization-playbook-for-production-ai-agents/","topic":"enterprise-ai-cost","pub_date":"2025-07-27","last_reviewed":"2026-04-28","next_review":"2026-06-27","verdict":"holding","verdict_history":[{"date":"2025-07-27","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-28","verdict":"partial","note":"Rewritten 27-28 Apr 2026 from 27 Jul 2025 WordPress-migrated original. Original used a fictional CTO scene (Marcus Chen, $4.2B logistics company, 9:47 AM Tuesday Seattle), fabricated case figures ($2.1M to $187K monthly, named-company before/after teardowns), fabricated expert quotes (Patricia Williams VP of Engineering at Walmart; David Park Principal at Goldman Sachs), and banned phrases (plot twist, the dirty secret, revolutionary, emoji subheads). Rewrite extracts the verifiable cost-driver categories with primary-source citations from Anthropic's published multi-agent token-ratio research, vendor prompt caching and batch-API pricing pages, McKinsey State of AI, Andreessen Horowitz on LLM inference economics, and Gartner's April 2026 I&O finding. Approved + published 28 Apr 2026."}],"primary_sources":[]},{"id":"AM-063","claim":"AI agents executing financial transactions need a four-control bundle (action-approval gates by blast radius, kill-switch protocols, decision-audit trails, per-action revocation); enterprises shipping agentic-AI without this bundle face CISO governance pressure they cannot satisfy under existing model-risk-management, FFIEC, and EU AI Act expectations.","article_url":"https://agentmodeai.com/your-ai-agents-just-approved-2-7m-in-vendor-payments-and-other-nightmares-keeping-cisos-awake/","topic":"agentic-ai-governance","pub_date":"2025-07-27","last_reviewed":"2026-04-28","next_review":"2026-06-27","verdict":"holding","verdict_history":[{"date":"2025-07-27","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-28","verdict":"partial","note":"Rewritten 27-28 Apr 2026 from 27 Jul 2025 WordPress-migrated original. Original used fictional Seattle CISO scene with fabricated $2.7M case, fabricated cohort scheduling, emoji subheads, and 'battle-tested' hype. Rewrite extracts the verifiable control-set framework with primary-source citations (NIST AI RMF, NIST AI 600-1 Generative AI Profile, FFIEC IT Examination Handbook, SR 11-7, OCC Bulletin 2011-12, ISACA AI Audit Toolkit, Cloud Security Alliance MAESTRO framework). Cross-links to the live AM-037 non-human-identity piece as the identity-layer companion. Approved + published 28 Apr 2026."}],"primary_sources":[]},{"id":"AM-100","claim":"AI-authored + human-signed publications produce more verifiable enterprise-AI commentary than human-only or anonymous-AI alternatives, when the AI authorship is paired with a public claim ledger and dated correction log.","article_url":"https://agentmodeai.com/ai-writes-about-ai-tracked-claims-case/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-07-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-101","claim":"Across the named analyst-publication comparable set (Stratechery, The Information, the Substack analyst stack, the Big-4 research blogs, Gartner, Forrester, IDC) as of late April 2026, none maintains a public claim ledger — a tracked register of every primary claim with scheduled reviews, dated verdicts, and a public correction log. The absence is structural, not accidental, and explains why none of the category produces the kind of audit-able commentary the Holding-up system makes possible.","article_url":"https://agentmodeai.com/why-this-publication-has-a-ledger/","topic":"agentic-ai-governance","pub_date":"2026-04-28","last_reviewed":"2026-04-28","next_review":"2026-07-27","verdict":"holding","verdict_history":[{"date":"2026-04-28","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-102","claim":"Among the comparable publications surveyed in AM-101 (Stratechery, The Information, the Substack analyst stack, the Big-4 research blogs, Gartner, Forrester, IDC) as of late April 2026, none uses the disclosed-AI-author + named-human-signatory + public-claim-ledger format. The combination is structurally rare and the rarity is what makes the format consequential, not the disclosed AI authorship alone.","article_url":"https://agentmodeai.com/the-ai-author-signature-decision/","topic":"agentic-ai-governance","pub_date":"2026-04-28","last_reviewed":"2026-04-28","next_review":"2026-07-27","verdict":"holding","verdict_history":[{"date":"2026-04-28","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-103","claim":"Across two of the three Q1 2026 ventures Peter built with Claude (agentmodeai, Rhino-basketball; DealVex pending git-versioning), rework rate measured as deletions / total git churn ranged from 8.1% to 13.5% over the 90-day window from 28 Jan to 28 Apr 2026. The data is meaningfully lower than typical solo-developer projects but substantially higher than the 'AI codes it correctly the first time' marketing narrative implies, supporting the thesis that AI-paired development requires explicit measurement, not assumed productivity.","article_url":"https://agentmodeai.com/learning-ai-by-doing-ai-the-data/","topic":"agentic-ai-governance","pub_date":"2026-04-28","last_reviewed":"2026-04-28","next_review":"2026-07-27","verdict":"holding","verdict_history":[{"date":"2026-04-28","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-104","claim":"Anthropic's withholding of Claude Mythos forces senior IT teams to advance their AI cyber-threat-model timeline by two to three years, and to rebuild three specific assumption sets — patch prioritization, third-party risk on AI infrastructure, and AI procurement diligence — inside Q2 2026.","article_url":"https://agentmodeai.com/claude-mythos-cio-risk-posture/","topic":"agentic-ai-governance","pub_date":"2026-04-27","last_reviewed":"2026-04-27","next_review":"2026-06-26","verdict":"holding","verdict_history":[{"date":"2026-04-27","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-105","claim":"Organizations that have not adopted an offensive-security operating mode (continuous attack-surface validation, AI-augmented internal vulnerability discovery, standing threat-hunting, deception, counter-AI controls) by Q4 2026 will show measurably wider mean-time-to-detect for AI-assisted attackers than peers that have, in industry-survey data published in late 2026 and early 2027.","article_url":"https://agentmodeai.com/offensive-security-cio-clockspeed/","topic":"agentic-ai-governance","pub_date":"2026-04-27","last_reviewed":"2026-04-27","next_review":"2026-07-26","verdict":"holding","verdict_history":[{"date":"2026-04-27","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-106","claim":"Loaded human FTE cost ($90K-$180K all-in for typical knowledge work) vs total agentic-AI operational cost (token plus orchestration plus integration plus observability plus human oversight) does not favour replacement at parity in 2026 for most roles; the math works for narrow, high-volume, low-judgment task categories and breaks down where regulatory accountability, customer trust, or judgment-under-ambiguity is load-bearing.","article_url":"https://agentmodeai.com/agentic-ai-vs-human-worker-cost-economics/","topic":"enterprise-ai-cost","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-28","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026. Initial verdict 'Partial' — spine is observable from current public deployment cost data and labour-displacement research, per-category quantitative bands tracked against next review cycle. REVIEW: Peter — please verify claim text + cited sources before removing rewriteInProgress flag."}],"primary_sources":[]},{"id":"AM-107","claim":"The 2026 insurance market does not yet offer agent-specific E&O policies in any mature form; existing cyber and tech-E&O policies were drafted against human-error and software-defect risk models that don't cleanly map to autonomous reasoning actors. Enterprises shipping agentic-AI face an underwriting gap: the cyber policy may not respond to a loss caused by an agent's reasoning step, and the professional-liability policy may exclude AI-generated outputs entirely. CIOs and CROs need to surface this gap with their broker before the loss event, not after.","article_url":"https://agentmodeai.com/agentic-ai-insurance-and-underwriting/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-28","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026 as a staged draft (rewriteInProgress: true). Status set to Partial because the underlying market is in a transitional phase and per-carrier wording specifics may shift inside the 60-day review window. REVIEW: Peter to verify (a) the Lloyd's Lab Cohort 12 dating and submissions detail, (b) the Munich Re aiSure agentic-deployment extension claim, (c) the NAIC Model Bulletin scope, (d) whether the AIG CyberEdge and Chubb Integrity+ AI endorsement language descriptions reflect the most recent product updates, and (e) the MGA list (Armilla, Vouch, Coalition, Relm) is currently in market with AI-liability paper before promoting from staged draft to published."}],"primary_sources":[]},{"id":"AM-108","claim":"Agentic-AI data-residency requirements are not cleanly inherited from existing GDPR cross-border transfer practice. Agent context windows, retrieval indexes, and reasoning traces all create new categories of personal-data processing that have to be located, documented, and (for high-risk Annex III deployments) data-resident inside the EEA before EU AI Act Article 16 enforcement opens on 2 August 2026. The deployment topology has to shift to single-region EEA-resident for high-risk systems; hub-and-spoke remains defensible for general-purpose deployments under documented GDPR Chapter V transfer mechanisms.","article_url":"https://agentmodeai.com/agentic-ai-data-residency-eu-ai-act/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-28","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026. Initial verdict 'Partial' — spine is anchored to the Act itself plus current vendor compliance pages, but the four-surface Article-mapping has not yet been tested against an enforced case (the August 2026 enforcement window opens inside the next review cycle). REVIEW: Peter — please verify claim text + Article references + vendor citations before removing rewriteInProgress flag."}],"primary_sources":[]},{"id":"AM-109","claim":"Enterprises focused on the headcount-reduction half of agentic-AI transformation are systematically under-budgeting the retraining cost for the residual workforce, and programmes that ship the cuts without simultaneously shipping the upskilling produce a 6-12 month productivity dip that erases the early ROI.","article_url":"https://agentmodeai.com/agentic-ai-retraining-gap-survivors/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-28","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026. Initial verdict 'Partial' — the productivity-dip duration is observable from current public workforce data but the 6-12 month band has not been tested against post-2026 enterprise case data yet. REVIEW: Peter — please verify claim text + cited sources before removing rewriteInProgress flag."}],"primary_sources":[]},{"id":"AM-110","claim":"Traditional SLAs (uptime, p95 latency, error rate) are structurally insufficient for autonomous agentic-AI; the four metrics that actually work are action-bounded availability, MTTD-for-Agents, output-distribution drift, and per-class action error budget, and vendors that cannot expose the telemetry these require are not yet production-ready against the 2026 enterprise procurement bar.","article_url":"https://agentmodeai.com/agentic-ai-sla-architecture/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-28","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026. Initial verdict 'Partial' — the four metrics are observable from current SRE/OTel practice but have not been tested as a procurement bar against 2026 vendor SLAs yet. REVIEW: Peter — please verify claim text + cited primary sources (especially the OpenTelemetry GenAI stable-promotion date and the Anthropic/MS Agent Framework reliability docs) before removing rewriteInProgress flag."}],"primary_sources":[]},{"id":"AM-111","claim":"The right enterprise playbook for an agent incident in 2026 has six steps that do not appear in any standard SRE handbook — action-class containment before root-cause analysis, reasoning-trace forensics, blast-radius reconstruction across downstream agents and systems, stakeholder notification with the specific failure mode named, regulatory exposure assessment for in-scope deployments, and selective re-enable with degraded-mode guardrails — and CIOs without this playbook will spend their first agent incident discovering it under crisis conditions.","article_url":"https://agentmodeai.com/agent-incident-response-playbook/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-28","verdict":"partial","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026. Initial verdict 'Partial' — six-step playbook is a synthesis from current SRE practice + AI-specific guidance and has not been tested against a major published agent-incident postmortem yet. REVIEW: Peter — please verify claim text + cited primary sources before removing rewriteInProgress flag."}],"primary_sources":[]},{"id":"AM-112","claim":"Healthcare agentic-AI sits across three regulatory regimes that do not compose cleanly — HIPAA on PHI handling and BAA topology, FDA software-as-medical-device guidance on clinical decision support and predetermined change control, and state medical/nursing board licensure rules placing the practitioner as the responsible party of record — and the five-control bundle of BAA-aware architecture, PCCP, clinical-judgement-of-record audit trail, on/off-switch with practitioner attribution, and breach-notification readiness is the minimum defensible architecture for any clinical agentic-AI deployment.","article_url":"https://agentmodeai.com/healthcare-agentic-ai-governance/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-28","verdict":"partial","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-113","claim":"Standard 2026 agentic-AI vendor MSAs contain six contract patterns that systematically transfer risk from vendor to enterprise customer in ways that do not appear in equivalent pre-AI enterprise software MSAs — model-version unilateral-change, training-data ambiguity on customer inputs, usage-cap auto-escalation, indemnification carve-outs for model output, data-residency commitments that don't bind sub-processors, and liability caps tied to fees-paid that don't scale with autonomous-action authority.","article_url":"https://agentmodeai.com/agentic-ai-vendor-contract-gotchas/","topic":"agent-procurement","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-28","verdict":"partial","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-114","claim":"Production agentic-AI in 2026 needs four observability layers — infrastructure, LLM-call, trace, and output — and most enterprise deployments instrument only the cheaper subset (Layers 1 and 2 plus partial Layer 3); the failure modes Layers 3 and 4 catch (multi-step reasoning failure and output-distribution drift) are the ones EU AI Act Article 9 and Article 17 evidence obligations from 2 Aug 2026 onward will require coverage of, and the four layers compose directly into the four AM-110 SLA metrics (action-bounded availability, MTTD-for-Agents, output-distribution drift, per-class action error budget).","article_url":"https://agentmodeai.com/agent-observability-stack-production/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-28","verdict":"partial","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026. Initial verdict 'Partial' — the four-layer model is observable from current 2026 tool categories and OpenTelemetry GenAI convergence, but the procurement-or-build cost bands are publication estimates and have not been tested across a representative sample of enterprise deployments. REVIEW: Peter — please verify (1) the OpenTelemetry GenAI stable-promotion date (13 Mar 2026) is consistent with what AM-110 cites; (2) the cost-band ranges in the §Share-thoughts template are defensible as our-estimate or need tightening; (3) Datadog AI Observability and New Relic AI Monitoring product names are current; (4) Arize Phoenix open-source/managed dual-form description is accurate; (5) the CNCF OpenTelemetry GenAI working-group framing matches the actual project structure before removing rewriteInProgress flag."}],"primary_sources":[]},{"id":"AM-115","claim":"Agent Mode AI publishes a public quarterly review of every claim it has made, with verdict before/after, named primary-source movement, and aggregate verdict-change rate across the corpus. The bulletin runs on a fixed quarterly cadence (end of Apr, Jul, Oct, Jan); the rhythm is the editorial discipline the niche has been missing.","article_url":"https://agentmodeai.com/q2-2026-claim-review-bulletin/","topic":null,"pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-07-30","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026 — the first Quarterly Claim Review Bulletin. The claim itself is recursive: it asserts that the bulletin will ship quarterly, and the next review (30 Jul 2026) tests whether the Q3 bulletin actually appeared. Status starts as 'up' because the claim is currently true (the Q2 bulletin shipped). The verdict at end of July 2026 will move to Holding, Partial (bulletin shipped but on a delayed cadence), or Not holding (no bulletin shipped). REVIEW: Peter — please verify claim text + cadence wording before removing rewriteInProgress flag."}],"primary_sources":[]},{"id":"AM-116","claim":"A class of derivative actions is forming in 2025-2026 around board failure to supervise AI deployments under the Caremark line, and D&O carriers are responding at renewal with explicit AI questionnaires and emerging exclusions, materially shifting director liability exposure that most boards have not yet read in their actual policy language.","article_url":"https://agentmodeai.com/directors-officers-insurance-ai-supervision-claim/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-30","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-117","claim":"AI Bill of Materials (AI-BOM) is moving from optional security artefact to enforceable procurement requirement in 2026, driven by EU AI Act Article 11 + Annex IV technical-documentation requirements (effective 2 August 2026) and the CycloneDX ML-BOM and SPDX 3.0 specifications. Enterprise SBOM programs need three specific extensions (generation path for AI components, AI-specific risk correlation feeds, procurement-side language for AI-BOM delivery).","article_url":"https://agentmodeai.com/ai-bill-of-materials-supply-chain-disclosure/","topic":"agent-procurement","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-30","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-118","claim":"As of April 2026 the largest sovereign-wealth and pension funds (NBIM, CalPERS, ABP, OTPP, USS) have published almost no formal AI position papers, despite trillion-dollar AI exposure across portfolios. The structural absence is the signal: AI is being rated by these investors but the rating criteria have not been formally codified, leaving public-company IR teams preparing engagement against expectations the investors have not yet written down.","article_url":"https://agentmodeai.com/pension-fund-sovereign-wealth-ai-policy-void/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-30","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-119","claim":"The 2026 cyber-insurance renewal tightening enterprises are experiencing is upstream-driven by reinsurance market repricing of catastrophic AI tail risk (Lloyd's of London, Munich Re, Swiss Re), not by primary-carrier loss data. The reinsurance signal travels via tighter treaty terms, AI-specific exclusions, and elevated retentions, with a 6-12 month lag to primary policies. Enterprise risk officers negotiating against the primary on AI terms have limited room because the carrier's own treaty caps what it can offer.","article_url":"https://agentmodeai.com/reinsurance-market-ai-tail-risk-pricing/","topic":"agent-procurement","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-30","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-120","claim":"AI agent deployments touching employee work in EU jurisdictions with co-determination law (Germany BetrVG §87, Netherlands WOR Art. 27, France CSE provisions) require works council consent before activation in 2026. Most US-headquartered AI vendors lack a customer-success workflow for this, producing a class of stalled rollouts that read as 'vendor delay' but are actually compliance gaps. Total EU-site timeline from selection to production is 6-9 months when handled well, 12-18 when consultation begins late.","article_url":"https://agentmodeai.com/works-council-ai-agent-deployment-eu/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-30","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-121","claim":"AI in IT operations in mid-2026 delivers measurable productivity gains (UK Government Digital Service trial: 26 minutes per user per day across 20,000 staff; BT pilot: 35% case-resolution-time reduction with named CIO on the record; ServiceNow's own help desk: 90% L1 deflection in vendor-internal optimal conditions) but the staff-reduction story is structurally smaller than vendor pitches suggest. Gartner finds only 11% of Fortune 500 companies have actually cut support headcount via AI; Forrester reports 55% of AI-attributed layoffs are regretted and roughly half are reversed; CRMArena-Pro shows multi-step agent reliability at ~35%. The cost saving lands first on the BPO/contractor line, second on contractor spend, and only slowly and controversially on direct headcount. Agentic L2/L3 remediation remains pilot-stage: per Gartner's October 2025 survey of 360 IT app leaders, only 15% are considering, piloting, or deploying fully autonomous agents, and Gartner predicts >40% of agentic AI projects will be cancelled by end-2027.","article_url":"https://agentmodeai.com/ai-it-operations-reality-check/","topic":"agentic-ai-implementation","pub_date":"2026-05-02","last_reviewed":"2026-05-02","next_review":"2026-07-03","verdict":"holding","verdict_history":[{"date":"2026-05-02","verdict":"holding","note":"Claim created at publish."},{"date":"2026-05-02","verdict":"partial","note":"Klarna walk-back primary-source upgrade — added Siemiatkowski verbatim quotes via Bloomberg-cited-by-Fortune (9 May 2025) and the Uber-style freelance hiring detail via Entrepreneur. Closes the highest-priority evidence gap from the source dossier."}],"primary_sources":[]},{"id":"AM-122","claim":"The four credible 2026 agent-evaluation platforms (DeepEval, Braintrust, LangSmith, Patronus AI) do not compete on capability rank; each fits a distinct deployment shape (engineering-led eval-as-code; SaaS-first eval-as-product; LangChain-stack-native bundled with observability; research-grade hallucination + simulation), and picking by capability matrix produces the wrong procurement outcome for most enterprises. The structurally load-bearing eval-vs-observability split (companion piece AM-123) compounds this: 'is the agent right' and 'what did the agent do' are different procurement decisions answered by different platforms.","article_url":"https://agentmodeai.com/agent-eval-frameworks-deepeval-braintrust-langsmith-patronus/","topic":"agent-procurement","pub_date":"2026-05-03","last_reviewed":"2026-05-03","next_review":"2026-07-02","verdict":"holding","verdict_history":[{"date":"2026-05-03","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-123","claim":"Evaluation answers 'is the agent right'; observability answers 'what did the agent do'. The four credible 2026 agent-observability platforms (Langfuse, Arize, Helicone, LangSmith) split cleanly on a single structural axis: open-source-first vs SaaS-first. Helicone has been in maintenance mode since 3 March 2026 (founders joined Mintlify) and should not be selected for greenfield 2026 deployments. Production deployments need both eval and observability; the procurement decisions are different and conflating them produces SLA architecture that fails its first incident.","article_url":"https://agentmodeai.com/agent-observability-langfuse-arize-helicone-langsmith/","topic":"agent-procurement","pub_date":"2026-05-03","last_reviewed":"2026-05-03","next_review":"2026-07-02","verdict":"holding","verdict_history":[{"date":"2026-05-03","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-124","claim":"Pharma and life sciences agentic AI in 2026 inherits five regulatory regimes simultaneously (21 CFR Part 11, GxP under GAMP 5 Second Edition, EMA Annex 11 in 2025-2026 revision, the EMA Reflection Paper on AI in the medicinal product lifecycle, and the EU AI Act). The audit substrate that satisfies any one regime does not by default satisfy the others. The 2026 procurement gap is treating the regimes as substitutable. Four conditions materially constrain compliant deployment (validated computerised system status under GAMP 5 plus CSA; 17-field audit trail covering Part 11 + Annex 11 + Article 12 simultaneously; ALCOA+ data integrity with contemporaneous, original, enduring records; EU AI Act high-risk-system registration with Article 11 technical file plus Article 16 post-market monitoring). Three vendor postures emerge in market (pre-validated Category 4 packaging; general-purpose platform plus customer-validated wrapper; open-source stack plus customer-engineered audit substrate).","article_url":"https://agentmodeai.com/pharma-life-sciences-agentic-ai-21-cfr-part-11/","topic":"agentic-ai-governance","pub_date":"2026-05-03","last_reviewed":"2026-05-03","next_review":"2026-08-01","verdict":"holding","verdict_history":[{"date":"2026-05-03","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-125","claim":"ITSM agent procurement in 2026 is not three independent vendors but two acquirer ecosystems plus one product line at the intersection: ServiceNow (acquirer; Now Assist native plus Moveworks acquired 15 Dec 2025 for $2.4B closed consideration vs the announced $2.85B) and Automation Anywhere (acquired Aisera Nov 2025). The procurement decision in 2026 is shaped less by the feature matrix than by the post-acquisition reality. Picking by feature matrix without mapping the acquirer's strategic interest produces the wrong answer. ServiceNow Now Assist is the bolt-on for organisations already on ServiceNow; Moveworks is the omnichannel layer (still standalone branding, ServiceNow-owned); Aisera is the auto-resolution play that competes on closure rate, now under Automation Anywhere's portfolio.","article_url":"https://agentmodeai.com/servicenow-now-assist-vs-moveworks-vs-aisera/","topic":null,"pub_date":"2026-05-03","last_reviewed":"2026-05-03","next_review":"2026-07-02","verdict":"holding","verdict_history":[{"date":"2026-05-03","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-126","claim":"The OWASP Agentic AI Top 10 names what to defend against; it does not say how to test that the defences work. The 2026 enterprise red-team for agentic systems is a distinct discipline from generalised pen-testing, with its own methodology (four disciplines: prompt injection, tool misuse, context-window attacks, multi-turn objective drift), tooling stack (PyRIT v0.13.0, Garak, custom harnesses, MITRE ATLAS for structured threat-modelling vocabulary), evidence model (six-section report including ATLAS technique mapping plus residual-risk plus EU AI Act Article 12 substrate alignment plus Article 16 post-market monitoring recommendations), and procurement decisions (in-house vs specialist-vendor vs hybrid). Most enterprises run the wrong test (generalised application pen-test) and pass it; the passing report is the procurement evidence that produces false confidence.","article_url":"https://agentmodeai.com/agent-red-teaming-owasp-companion/","topic":"agentic-ai-governance","pub_date":"2026-05-03","last_reviewed":"2026-05-03","next_review":"2026-07-02","verdict":"holding","verdict_history":[{"date":"2026-05-03","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-127","claim":"Of the eleven claims this publication has published against the 2 August 2026 EU AI Act enforcement deadline, the four operational-evidence claims (AM-108 data residency, AM-046 audit-evidence under four hours, AM-117 AI-BOM procurement, AM-120 works council workflow) carry materially higher risk of moving from Holding to Partial in Q3 2026 than the two governance-process claims (AM-047 Head of AI Governance role, AM-051 centralised-vs-federated). Materially higher risk is defined as: at least three of the four operational-evidence claims will be downgraded to Partial or Not holding by 1 October 2026, while at least one of the two governance-process claims will remain Holding.","article_url":"https://agentmodeai.com/90-days-eu-ai-act-enforcement-what-corpus-says/","topic":"agentic-ai-governance","pub_date":"2026-05-03","last_reviewed":"2026-05-03","next_review":"2026-07-01","verdict":"holding","verdict_history":[{"date":"2026-05-03","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-128","claim":"The MIT NANDA 'GenAI Divide' 95% pilot-failure statistic (August 2025) is widely cited in 2026 enterprise procurement decks as evidence that 95% of AI projects fail. The underlying methodology measures something narrower and more specific: 95% of 300 analysed AI projects delivered no measurable P&L impact, where 'no measurable impact' is largely a function of pilots not having documented pre-deployment baselines, not a function of pilots failing technically. The structurally interesting findings underneath the headline (build-vs-buy 67%-vs-22% spread, 40%-licensed / 90%-shadow-using gap, marketing-vs-back-end deployment misdirection, the static-error / learning-gap pattern) are more useful for procurement teams than the headline number, and they update against the Stanford 12/88 bimodal ROI distribution (claim AM-029) cleanly.","article_url":"https://agentmodeai.com/the-mit-genai-pilot-failure-claim/","topic":"enterprise-ai-cost","pub_date":"2026-05-04","last_reviewed":"2026-05-04","next_review":"2026-07-03","verdict":"holding","verdict_history":[{"date":"2026-05-04","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-129","claim":"No mid-market enterprise has produced a documented +240% ROI in 90 days from agentic AI under audited conditions. Read against McKinsey State of AI 2025 (n=1,993; 23% scaling, 17% EBIT-attribution at 12-month horizon), MIT NANDA GenAI Divide (95% of pilots produce no measurable P&L impact, 67% buy vs 22% build success spread), and Stanford Digital Economy Lab Enterprise AI Playbook (12/88 bimodal ROI distribution at 12-18 months), the realistic 90-day mid-market ROI band for the highest-discipline 12% cohort is 20-40% operator-time savings on bounded use cases plus a working pilot pattern that scales into 12-18-month measurable ROI — not the 240% ROI in 90 days the vendor pitch frames it as. The four-artefact 90-day deliverable (documented baseline, bounded production deployment, per-class action error budget, scaling-vs-stop decision) is what the 12% cohort actually produces.","article_url":"https://agentmodeai.com/achieve-240-roi-in-90-days-with-ai-agents-for-mid-market/","topic":"enterprise-ai-cost","pub_date":"2026-05-04","last_reviewed":"2026-05-04","next_review":"2026-07-03","verdict":"holding","verdict_history":[{"date":"2026-05-04","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-130","claim":"Agentic AI 2024-2025 produced four distinct classes of evidence the 2026 procurement reader should not collapse into a single 'AI is working' narrative: (1) vendor-published wins inside vendor-controlled environments (ServiceNow internal 90% L1 deflection, framed by Nenshad Bardoliwalla as upper bound conditioned on two decades of structured workflow data the customer does not have), (2) audited customer pilots with active human oversight (BT 35% case-resolution improvement with random checks per Hena Jalil; UK Government Digital Service 26 minutes/day saved across 20,000 staff in Q4 2024; HMRC 28,000-staff M365 Copilot rollout April 2026), (3) public walk-backs (Klarna May 2025 Bloomberg-reported reversal of the 700-agent claim while the original press release stayed live; GitHub Copilot April 2026 token-counting bug; Salesforce Agentforce IT 200-customer reality vs Marc Benioff's launch pitch), and (4) structural failure modes (CRMArena-Pro 35% multi-step agent reliability finding; Carnegie Mellon independent verification at 30-35%; EchoLeak CVE-2025-32711 cross-agent prompt-injection class). Each class produces a different procurement lesson; treating them as one narrative is the most common 2026 enterprise mistake.","article_url":"https://agentmodeai.com/the-agentic-ai-revolution-real-world-success-stories-and-strategic-insights-from-2024-2025/","topic":"agent-procurement","pub_date":"2026-05-04","last_reviewed":"2026-05-04","next_review":"2026-07-03","verdict":"holding","verdict_history":[{"date":"2026-05-04","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-131","claim":"The AI Training Lead role — the human who curates the agent's evaluation set, reviews sampled outputs against it, and partners with the ML engineer on retraining decisions — is now a budget-line for enterprise agentic AI deployments rather than a vendor-bundled professional-services function. Domain experts (five-plus years inside the workflow the agent is meant to assist) outperform pure-ML hires in the role because the work is judgement-heavy, not algorithm-heavy. CIOs that do not budget the role explicitly see deployments fail at the iteration boundary.","article_url":"https://agentmodeai.com/from-it-pro-to-ai-training-lead-the-180k-career-path-nobodys-talking-about/","topic":"agentic-ai-governance","pub_date":"2026-05-05","last_reviewed":"2026-05-05","next_review":"2026-07-04","verdict":"holding","verdict_history":[{"date":"2026-05-05","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-132","claim":"Enterprise agentic AI ROI in 2026 is bimodal across four independent datasets. Stanford Digital Economy Lab's 2026 Enterprise AI Playbook documents 12% of deployments clearing 300%+ ROI with 88% at or below break-even at 12-18 months. Gartner Q1 2026 Infrastructure & Operations Survey reports 28% of AI projects 'fully paying off'. McKinsey State of AI 2025 (n=1,993) reports 23% scaling with 17% EBIT-attribution at 12 months. MIT NANDA's GenAI Divide reports 95% of pilots produce no measurable P&L impact alongside the 67% buy vs roughly 22% build success spread. The 73%/27% slug rounds the four numbers; the bimodal shape is reproducible and the variable separating the two cohorts is operational discipline (instrumented under GAUGE: governance, audit substrate, use-case maturity, guardrails, evidence/baseline, exit posture), not model selection.","article_url":"https://agentmodeai.com/why-73-of-agentic-ai-projects-fail-and-how-the-27-generate-312-roi/","topic":"enterprise-ai-cost","pub_date":"2026-05-05","last_reviewed":"2026-05-05","next_review":"2026-07-04","verdict":"holding","verdict_history":[{"date":"2026-05-05","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-133","claim":"The Q3 2026 quarterly claim review covering 1 May 2026 through 30 July 2026 is the publication's first cross-cycle bulletin: most claims published in Q2 have now cleared their first scheduled review, the Resources register has opened with five RES-* tools running on the same cadence discipline as AM-* and OPS-*, and the deadline-anchored cluster of eleven claims tied to the 2 August 2026 EU AI Act deployer-obligations enforcement window is in pre-enforcement state. The Q3 bulletin reports the verdict shifts that the Q2 bulletin could not yet measure; the Q4 bulletin in late October 2026 will report the post-enforcement readout.","article_url":"https://agentmodeai.com/2026-q3/","topic":null,"pub_date":"2026-07-30","last_reviewed":"2026-07-30","next_review":"2026-10-30","verdict":"holding","verdict_history":[{"date":"2026-07-30","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-134","claim":"The 2026 implementation cut on non-human identity for AI agents resolves on three factors (existing IAM relationship, deployment topology, cross-platform integration burden) across six credible control planes: Okta NHI, Microsoft Entra ID Workload Identities, Auth0, Keycloak, SPIFFE/SPIRE for Kubernetes-native deployments, and AWS IAM Roles Anywhere for hybrid AWS-anchored deployments. The procurement-defensible audit substrate captures three event classes regardless of vendor: identity issuance, authentication, and authorisation.","article_url":"https://agentmodeai.com/agent-identity-iam-architecture-nhi/","topic":"agentic-ai-governance","pub_date":"2026-05-05","last_reviewed":"2026-05-05","next_review":"2026-07-04","verdict":"holding","verdict_history":[{"date":"2026-05-05","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-135","claim":"EU AI Act Article 50 takes effect 2 August 2026 and creates four distinct transparency obligations requiring different UX implementations: Article 50(1) chatbot interaction disclosure on providers, Article 50(2) machine-readable marking on generative AI output, Article 50(3) biometric categorisation and emotion recognition disclosure on deployers, and Article 50(4) deepfake disclosure on deployers (with the artistic-or-creative-work exception). The procurement-defensible disclosure UX has six properties (visible at the right moment, plain language, persistent or recurrent, linked to a substantive disclosure surface, auditable, updateable). Most enterprises have absorbed the legal text without designing the UX it requires.","article_url":"https://agentmodeai.com/eu-ai-act-article-50-transparency-disclosure/","topic":"agentic-ai-governance","pub_date":"2026-05-05","last_reviewed":"2026-05-05","next_review":"2026-08-04","verdict":"holding","verdict_history":[{"date":"2026-05-05","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-136","claim":"Across the 24-month window May 2024 to April 2026, every major foundation-model provider (Anthropic, OpenAI, Google, AWS Bedrock, Azure OpenAI) experienced at least one multi-hour outage that exceeded the SLA-credit threshold defined in their published terms. The procurement-defensible posture is multi-provider routing with documented failover and hard-dollar incident liability above the standard SLA-credit cap. Three architectural patterns dominate 2026 production deployments: gateway abstraction (LiteLLM, OpenRouter, Portkey), provider-side regional failover (partial mitigation), and explicit multi-provider provisioning at the application layer.","article_url":"https://agentmodeai.com/foundation-model-uptime-sla-track-record/","topic":"agent-procurement","pub_date":"2026-05-05","last_reviewed":"2026-05-05","next_review":"2026-06-04","verdict":"holding","verdict_history":[{"date":"2026-05-05","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-137","claim":"Agent evaluation in production resolves on three operational components that determine whether the chosen evaluation platform produces useful signal: eval-set design across three layers (50-200 calibration prompts, 30-100 edge-case prompts, 10-50 production-sampled prompts per week), drift detection across three signal classes (output-distribution, score-distribution, tool-use distribution), and a regression-budget framework that forces binary ship/hold decisions (defensible default 5% absolute decline on calibration set, 10% on edge-case set, per release window). The procurement decision (which platform to buy, covered at AM-122) is the easier half; the operational discipline is what most enterprises under-invest in even after buying a platform.","article_url":"https://agentmodeai.com/agent-evaluation-in-production/","topic":"agent-procurement","pub_date":"2026-05-05","last_reviewed":"2026-05-05","next_review":"2026-07-04","verdict":"holding","verdict_history":[{"date":"2026-05-05","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-138","claim":"The 2 August 2026 EU AI Act deployer-obligations enforcement window adds three new clause families to the AI MSA red-team checklist that were optional or absent in pre-enforcement contracts: Article 11 technical-file pass-through, Article 16 post-market-monitoring support, and Article 26 deployer-documentation supply. The post-enforcement checklist grows from the 38-item RES-005 v1.0 baseline to roughly 54 items across 11 clause families, with Article 50 transparency UX (covered at AM-135) and foundation-model uptime hard-dollar liability (covered at AM-136) as additional 2026 additions. The asymmetric-instrument observation — that enterprise and operator AI procurement face the same vendor-citation-chain manipulation pattern with different audit instruments — is embedded as a 600-word insert in this piece.","article_url":"https://agentmodeai.com/vendor-msa-renewal-post-eu-ai-act-enforcement/","topic":"agent-procurement","pub_date":"2026-05-05","last_reviewed":"2026-05-05","next_review":"2026-08-04","verdict":"holding","verdict_history":[{"date":"2026-05-05","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-139","claim":"Enterprise AI buyers and operator AI buyers face the same vendor-citation-chain manipulation pattern with asymmetric audit instruments, and consume vendor case studies aimed at the other cohort with mirror-image misreads. The enterprise reads the IndieHacker timeline as procurement-cycle benchmark and removes controls under timeline pressure; the operator reads the Fortune-500 efficiency gain as result-attribution and inherits expectation without the operational substrate. The cross-borrow that is procurement-defensible at both scales: enterprises borrow the operator's cancellation-trigger discipline (OPS-051) and the cohort-fit filter (OPS-011); operators borrow the enterprise's MSA red-team scoped down (RES-005), evaluation discipline scaled to weekly (AM-137), and audit substrate at lightweight scale (AM-046). The verification gap is the same gap; the instruments are different; the publication's two-register architecture is the editorial response.","article_url":"https://agentmodeai.com/vendor-case-study-misreads-across-buyers/","topic":"agent-procurement","pub_date":"2026-05-05","last_reviewed":"2026-05-05","next_review":"2026-08-04","verdict":"holding","verdict_history":[{"date":"2026-05-05","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-140","claim":"Vendor 'successful pilot' references presented at procurement-committee evaluation transfer to scaled production at the procuring enterprise's measurement and governance regime at roughly the McKinsey 23% rate (n=1,491, Nov 2025); the gap is operational rather than capability-driven and is tractable with six pre-pilot questions a procurement committee can require answered in writing before the contract closes, not after.","article_url":"https://agentmodeai.com/agentic-ai-pilot-to-production-gap/","topic":"agent-procurement","pub_date":"2026-05-06","last_reviewed":"2026-05-06","next_review":"2026-07-05","verdict":"holding","verdict_history":[{"date":"2026-05-06","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-001","claim":"For a 4–10 person ops team running ~50 automations including five agentic steps in 2026, the platform choice is binary between n8n self-hosted and Make.com Pro, decided by whose time pays for the platform; Zapier earns its cost only when a critical integration is vendor-locked.","article_url":"https://agentmodeai.com/operators/n8n-vs-make-com-vs-zapier/","topic":null,"pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-05-26","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-002","claim":"For a 5-person consultancy already on either Notion or ClickUp in 2026, the AI features alone do not justify a workspace switch; the bundling difference (Notion bundles AI into Business at $19.50/seat, ClickUp Brain is a separate $9/seat add-on) makes the platform-shape choice (doc-centric vs project-centric) the actual decision.","article_url":"https://agentmodeai.com/operators/notion-ai-vs-clickup-ai-consultancy/","topic":null,"pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-05-26","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-003","claim":"For a solo founder choosing exactly one consumer AI subscription at around $20/month in 2026, the choice between Claude Pro and ChatGPT Plus is workflow-shape (long-document review and code favour Claude Pro; voice mode, image generation, and integration breadth favour ChatGPT Plus) — not capability-rank, which both vendors trade leadership on monthly.","article_url":"https://agentmodeai.com/operators/claude-pro-vs-chatgpt-plus-solo-founder/","topic":null,"pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-05-26","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-005","claim":"At sub-1M tokens per month (typical SMB agent volume) in 2026, the absolute dollar gap between Claude Haiku 4.5, GPT-4o-mini, and Gemini 2.5 Flash is small enough (≤$3/month) that price is the wrong tiebreaker; tool-use reliability, instruction-following on long context, and ecosystem fit determine the right cheap-tier model per workload shape.","article_url":"https://agentmodeai.com/operators/anthropic-vs-openai-vs-gemini-api-smb/","topic":null,"pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-05-26","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-011","claim":"If a candidate first-AI-agent use case at an SMB cannot answer all four of (a) what does success look like in numbers, (b) who owns it on Monday, (c) what breaks if it fails silently, (d) what is the rollback — the use case is not ready to deploy, regardless of vendor demo quality or model capability.","article_url":"https://agentmodeai.com/operators/picking-first-ai-agent-small-business/","topic":null,"pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-26","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-014","claim":"An SMB AI vendor evaluation defensible to the typical cyber-insurance reasonable-care expectation can be completed in 90 minutes by walking through five questions in order — model provenance, data residency, sub-processor list, breach history, termination clause — each answered from the vendor's public site or the contract about to be signed.","article_url":"https://agentmodeai.com/operators/ai-vendor-due-diligence-small-business/","topic":null,"pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-26","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-021","claim":"Across the published 2026 small-bookkeeping AI corpus (Xero OS, Intuit Assist, Canopy AI Notetaker, Digits MCP Server, with CPA Practice Advisor as the trade-press source), AI now reliably handles five recurring grind workflows at 1-to-5-person firm scale (bank-feed categorisation, receipt OCR, recurring journal posting, sales-tax reconciliation, AR ageing emails), but the judgement-call workflows (period close, advisory conversations, audit defence) remain human-led.","article_url":"https://agentmodeai.com/operators/ai-bookkeeping-small-firm-case-study/","topic":null,"pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-26","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-022","claim":"Across the published 2026 small-law-firm AI corpus (Spellbook with named small-firm customers Westaway, KMSC Law, Polley Faith; Harvey AI with mid-size roster Thompson Hine through Lowenstein Sandler; GC AI as named Anthropic enterprise customer claiming 1,500 companies and 14 hours/week saved), AI now ships at 1-to-20 lawyer-firm scale for contract drafting, document review at scale, and legal research with citation, but privileged-content workflows still require Enterprise-tier model access with zero-data-retention contractual posture per ABA Formal Opinion 512.","article_url":"https://agentmodeai.com/operators/ai-small-law-firm-case-study/","topic":null,"pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-26","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-026","claim":"The published 2026 construction-AI case corpus is overwhelmingly vendor-led (Procore, Autodesk Construction Cloud, OpenSpace, Buildots, Doxel) with thin named small-contractor self-published cases. Reading the vendor corpus honestly, three workflows now show consistent under-100-employee contractor AI deployment (estimating speed via takeoff acceleration, schedule risk surfacing, as-built reality capture); a fourth (AI safety detection) remains structurally biased toward larger sites with the camera coverage and safety officer to act on alerts.","article_url":"https://agentmodeai.com/operators/ai-small-construction-firm-case-study/","topic":null,"pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-26","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-027","claim":"Across the published 2026 dental-AI case corpus (Pearl with FDA-cleared 2D and 3D radiography AI plus 23,000 published practices; Overjet with 21+ named small-and-family-practice customers including Promenade Center, Quest Dental, Midtown Dental Studio), AI now ships at 1-to-3-dentist practice scale for FDA-cleared radiography assist, insurance verification automation, and patient-education visualisation; ambient voice AI for clinical notes is the next surface to ship widely.","article_url":"https://agentmodeai.com/operators/ai-small-dental-practice-case-study/","topic":null,"pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-26","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-028","claim":"The published 2026 small-beauty-salon AI case-study corpus is materially thinner than dental, legal, or bookkeeping (booking platforms publish customer counts but rarely individual-salon AI-attributable outcomes; solo stylists who use AI share informally on Instagram and TikTok rather than in case-study form). Reading the platform corpus honestly, the 2026 working pattern at 1-to-5 chair scale concentrates on no-show reduction via deposits, marketing copy via consumer-tier AI assistants, and portfolio/look generation via Canva and similar tools. AI-driven hairstyling recommendation, voice-AI booking, and dynamic pricing are not yet at the published-case-density that supports a small-salon recommendation.","article_url":"https://agentmodeai.com/operators/ai-small-beauty-salon-case-study/","topic":null,"pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-26","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-029","claim":"For solo founders and small teams (under ~50 people) building with AI in 2026, the build-vs-buy decision tree has inverted: specification, not engineering capacity, is now the bottleneck. The teams that can describe their workflow in operational detail can ship things they could not previously afford to build; the teams that cannot still cannot ship, regardless of how good the AI tooling is.","article_url":"https://agentmodeai.com/operators/three-launches-with-ai-the-lessons/","topic":"agentic-ai-governance","pub_date":"2026-04-28","last_reviewed":"2026-04-28","next_review":"2026-06-27","verdict":"holding","verdict_history":[{"date":"2026-04-28","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-030","claim":"The fastest path for an owner-operator to build practical agentic-AI competence in 2026 is the three-week build-by-shipping protocol — specification + scaffolding + ship + connect + deploy + iterate, against a real workflow, with one external user — not formal study or consulting engagement. The protocol produces more transferable competence than published comparable courses on three measurable outcomes: operational decisions the operator can make after, debugging capability without external help, and calibration on when to build versus buy.","article_url":"https://agentmodeai.com/operators/using-ai-to-learn-ai-operator-playbook/","topic":"agentic-ai-governance","pub_date":"2026-04-28","last_reviewed":"2026-04-28","next_review":"2026-06-27","verdict":"holding","verdict_history":[{"date":"2026-04-28","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-031","claim":"Solo founders evaluating AI bookkeeping in 2026 face three realistic options: a fully-managed AI-augmented service (Bench, Pilot), a software-led tool that does AI categorisation inside an existing accounting product (QuickBooks Live, Xero with Hubdoc), or a DIY stack (Claude/ChatGPT + a spreadsheet template). The fully-managed option scales when revenue passes ~$30K MRR; below that, the DIY stack with a 30-min monthly review beats both software-led and managed. The failure mode is paying for managed-service automation while still doing 80% of the categorisation yourself because the AI hasn't seen enough of your transaction patterns yet.","article_url":"https://agentmodeai.com/operators/ai-bookkeeping-for-solo-founders/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-13","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026 covering the three-option market split (fully-managed / software-led / DIY) and the ~$30K MRR threshold for fully-managed to become net-positive. REVIEW: Peter to verify current Bench and Pilot entry-tier pricing on or before 13 Jun 2026; if either has launched a sub-$100/month tier the threshold call shifts."}],"primary_sources":[]},{"id":"OPS-032","claim":"For SMB content workflows in 2026 (blog drafts, weekly newsletter, social copy, email sequences) at a 1-to-10 person business shipping two-to-four pieces per week, the practitioner read is workflow-shape not capability-rank: Claude wins on long-form editorial voice and structured drafting; ChatGPT wins on speed-and-iteration plus image generation in the same conversation; Gemini wins on Google-stack integrations. Paying for all three Plus tiers (around $60/month) without a deliberate task split is the expensive failure mode.","article_url":"https://agentmodeai.com/operators/chatgpt-vs-claude-vs-gemini-smb-content/","topic":"agent-procurement","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-13","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026 with status=partial. Recommendation derived from vendor pricing pages 29 Apr 2026 + public eval leaderboards + practitioner write-ups, not from a tracked SMB-cohort replication. Promotes to Holding once two consecutive 45-day reviews replicate the workflow-shape split on a real operator sample. REVIEW: Peter."}],"primary_sources":[]},{"id":"OPS-033","claim":"AI customer-service automation at 1-10 employee scale clears net-positive only when 70% or more of weekly inquiries are repetitive, low-stakes, and factually resolvable (hours, pricing, simple status). Below 50% the trust-erosion and remediation cost exceeds the headcount saving; between 50% and 70%, the answer turns on whether responsiveness is the brand differentiator.","article_url":"https://agentmodeai.com/operators/ai-customer-service-small-business/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-13","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026. Break-even thresholds (70/50) and never-deflect list are editorial synthesis from cited platform docs and CS-automation research, not a primary-data study. REVIEW: Peter to validate against any first-party SMB deployment data he has access to before status promotion to Holding."}],"primary_sources":[]},{"id":"OPS-034","claim":"For a solo founder processing 100-300 emails a day in 2026, the cheap-stack option (Gmail labels + Claude Pro at $20/month + a 5-line prompt template) recovers roughly 90% of the value of an $83/month premium stack (Superhuman AI + Shortwave Pro + Reclaim.ai Pro) at about 24% of the cost. The premium stack is worth its price under three conditions only — 2+ hours/day in email, keyboard-shortcut speed gain that pays back at the founder's hourly rate, and a documented bottleneck the cheap stack failed to solve after a two-week trial. Without all three, the founder is paying for an aesthetic, not measurable productivity.","article_url":"https://agentmodeai.com/operators/solo-founder-email-triage-ai-stack/","topic":"agent-procurement","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-13","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026 with status=partial. Cost-side claims (vendor pricing) verifiable against the four cited pricing pages on the publication date. Time-recovery claim (90+ min compressed to ~20 min) drawn from published productivity-blogger benchmarks rather than Peter-run measurement; first-cohort replication on the publication's tracked operator cohort due by 13 Jun 2026. REVIEW: Peter."}],"primary_sources":[]},{"id":"OPS-035","claim":"There are five categories of small-business work where AI substitution in 2026 costs more in trust and liability exposure than it saves in productivity: (1) signed legal documents and tax-return positions, (2) trust-laden customer touchpoints (cancellations, refunds, conflict de-escalation), (3) regulatory submissions where the human signature is the audit trail, (4) anything requiring genuine domain credentialing (medical advice, licensed financial advice, signed engineering work), and (5) the first six conversations with a new high-value client.","article_url":"https://agentmodeai.com/operators/when-not-to-use-ai-for-small-business/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-13","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026. Status set to Partial at publication because category 5 lacks the same regulatory/cited-consequence anchor as categories 1-4. REVIEW: Peter to confirm category 5 evidence base and either upgrade to Holding (with strengthened citation) or amend the claim to four categories."}],"primary_sources":[]},{"id":"OPS-036","claim":"An SMB AI policy that actually changes day-to-day behaviour fits on one page and contains exactly eight clauses — sanctioned tools, prohibited data, human-review gate, client disclosure rule, prohibited uses, incident-report path, review cadence, and signature line — each closing a failure mode currently surfacing in regulatory guidance, court records, and breach disclosures through 2025-2026.","article_url":"https://agentmodeai.com/operators/1-page-ai-policy-for-small-business/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-13","verdict":"partial","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026. Status set to Partial at publication because clause 6 commentary references an order-of-magnitude remediation-cost gap derived from the IAPP 2024 AI Governance Profession Report; the report characterises the gap as material but does not publish a precise multiple, so the wording is annotated source: our-estimate. REVIEW: Peter to source a precise figure or amend the commentary."}],"primary_sources":[]},{"id":"OPS-037","claim":"AI-drafted invoices for EU SMB operators in 2026 fail VAT audit at higher rates than human-drafted invoices specifically on cross-border treatment (OSS scheme wording, reverse-charge language, customer VAT-status verification), because LLM training data underweights post-2021 e-commerce VAT rules. The fix is a 4-line VAT-compliance prompt prefix that names the operator's VAT registration, the customer's VAT status, and the applicable scheme; most SMB invoicing tooling does not ship this by default.","article_url":"https://agentmodeai.com/operators/ai-invoicing-vat-compliance-small-business/","topic":null,"pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-30","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-038","claim":"SMB AI-VA deployments displacing admin work in collective-agreement-covered sectors (Dutch CAO, German Tarifvertrag, French Convention Collective) trigger collective-agreement provisions even at sub-10-employee scale in 2026, via job-classification-displacement and technology-introduction-consultation channels. Most SMB owners are unaware until the first union audit; FNV / DGB / IG Metall / CFDT activity in this area has shifted from theoretical to operational since 2024.","article_url":"https://agentmodeai.com/operators/ai-va-small-business-collective-agreement/","topic":null,"pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-30","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-039","claim":"AI-drafted contracts in EU notary-required jurisdictions (NL, DE, AT, BE, CH) are producing a class of legal-malpractice incidents in 2026 where the SMB owner treats an AI draft as final binding document, missing the notarisation requirement for real-estate transfers, GmbH/BV share transfers, and certain marriage/inheritance instruments. The fix is a 30-second pre-signing check on transaction-type and jurisdictional notarial-form requirement; AI tooling does not flag this by default.","article_url":"https://agentmodeai.com/operators/ai-drafted-contracts-notary-requirement-eu/","topic":null,"pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-30","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-040","claim":"Dutch ZZP'ers losing recurring client work to AI replacement in 2026 sit outside the WW (Werkloosheidswet) safety net entirely and find that available AOV (arbeidsongeschiktheidsverzekering) products mostly exclude demand-side income loss; the structural gap is pushing affected ZZP'ers into bijstand at faster rates than the 2024 baseline. The realistic options are operational (client-base diversification, offer restructuring, larger liquid buffer), not insurance-based.","article_url":"https://agentmodeai.com/operators/zzp-ai-displacement-unemployment-gap-nl/","topic":null,"pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-30","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-041","claim":"SMB owners using AI to produce marketing content are hitting platform algorithmic penalties at increasing rates in 2026, with platform-specific enforcement: Google Helpful Content system + March 2024 spam policy update target scaled-content-without-E-E-A-T; LinkedIn feed-distribution deprioritises fully-AI-generated content while tolerating AI-assist; Etsy listing-policy enforcement is heavier than either, with category-specific AI prohibitions. The defensible cross-platform posture is AI drafts + human edits + human signature with sustainable cadence.","article_url":"https://agentmodeai.com/operators/platform-algorithm-ai-content-penalties/","topic":null,"pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-30","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-042","claim":"For under-100-employee construction firms in 2026, the AI procurement order is estimating + bidding tools first (Togal.AI for general takeoff; Procore Copilot if already on Procore), with visual-progress capture (Buildots, OpenSpace) deferred until project portfolio exceeds 8 simultaneous projects per project manager. The vendor pitch oversells visual capture and undersells the takeoff workflow where the actual hours go (35-45% of estimator/PM time on bidding work, 5-10% on jobsite walkthroughs).","article_url":"https://agentmodeai.com/operators/ai-construction-estimating-bidding-tools/","topic":null,"pub_date":"2026-05-03","last_reviewed":"2026-05-03","next_review":"2026-07-02","verdict":"holding","verdict_history":[{"date":"2026-05-03","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-043","claim":"For solo founders under €5K MRR running 20-80 customer-service tickets per week in 2026, the cheap stack (shared inbox host + Claude Pro at €20/month + a copy-paste prompt-pack, total under €40/month) is structurally cheaper than the dedicated AI helpdesks (Intercom Fin, Crisp AI, Tidio Lyro) until ticket volume sustains above ~200/week. Above that threshold, the per-resolution and per-conversation pricing on the dedicated platforms starts to compete; below it, the cheap stack wins on cost AND on operator experience. The volume threshold is the procurement signal, not the vendor pitch.","article_url":"https://agentmodeai.com/operators/solo-founder-customer-service-ai-stack/","topic":null,"pub_date":"2026-05-03","last_reviewed":"2026-05-03","next_review":"2026-07-02","verdict":"holding","verdict_history":[{"date":"2026-05-03","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-044","claim":"For appointment-driven local-service businesses in 2026 (hairdresser, plumber, garage, cleaner, beautician), the AI value concentrates in two workflows neither booking-platform AI feature serves well: no-show reduction via personalised SMS sequences (3rd-party SMS API on top of the booking platform's webhook, typical 30-50% no-show reduction in published case studies) and review generation (post-appointment SMS or WhatsApp, typical 3-5x review-completion lift). The booking-platform decision (Booksy, Square Appointments, Treatwell, Vagaro) is shaped by customer-discovery model and existing payment infrastructure; the AI decision is shaped by whichever third-party SMS-and-review-automation layer bolts on top. Operators picking the booking platform on its bundled AI features pay for AI that does not move the numbers.","article_url":"https://agentmodeai.com/operators/ai-local-service-business-appointment-driven/","topic":null,"pub_date":"2026-05-03","last_reviewed":"2026-05-03","next_review":"2026-07-02","verdict":"holding","verdict_history":[{"date":"2026-05-03","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-045","claim":"OPS-031's jurisdiction-neutral DIY AI bookkeeping case for solo founders under €30K MRR breaks at the NL-specific Belastingdienst audit-trail boundary. The procurement decision per omzetband: Moneybird (€15-€39/month) under €100K omzet with API-driven AI flow via Make.com or n8n; e-Boekhouden as the goedkope fallback with bundled Scan & Herken OCR; Exact Online above €500K omzet or at BV-overgang where Exact's interne AI replaces the external prompt-pack workflow. NL-specifieke prompt-prefix (klant locatie, dienst type, reverse-charge applicability, OSS-scheme applicability, BTW-rubriek per Belastingdienst-aangifte 2026) is the operationally load-bearing addition that makes AI-getekende journaalposten direct invoerbaar in the chosen tool's BTW-aangifte.","article_url":"https://agentmodeai.com/operators/ai-bookkeeping-nl-moneybird-eboekhouden-exact/","topic":null,"pub_date":"2026-05-03","last_reviewed":"2026-05-03","next_review":"2026-07-02","verdict":"holding","verdict_history":[{"date":"2026-05-03","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-046","claim":"Marketplace-reseller AI in 2026 fails differently per platform and the cross-platform mitigation pattern is to separate AI-on-listing-copy (broadly safe across Etsy, Marktplaats, Vinted) from AI-on-listing-images (increasingly penalised on all three platforms via different mechanisms: Etsy's Creativity Standards and AI-disclosure requirement; Marktplaats's photo-fingerprint deduplication; Vinted's image-similarity penalty for resale-of-resold). The 'AI does the entire listing' workflow is the procurement pattern that produces the account-suspension report 6-12 months later. The defensible reseller workflow uses real photos, AI-assisted copy with platform-required disclosure, and per-platform performance tracking on impressions and sales.","article_url":"https://agentmodeai.com/operators/ai-marketplace-resellers-etsy-marktplaats-vinted/","topic":null,"pub_date":"2026-05-03","last_reviewed":"2026-05-03","next_review":"2026-07-02","verdict":"holding","verdict_history":[{"date":"2026-05-03","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-047","claim":"EU AI Act Annex III point 4 (employment, workers management, recruitment) applies to SMB AI hiring use even at four-employee scale; the threshold does not scale with company size, and the 2 August 2026 enforcement window covers AI-screened CVs in ChatGPT/Claude/Gemini the same way it covers dedicated platforms (Workable, Greenhouse, Lever, BrightHire). The defensible posture is AI-assisted decisions with a documented human decision-maker plus retained AI-output records — not AI-decided hiring. Solely-automated candidate scoring also conflicts with GDPR Article 22; ICO, AP, and Garante guidance from 2024-2025 is consistent on the human-in-the-loop requirement.","article_url":"https://agentmodeai.com/operators/ai-hiring-smb-eu-ai-act-annex-iii/","topic":"agentic-ai-governance","pub_date":"2026-05-04","last_reviewed":"2026-05-04","next_review":"2026-07-05","verdict":"holding","verdict_history":[{"date":"2026-05-04","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-048","claim":"Solo founders adding AI to cold outbound see a deliverability collapse around day 60-90 because AI lifts personalisation breadth at the same volume rather than personalisation depth at lower volume. The collapse is mechanical: AI-templated personalisation degrades recipient engagement, engagement decay triggers spam-classifier de-prioritisation, lower inbox rate produces more complaints, complaints trigger soft blocks. The defensible 2026 posture: 30-40 sends per inbox per day, named-specific first-paragraph personalisation, reply-rate KPI not open-rate, plus a documented EU GDPR Article 6(1)(f) Legitimate Interest Assessment for B2B founders in scope of e-Privacy Directive.","article_url":"https://agentmodeai.com/operators/ai-cold-sales-solo-founder-deliverability/","topic":"agentic-ai-implementation","pub_date":"2026-05-04","last_reviewed":"2026-05-04","next_review":"2026-06-18","verdict":"holding","verdict_history":[{"date":"2026-05-04","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-049","claim":"German Mittelstand AI deployment in 2026 hits two compliance surfaces most US-headquartered AI vendors do not handle out of the box: BetrVG §87(1) point 6 co-determination triggers at the first AI assistant or agent that touches employee work activity (Bundesarbeitsgericht broad interpretation covers any system that captures, processes, or analyses employee work activity, primary purpose immaterial); DSGVO Article 35 + Datenschutzkonferenz Muss-Liste require pre-deployment DPIA for most AI-employee-data deployments. The early-engagement workflow (works council notified at vendor selection, DPIA in parallel with vendor evaluation, joint Betriebsvereinbarung drafting, documented pilot at one team for 60-90 days, broader rollout after pilot review) compresses Mittelstand AI timeline from 12-18 months (late engagement) to 6-9 months.","article_url":"https://agentmodeai.com/operators/ai-mittelstand-betrvg-dsgvo-deployment/","topic":"agentic-ai-governance","pub_date":"2026-05-04","last_reviewed":"2026-05-04","next_review":"2026-07-05","verdict":"holding","verdict_history":[{"date":"2026-05-04","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-050","claim":"Local SMB AI use on Google Business Profile and local-SEO content splits into two cohorts in 2026: AI-as-research-and-assembly (keyword research, citation audit, performance analysis via Surfer/Frase/Ahrefs/BrightLocal/Whitespark) compounds visibility safely; AI-as-generation (auto-published reviews, auto-published review responses, bulk service-area pages, high-cadence GBP posts) triggers Google's Helpful Content classifier and the March 2024 spam policy update enforcement, with documented suspensions and ranking collapse on a 30-90 day cycle. The defensible posture is AI for the work that scales poorly (research, cross-reference) and human for any content that reaches the public surface.","article_url":"https://agentmodeai.com/operators/ai-local-seo-google-business-profile-smb/","topic":"shadow-ai-discovery","pub_date":"2026-05-04","last_reviewed":"2026-05-04","next_review":"2026-07-05","verdict":"holding","verdict_history":[{"date":"2026-05-04","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-051","claim":"AI proposal tools in 2026 split into two clusters by what they let the operator publish: tools that AI-assist proposal assembly (PandaDoc, Better Proposals, Proposify, Bonsai) compound; tools that AI-generate proposal narrative (Pitch, Gamma, Tome AI generation features) read as AI-generated to most buyers within thirty seconds and close at materially lower rates. Three structural patterns trigger the buyer-side AI-generated detection: the three-phase project structure regardless of actual scope, the credentials paragraph that lists capability without naming clients, the pricing section that over-explains itself. The defensible posture is AI for assembly (pricing tables, scope-of-work blocks, clause libraries from CRM) and human for voice (cover letter, executive summary, project-fit paragraph, next-step CTA).","article_url":"https://agentmodeai.com/operators/ai-client-proposals-tools-solo-founder/","topic":"agentic-ai-implementation","pub_date":"2026-05-04","last_reviewed":"2026-05-04","next_review":"2026-06-18","verdict":"holding","verdict_history":[{"date":"2026-05-04","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-052","claim":"Voor de Nederlandse zelfstandige advocaat (eenmanspraktijk, klein kantoor onder 5 partners) is AI in 2026 toegestaan voor drie hoofdcategorieën onder de NOvA-gedragsregels: juridisch onderzoek met advocaat-verificatie van elke citatie, document-drafting waar de advocaat reviewt en signeert, en cliëntcommunicatie-ondersteuning waar de advocaat elke uitgaande communicatie reviewt voor verzending. AI is niet toegestaan zonder advocaat-review voor: advies-generatie aan cliënten, procesvertegenwoordiging, cliëntgegevens-verwerking via niet-EU-LLM zonder Verwerkersovereenkomst, en het ondertekenen van documenten met AI-gegenereerde citaten zonder primaire-bron-verificatie. EU AI Act Artikel 50 disclosure is verplicht voor cliënt-AI-chatbots vanaf 2 augustus 2026.","article_url":"https://agentmodeai.com/operators/ai-solo-legal-paralegal-nl-bar-rules/","topic":null,"pub_date":"2026-05-05","last_reviewed":"2026-05-05","next_review":"2026-08-04","verdict":"holding","verdict_history":[{"date":"2026-05-05","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-053","claim":"For marketplace resellers running AI image workflows in 2026, the safe pattern across Marktplaats, Vinted, and Etsy is original photography of the actual item with light AI enhancement (lighting, contrast, background cleanup) only. AI-generated listing imagery and heavy enhancement that produces consistent visual fingerprints across listings trigger Marktplaats's photo-fingerprint deduplication (most aggressive), Vinted's image-similarity penalty for the resale-of-resold pattern, and Etsy's Creativity Standards on AI-generated imagery in handmade categories. The five-rule safe workflow: original photography of every item, light AI enhancement only, fresh photography per relisting, per-platform disclosure where required, and impression-to-view ratio tracking as the leading indicator of algorithm-induced ranking suppression.","article_url":"https://agentmodeai.com/operators/ai-marketplace-image-workflow-marktplaats-vinted/","topic":null,"pub_date":"2026-05-05","last_reviewed":"2026-05-05","next_review":"2026-07-04","verdict":"holding","verdict_history":[{"date":"2026-05-05","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-054","claim":"For EU-based solo developers doing client work in 2026, the procurement-defensible AI-tool posture turns on client-code data residency rather than on Cursor-vs-Copilot-vs-Claude-Code feature comparison. All three dominant AI coding tools support EU data residency at Enterprise tiers (Copilot via Microsoft Azure OpenAI EU regions, Cursor via configurable LLM provider routing, Claude Code via Anthropic API EU-region availability). Three contract clauses now appear in regulated EU client agreements: client-code-non-transmission, EU-residency requirement, and sub-processor disclosure. The procurement-defensible workflow has five steps: AI-tool inventory, per-client risk assessment, configure tools per client, document configuration in engagement contract, audit quarterly. Three scenarios where the right answer is to disable AI tooling entirely: explicit contract prohibition that cannot be negotiated, embedded regulated data in the codebase, national-security or jurisdictionally-sensitive code.","article_url":"https://agentmodeai.com/operators/ai-solo-dev-eu-client-code-residency/","topic":null,"pub_date":"2026-05-05","last_reviewed":"2026-05-05","next_review":"2026-07-04","verdict":"holding","verdict_history":[{"date":"2026-05-05","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-055","claim":"For German solo founders and small Mittelstand operators running AI-bookkeeping in 2026, the Buchhaltungssoftware choice resolves on Umsatz tier and Steuerberater relationship: DATEV (€20-€80/month plus Steuerberater-coupling) above €100K Umsatz where the Steuerberater workflow is binding, sevDesk (€8-€48/month) under €100K Umsatz as the cheapest path that produces a GoBD-compliant audit trail, and Lexware (€10-€40/month) as the legacy-Mittelstand fallback. The OPS-031 jurisdiction-neutral DIY-AI-bookkeeping case breaks at the moment the AI-drafted Buchungssatz must land in a tool that preserves the GoBD audit trail; the German-tool layer is the complement to the DIY-AI case. The OSS-Verfahren and reverse-charge VAT prompt-prefix is the operational discipline that prevents AI-VAT-error in 1 of 20 EU-cross-border invoices.","article_url":"https://agentmodeai.com/operators/ai-bookkeeping-de-datev-sevdesk-lexware/","topic":null,"pub_date":"2026-05-05","last_reviewed":"2026-05-05","next_review":"2026-07-04","verdict":"holding","verdict_history":[{"date":"2026-05-05","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-056","claim":"For bootstrapped SaaS founders under €30K MRR with AI features in production, the metric that matters is token cost per active user (not total monthly AI spend). Total monthly spend is the lagging indicator that signals problems only after they have crossed gross-margin thresholds; cost per active user is the leading indicator that catches runaway patterns before they erode unit economics. The defensible cancellation-trigger threshold sits at 30-40% of per-user revenue. Four levers when the cost crosses the trigger, ranked by disruption: provider-tier switch (40-70% reduction, low impact), prompt and caching optimisation (20-40% reduction, moderate impact), product change (30-60% reduction, high impact), provider switch (10-30% reduction, highest disruption). Token cost dropped roughly 90% from 2023-2026 but per-user cost stayed flat because product features pulled 10-30x more tokens per session and user behaviour shifted toward higher engagement.","article_url":"https://agentmodeai.com/operators/ai-cost-discipline-bootstrapped-saas/","topic":null,"pub_date":"2026-05-05","last_reviewed":"2026-05-05","next_review":"2026-07-04","verdict":"holding","verdict_history":[{"date":"2026-05-05","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"RES-001","claim":"The 47-question AI Vendor Security Questionnaire covers seven failure surfaces (model lineage, training/inference data handling, non-human identity, audit/observability, kill-switch, EU AI Act + GDPR posture, contract/indemnification) that CAIQ v4 and SIG do not address; vendors that cannot answer score sections binary-unanswered, and the questionnaire is the addendum (not replacement) to existing cloud/SaaS procurement frameworks.","article_url":"https://agentmodeai.com/resources/ai-vendor-security-questionnaire/","topic":null,"pub_date":"2026-05-04","last_reviewed":"2026-05-04","next_review":"2026-08-02","verdict":"holding","verdict_history":[{"date":"2026-05-04","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"RES-002","claim":"The pre-deployment AI DPIA template fuses GDPR Article 35 obligations with EU AI Act Article 26 (deployer) and Article 27 (FRIA where applicable) into a single working-session document; sections 7 and 8 are conditional on the EU AI Act risk classification established in section 1, which means deployers complete the full document only when the system is classified as high-risk under Annex III.","article_url":"https://agentmodeai.com/resources/ai-dpia-template/","topic":null,"pub_date":"2026-05-04","last_reviewed":"2026-05-04","next_review":"2026-08-02","verdict":"holding","verdict_history":[{"date":"2026-05-04","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"RES-003","claim":"The four-phase agent incident runbook (detect within 4h, contain within 30s, roll back per action class, post-mortem with MTTD-for-Agents detection chain) is the operational overlay on standard SRE incident response that most enterprises deploying agentic AI in 2026 do not have; the seven action classes (database writes, external API calls, customer comms, document publication, code commits, identity changes, knowledge-base writes) each require a distinct rollback procedure and the runbook captures the operator authorised, time budget, and substitute action where rollback is impossible.","article_url":"https://agentmodeai.com/resources/agent-incident-runbook/","topic":null,"pub_date":"2026-05-04","last_reviewed":"2026-05-04","next_review":"2026-07-03","verdict":"holding","verdict_history":[{"date":"2026-05-04","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"RES-004","claim":"The Works Council AI Notification Packet covers three EU jurisdictions (German BetrVG §87(1) point 6, Dutch WOR Article 27, French CSE consultation under Code du travail L2312-8) plus the EU AI Act Article 26(7) deployer-notification overlay that activates 2 August 2026; early engagement (vendor-shortlist landing) compresses deployment timelines from 12-18 months (late engagement) to 6-9 months in the Mittelstand case studied in OPS-049, and the per-jurisdiction documents (Betriebsvereinbarung, OR convenant, CSE avis) are designed to consolidate into a single deployment-go decision with the Article 26(7) notification appended afterward.","article_url":"https://agentmodeai.com/resources/works-council-ai-notification-packet/","topic":null,"pub_date":"2026-05-04","last_reviewed":"2026-05-04","next_review":"2026-08-02","verdict":"holding","verdict_history":[{"date":"2026-05-04","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"RES-005","claim":"The 38-item AI MSA red-team checklist organises the contractual review around seven clause families (training-data carve-outs, output ownership + IP indemnification, model-deprecation rights, sub-processor expansion, kill-switch SLA, exit-data portability, regulatory + EU AI Act flow-through) where 2025-2026 enterprise AI MSA failures cluster; vendors scoring yes on 30+ items are contractually serious, 20-29 items are treatable through negotiation, and below 20 signals that the vendor's commercial position depends on retaining the rights the checklist is designed to constrain.","article_url":"https://agentmodeai.com/resources/ai-msa-red-team-checklist/","topic":null,"pub_date":"2026-05-04","last_reviewed":"2026-05-04","next_review":"2026-07-03","verdict":"holding","verdict_history":[{"date":"2026-05-04","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]}]},"claim_archive":{"description":"Claims made by enterprise-AI vendors, analyst firms, academics, tier-1 publications, regulators, and identified consensus positions. Each is reviewed on a 30/60/90/180-day cadence.","methodology_url":"https://agentmodeai.com/archive/method/","index_url":"https://agentmodeai.com/archive/","insights_url":"https://agentmodeai.com/archive/insights/","feeds":{"new_claims":"https://agentmodeai.com/archive/feed/new-claims.xml","new_reviews":"https://agentmodeai.com/archive/feed/new-reviews.xml","status_changes":"https://agentmodeai.com/archive/feed/status-changes.xml"},"claims":[{"id":"ACA-2026-001","claim":"Stanford HAI's 2026 AI Index reports organisational AI adoption at 88%, with productivity gains of 14-26% in customer support and software development, and up to 72% in marketing teams. AI agent adoption across departments remains in single digits.","claim_url":"https://agentmodeai.com/claims/ACA-2026-001/","source_type":"academic","source_name":"Stanford HAI","source_url":"https://hai.stanford.edu/ai-index/2026-ai-index-report","source_snapshot_url":"https://web.archive.org/web/20260419215510/https://hai.stanford.edu/ai-index/2026-ai-index-report","source_date":"2026-04-14","log_date":"2026-04-19","review_cadence":180,"domain_tags":["adoption-rate","benchmark","capability-benchmark"],"current_verdict":"pending-review","next_review":"2026-10-16","review_history":[],"change_history":[]},{"id":"ACA-2026-002","claim":"Stanford HAI's 2026 AI Index reports that China has erased the US lead in AI on capability + publication-volume benchmarks, with the two countries now at near-parity after a three-year closing curve.","claim_url":"https://agentmodeai.com/claims/ACA-2026-002/","source_type":"academic","source_name":"Stanford HAI","source_url":"https://hai.stanford.edu/news/inside-the-ai-index-12-takeaways-from-the-2026-report","source_snapshot_url":"https://web.archive.org/web/20260419215510/https://hai.stanford.edu/ai-index/2026-ai-index-report","source_date":"2026-04-14","log_date":"2026-04-19","review_cadence":180,"domain_tags":["market-position","capability-benchmark"],"current_verdict":"pending-review","next_review":"2026-10-16","review_history":[],"change_history":[]},{"id":"ACA-2026-003","claim":"Stanford's Digital Economy Lab Enterprise AI Playbook documents bimodal ROI: 12% of enterprise AI deployments clear 300%+ ROI with 40-60% operational cost reductions; 88% sit at or below break-even.","claim_url":"https://agentmodeai.com/claims/ACA-2026-003/","source_type":"academic","source_name":"Stanford DEL","source_url":"https://digitaleconomy.stanford.edu/app/uploads/2026/03/EnterpriseAIPlaybook_PereiraGraylinBrynjolfsson.pdf","source_snapshot_url":"https://web.archive.org/web/20260407050750/https://digitaleconomy.stanford.edu/app/uploads/2026/03/EnterpriseAIPlaybook_PereiraGraylinBrynjolfsson.pdf","source_date":"2026-03-01","log_date":"2026-04-19","review_cadence":180,"domain_tags":["agent-roi","adoption-rate","benchmark"],"current_verdict":"pending-review","next_review":"2026-10-16","review_history":[],"change_history":[]},{"id":"ACA-2026-004","claim":"Carnegie Mellon's TheAgentCompany 2026 update shows Gemini 2.5 Pro as the best enterprise-agent at 30.3% task completion — up from the 24% Claude 3.5 Sonnet baseline in 2024, but still far below production-readiness thresholds.","claim_url":"https://agentmodeai.com/claims/ACA-2026-004/","source_type":"academic","source_name":"Carnegie Mellon University","source_url":"https://arxiv.org/abs/2412.14161","source_snapshot_url":"https://web.archive.org/web/20260411143459/https://arxiv.org/abs/2412.14161","source_date":"2026-03-15","log_date":"2026-04-19","review_cadence":90,"domain_tags":["capability-benchmark","benchmark"],"current_verdict":"pending-review","next_review":"2026-07-18","review_history":[],"change_history":[]},{"id":"ANA-2026-001","claim":"Gartner predicted that over 40% of agentic AI projects will be cancelled by end of 2027, primarily due to escalating costs, unclear business value, and inadequate risk controls.","claim_url":"https://agentmodeai.com/claims/ANA-2026-001/","source_type":"analyst","source_name":"Gartner","source_url":"https://www.gartner.com/en/newsroom/press-releases/2025-06-25-gartner-predicts-over-40-percent-of-agentic-ai-projects-will-be-canceled-by-end-of-2027","source_snapshot_url":"https://web.archive.org/web/20260412112337/https://www.gartner.com/en/newsroom/press-releases/2025-06-25-gartner-predicts-over-40-percent-of-agentic-ai-projects-will-be-canceled-by-end-of-2027","source_date":"2025-06-25","log_date":"2026-04-19","review_cadence":180,"domain_tags":["adoption-rate","agent-roi","deployment-velocity"],"current_verdict":"holding","next_review":"2026-10-16","review_history":[],"change_history":[{"date":"2026-04-19","field_changed":"source_snapshot_url","old_value":"https://web.archive.org/web/20250625120000/https://www.gartner.com/en/newsroom/press-releases/2025-06-25-gartner-predicts-over-40-percent-of-agentic-ai-projects-will-be-canceled-by-end-of-2027","new_value":"https://web.archive.org/web/20260412112337/https://www.gartner.com/en/newsroom/press-releases/2025-06-25-gartner-predicts-over-40-percent-of-agentic-ai-projects-will-be-canceled-by-end-of-2027","reason":"Initial snapshot URL was a Phase 1 placeholder with an estimated timestamp; resolved to the nearest real capture via the Internet Archive availability API (Save Page Now returned HTTP 520 for Gartner — their servers actively limit archival requests)."}]},{"id":"ANA-2026-002","claim":"Only 28% of AI infrastructure-and-operations projects fully pay off, and 57% of leaders reporting failure cited \"expected too much, too fast\" as the top contributing factor.","claim_url":"https://agentmodeai.com/claims/ANA-2026-002/","source_type":"analyst","source_name":"Gartner","source_url":"https://www.theregister.com/2026/04/07/ai_returns_gartner/","source_snapshot_url":"https://web.archive.org/web/20260419221558/https://www.theregister.com/2026/04/07/ai_returns_gartner/","source_date":"2026-04-07","log_date":"2026-04-19","review_cadence":90,"domain_tags":["agent-roi","adoption-rate","deployment-velocity"],"current_verdict":"pending-review","next_review":"2026-07-18","review_history":[],"change_history":[]},{"id":"ANA-2026-003","claim":"Gartner predicts that by 2030, 50% of organisations will use autonomous AI agents to interpret governance policies and technical standards into machine-verifiable data contracts, automating compliance and governance-policy enforcement.","claim_url":"https://agentmodeai.com/claims/ANA-2026-003/","source_type":"analyst","source_name":"Gartner","source_url":"https://www.gartner.com/en/newsroom/press-releases/2026-03-11-gartner-announces-top-predictions-for-data-and-analytics-in-2026","source_snapshot_url":"https://web.archive.org/web/20260312144319/https://www.gartner.com/en/newsroom/press-releases/2026-03-11-gartner-announces-top-predictions-for-data-and-analytics-in-2026","source_date":"2026-03-11","log_date":"2026-04-19","review_cadence":180,"domain_tags":["governance","compliance","adoption-rate"],"current_verdict":"pending-review","next_review":"2026-10-16","review_history":[],"change_history":[]},{"id":"ANA-2026-004","claim":"BCG's \"AI Transformation is a Workforce Transformation\" study finds future-built AI companies achieve 5× revenue increases and 3× cost reductions vs laggards. AI agents accounted for ~17% of AI-driven value in 2025, projected to reach 29% by 2028.","claim_url":"https://agentmodeai.com/claims/ANA-2026-004/","source_type":"analyst","source_name":"Boston Consulting Group","source_url":"https://www.bcg.com/publications/2026/ai-transformation-is-a-workforce-transformation","source_snapshot_url":"https://web.archive.org/web/20260331134658/https://www.bcg.com/publications/2026/ai-transformation-is-a-workforce-transformation","source_date":"2026-03-01","log_date":"2026-04-19","review_cadence":180,"domain_tags":["agent-roi","adoption-rate","market-position"],"current_verdict":"pending-review","next_review":"2026-10-16","review_history":[],"change_history":[]},{"id":"ANA-2026-005","claim":"Gartner predicts that by 2028, 60% of brands will use agentic AI to facilitate streamlined one-to-one interactions with customers.","claim_url":"https://agentmodeai.com/claims/ANA-2026-005/","source_type":"analyst","source_name":"Gartner","source_url":"https://consumergoods.com/gartner-predicts-sharp-rise-ai-agents-within-enterprise-applications-2026","source_snapshot_url":"https://web.archive.org/web/20260419221413/https://consumergoods.com/gartner-predicts-sharp-rise-ai-agents-within-enterprise-applications-2026","source_date":"2026-01-15","log_date":"2026-04-19","review_cadence":180,"domain_tags":["adoption-rate","market-position"],"current_verdict":"pending-review","next_review":"2026-10-16","review_history":[],"change_history":[]},{"id":"ANA-2026-006","claim":"McKinsey's State of AI 2025: 23% of respondents are scaling an agentic AI system, 39% experimenting. 39% attribute any EBIT impact to AI, most saying <5% of EBIT. A 6% 'AI high performer' segment attributes >5% of EBIT to AI.","claim_url":"https://agentmodeai.com/claims/ANA-2026-006/","source_type":"analyst","source_name":"McKinsey Global Institute","source_url":"https://www.mckinsey.com/capabilities/quantumblack/our-insights/the-state-of-ai","source_snapshot_url":"https://web.archive.org/web/20260315102913/https://www.mckinsey.com/capabilities/quantumblack/our-insights/the-state-of-ai","source_date":"2025-11-01","log_date":"2026-04-19","review_cadence":90,"domain_tags":["agent-roi","adoption-rate"],"current_verdict":"pending-review","next_review":"2026-07-18","review_history":[],"change_history":[]},{"id":"PUB-2026-001","claim":"The Financial Times reported that almost 40% of US data-center projects are at risk of falling behind schedule, per SynMax satellite + AI analytics data, attributing delays to permitting, local opposition, and shortages in labour, power, and equipment.","claim_url":"https://agentmodeai.com/claims/PUB-2026-001/","source_type":"publication","source_name":"Financial Times","source_url":"https://www.pymnts.com/artificial-intelligence-2/2026/permitting-hurdles-and-labor-shortages-threaten-ai-data-center-timelines","source_snapshot_url":"https://web.archive.org/web/20260419215722/https://www.pymnts.com/artificial-intelligence-2/2026/permitting-hurdles-and-labor-shortages-threaten-ai-data-center-timelines","source_date":"2026-04-10","log_date":"2026-04-19","review_cadence":90,"domain_tags":["adoption-rate","market-position"],"current_verdict":"pending-review","next_review":"2026-07-18","review_history":[],"change_history":[]},{"id":"PUB-2026-002","claim":"UK financial regulators are holding urgent talks with the government's cyber-security agency and major banks to assess systemic risks posed by Anthropic's latest Claude model, per the Financial Times.","claim_url":"https://agentmodeai.com/claims/PUB-2026-002/","source_type":"publication","source_name":"Financial Times","source_url":"https://www.thestandard.com.hk/finance/article/329134/","source_snapshot_url":"https://web.archive.org/web/20260419215826/https://www.thestandard.com.hk/finance/article/329134/UK-financial-regulators-rush-to-assess-risks-of-Anthropic-latest-AI-model-FT-reports","source_date":"2026-04-15","log_date":"2026-04-19","review_cadence":60,"domain_tags":["governance","security","regulatory"],"current_verdict":"pending-review","next_review":"2026-06-18","review_history":[],"change_history":[]},{"id":"PUB-2026-003","claim":"HBR's March 2026 framing argues AI agents should be scaled as team members — with defined roles, performance reviews, onboarding, and accountability — rather than as tools.","claim_url":"https://agentmodeai.com/claims/PUB-2026-003/","source_type":"publication","source_name":"Harvard Business Review","source_url":"https://hbr.org/2026/03/to-scale-ai-agents-successfully-think-of-them-like-team-members","source_snapshot_url":"https://web.archive.org/web/20260419220214/https://hbr.org/2026/03/to-scale-ai-agents-successfully-think-of-them-like-team-members","source_date":"2026-03-01","log_date":"2026-04-19","review_cadence":90,"domain_tags":["governance","adoption-rate"],"current_verdict":"pending-review","next_review":"2026-07-18","review_history":[],"change_history":[]},{"id":"PUB-2026-004","claim":"MIT Sloan's 2026 AI & Data Leadership Executive Benchmark Survey reports that 38% of responding companies have appointed a Chief AI Officer or equivalent role.","claim_url":"https://agentmodeai.com/claims/PUB-2026-004/","source_type":"publication","source_name":"MIT Sloan Management Review","source_url":"https://mitsloan.mit.edu/ideas-made-to-matter/action-items-ai-decision-makers-2026","source_snapshot_url":"https://web.archive.org/web/20260314200046/https://mitsloan.mit.edu/ideas-made-to-matter/action-items-ai-decision-makers-2026","source_date":"2026-03-01","log_date":"2026-04-19","review_cadence":180,"domain_tags":["governance","adoption-rate"],"current_verdict":"pending-review","next_review":"2026-10-16","review_history":[],"change_history":[]},{"id":"PUB-2026-005","claim":"Harvard Business Review Analytic Services research finds that only 6% of companies fully trust AI agents to handle core business processes. 43% trust them only with limited or routine operational tasks; 39% restrict to supervised use cases or non-core processes.","claim_url":"https://agentmodeai.com/claims/PUB-2026-005/","source_type":"publication","source_name":"Harvard Business Review","source_url":"https://fortune.com/2025/12/09/harvard-business-review-survey-only-6-percent-companies-trust-ai-agents/","source_snapshot_url":"https://web.archive.org/web/20260220075113/https://fortune.com/2025/12/09/harvard-business-review-survey-only-6-percent-companies-trust-ai-agents/","source_date":"2025-12-09","log_date":"2026-04-19","review_cadence":180,"domain_tags":["governance","adoption-rate"],"current_verdict":"pending-review","next_review":"2026-10-16","review_history":[],"change_history":[]},{"id":"REG-2026-001","claim":"NIST released a concept note on 7 Apr 2026 for an AI RMF Profile on Trustworthy AI in Critical Infrastructure, signalling binding-adjacent guidance for critical-infrastructure operators.","claim_url":"https://agentmodeai.com/claims/REG-2026-001/","source_type":"regulatory","source_name":"NIST","source_url":"https://www.nist.gov/itl/ai-risk-management-framework","source_snapshot_url":"https://web.archive.org/web/20260419215544/https://www.nist.gov/itl/ai-risk-management-framework","source_date":"2026-04-07","log_date":"2026-04-19","review_cadence":180,"domain_tags":["governance","regulatory","compliance"],"current_verdict":"pending-review","next_review":"2026-10-16","review_history":[],"change_history":[]},{"id":"REG-2026-002","claim":"The European Commission published guidelines on the classification of high-risk AI systems in February 2026, establishing the interpretive framework for the 2 Aug 2026 Annex III enforcement deadline.","claim_url":"https://agentmodeai.com/claims/REG-2026-002/","source_type":"regulatory","source_name":"European Commission","source_url":"https://ai-act-service-desk.ec.europa.eu/en/faq","source_snapshot_url":"https://web.archive.org/web/20260419220239/https://ai-act-service-desk.ec.europa.eu/en/faq","source_date":"2026-02-15","log_date":"2026-04-19","review_cadence":90,"domain_tags":["regulatory","compliance","governance"],"current_verdict":"pending-review","next_review":"2026-07-18","review_history":[],"change_history":[]},{"id":"REG-2026-003","claim":"On 2 August 2026, Annex III high-risk AI system obligations under the EU AI Act (Articles 9–49) take full effect, with penalties up to €35M or 7% of global annual turnover for serious violations and €15M or 3% for high-risk non-compliance.","claim_url":"https://agentmodeai.com/claims/REG-2026-003/","source_type":"regulatory","source_name":"European Commission","source_url":"https://artificialintelligenceact.eu/article/6/","source_snapshot_url":"https://web.archive.org/web/20260419220323/https://artificialintelligenceact.eu/article/6/","source_date":"2026-02-01","log_date":"2026-04-19","review_cadence":180,"domain_tags":["regulatory","compliance","governance"],"current_verdict":"pending-review","next_review":"2026-10-16","review_history":[],"change_history":[]},{"id":"VEN-2026-001","claim":"Anthropic stated that Claude for Chrome reduced a financial services firm's accounts payable processing from 6 hours to under 30 minutes.","claim_url":"https://agentmodeai.com/claims/VEN-2026-001/","source_type":"vendor","source_name":"Anthropic","source_url":"https://claude.com/blog/claude-for-chrome","source_snapshot_url":"https://web.archive.org/web/20260419155325/https://claude.com/blog/claude-for-chrome","source_date":"2025-08-26","log_date":"2026-04-19","review_cadence":90,"domain_tags":["agent-roi","deployment-velocity"],"current_verdict":"pending-review","next_review":"2026-07-18","review_history":[],"change_history":[{"date":"2026-04-19","field_changed":"source_snapshot_url","old_value":"https://web.archive.org/web/20250826120000/https://www.anthropic.com/news/claude-for-chrome","new_value":"https://web.archive.org/web/20260419155325/https://claude.com/blog/claude-for-chrome","reason":"Initial snapshot URL was a Phase 1 placeholder with an estimated timestamp; resolved via Save Page Now to the actual archived capture. Canonical destination redirected from anthropic.com/news to claude.com/blog during the intervening period — both URLs serve the same content."},{"date":"2026-04-19","field_changed":"source_url","old_value":"https://www.anthropic.com/news/claude-for-chrome","new_value":"https://claude.com/blog/claude-for-chrome","reason":"Original URL returns HTTP 308 permanent redirect to claude.com/blog/claude-for-chrome (verified 19 Apr 2026). Updated source_url to the current canonical so future readers reach live content directly; the old URL still works via redirect and subsequent reviews can compare against both."}]},{"id":"VEN-2026-002","claim":"Google expanded AI Mode restaurant booking to 8 markets (UK, Australia, Canada, Hong Kong, India, New Zealand, Singapore, South Africa) with 8 partner platforms powering the backend.","claim_url":"https://agentmodeai.com/claims/VEN-2026-002/","source_type":"vendor","source_name":"Google","source_url":"https://blog.google/company-news/inside-google/around-the-globe/google-europe/united-kingdom/ai-mode-restaurants-uk/","source_snapshot_url":"https://web.archive.org/web/20260419221317/https://blog.google/company-news/inside-google/around-the-globe/google-europe/united-kingdom/ai-mode-restaurants-uk/","source_date":"2026-04-10","log_date":"2026-04-19","review_cadence":60,"domain_tags":["agent-procurement","market-position"],"current_verdict":"pending-review","next_review":"2026-06-18","review_history":[],"change_history":[]},{"id":"VEN-2026-003","claim":"Anthropic's Model Context Protocol crossed 97 million SDK installs by 25 March 2026, the fastest-adopted AI infrastructure standard in history. Anthropic donated MCP to the Agentic AI Foundation under the Linux Foundation, co-founded with Block and OpenAI.","claim_url":"https://agentmodeai.com/claims/VEN-2026-003/","source_type":"vendor","source_name":"Anthropic","source_url":"https://www.anthropic.com/news/donating-the-model-context-protocol-and-establishing-of-the-agentic-ai-foundation","source_snapshot_url":"https://web.archive.org/web/20260419215620/https://www.anthropic.com/news/donating-the-model-context-protocol-and-establishing-of-the-agentic-ai-foundation","source_date":"2026-03-25","log_date":"2026-04-19","review_cadence":90,"domain_tags":["market-position","agent-procurement","capability-benchmark"],"current_verdict":"pending-review","next_review":"2026-07-18","review_history":[],"change_history":[]},{"id":"VEN-2026-004","claim":"Microsoft's Copilot Wave 3 (9 Mar 2026) introduced Agent 365 as an enterprise agent governance/identity/security control plane, made Claude models available alongside OpenAI models inside Microsoft 365, and launched the M365 E7 licensing tier at $99/user/month.","claim_url":"https://agentmodeai.com/claims/VEN-2026-004/","source_type":"vendor","source_name":"Microsoft","source_url":"https://www.microsoft.com/en-us/microsoft-365/blog/2026/03/09/powering-frontier-transformation-with-copilot-and-agents/","source_snapshot_url":"https://web.archive.org/web/20260318005347/https://www.microsoft.com/en-us/microsoft-365/blog/2026/03/09/powering-frontier-transformation-with-copilot-and-agents/","source_date":"2026-03-09","log_date":"2026-04-19","review_cadence":60,"domain_tags":["market-position","governance","agent-procurement"],"current_verdict":"pending-review","next_review":"2026-06-18","review_history":[],"change_history":[]},{"id":"VEN-2026-005","claim":"Anthropic launched enterprise agent plug-ins for Claude in finance, engineering, and design verticals on 24 Feb 2026, alongside an expanded Accenture partnership that trains approximately 30,000 Accenture professionals on Claude.","claim_url":"https://agentmodeai.com/claims/VEN-2026-005/","source_type":"vendor","source_name":"Anthropic","source_url":"https://techcrunch.com/2026/02/24/anthropic-launches-new-push-for-enterprise-agents-with-plugins-for-finance-engineering-and-design/","source_snapshot_url":"https://web.archive.org/web/20260419220133/https://techcrunch.com/2026/02/24/anthropic-launches-new-push-for-enterprise-agents-with-plugins-for-finance-engineering-and-design/","source_date":"2026-02-24","log_date":"2026-04-19","review_cadence":60,"domain_tags":["market-position","agent-procurement"],"current_verdict":"pending-review","next_review":"2026-06-18","review_history":[],"change_history":[]},{"id":"VEN-2026-006","claim":"Salesforce closed 29,000 Agentforce deals in Q4 FY2026 (up 50% quarter-over-quarter), with Agentforce ARR reaching $800M and 48% quarter-over-quarter growth. CEO Benioff raised FY26 revenue guidance to $41.45-$41.55 billion.","claim_url":"https://agentmodeai.com/claims/VEN-2026-006/","source_type":"vendor","source_name":"Salesforce","source_url":"https://www.salesforceben.com/huge-agentforce-growth-in-salesforce-q4-as-benioff-mocks-saaspocalypse-narratives/","source_snapshot_url":"https://web.archive.org/web/20260307222535/https://www.salesforceben.com/huge-agentforce-growth-in-salesforce-q4-as-benioff-mocks-saaspocalypse-narratives/","source_date":"2026-02-27","log_date":"2026-04-19","review_cadence":30,"domain_tags":["agent-roi","market-position","adoption-rate"],"current_verdict":"holding","next_review":"2026-05-24","review_history":[{"date":"2026-04-24","verdict":"holding","oversight":"peter-led-deep-review","memo_url":"https://agentmodeai.com/claims/VEN-2026-006/review-2026-04-24/","counter_evidence_considered":null}],"change_history":[{"date":"2026-04-24","field_changed":"status","old_value":"pending-review","new_value":"holding","reason":"First scheduled review (cadence 30d); all three sub-claims re-verified against Salesforce IR Q4 FY26 release and corporate press release. Memo at review-2026-04-24."},{"date":"2026-04-24","field_changed":"review_history","old_value":"[]","new_value":"[{date: 2026-04-24, verdict: holding, oversight: peter-led-deep-review, memo_slug: review-2026-04-24}]","reason":"Logged first review entry for this claim. See memo at content/claims/VEN-2026-006/review-2026-04-24.mdx for full evidence chain."}]},{"id":"VEN-2026-007","claim":"Microsoft reported 15 million paid Microsoft 365 Copilot seats in FY2026 Q2 (28 Jan 2026). The number of customers with more than 35,000 Copilot seats tripled year-over-year. Agent 365 was introduced as an enterprise governance/identity/security control plane for agents.","claim_url":"https://agentmodeai.com/claims/VEN-2026-007/","source_type":"vendor","source_name":"Microsoft","source_url":"https://www.microsoft.com/en-us/investor/earnings/fy-2026-q2/press-release-webcast","source_snapshot_url":"https://web.archive.org/web/20260419220513/https://www.microsoft.com/en-us/investor/earnings/fy-2026-q2/press-release-webcast","source_date":"2026-01-28","log_date":"2026-04-19","review_cadence":30,"domain_tags":["adoption-rate","market-position"],"current_verdict":"holding","next_review":"2026-05-24","review_history":[{"date":"2026-04-24","verdict":"holding","oversight":"peter-led-deep-review","memo_url":"https://agentmodeai.com/claims/VEN-2026-007/review-2026-04-24/","counter_evidence_considered":null}],"change_history":[{"date":"2026-04-24","field_changed":"status","old_value":"pending-review","new_value":"holding","reason":"First scheduled review (cadence 30d); 15M paid seats, tripled >35k-seat customer cohort (with named accounts), and Agent 365 introduction all re-verified against Microsoft IR Q2 FY26 release and earnings call transcript. Memo at review-2026-04-24."},{"date":"2026-04-24","field_changed":"review_history","old_value":"[]","new_value":"[{date: 2026-04-24, verdict: holding, oversight: peter-led-deep-review, memo_slug: review-2026-04-24}]","reason":"Logged first review entry for this claim. See memo at content/claims/VEN-2026-007/review-2026-04-24.mdx for full evidence chain."}]},{"id":"VEN-2026-008","claim":"Google Cloud launched Gemini Enterprise for Customer Experience at NRF 2026 (11 Jan 2026), with Kroger, Lowe's, and Woolworths named as adopting customers using agentic capabilities for combined shopping + customer service workflows.","claim_url":"https://agentmodeai.com/claims/VEN-2026-008/","source_type":"vendor","source_name":"Google","source_url":"https://www.googlecloudpresscorner.com/2026-01-11-Google-Cloud-Brings-Shopping-and-Customer-Service-Together-with-Gemini-Enterprise-for-Customer-Experience","source_snapshot_url":"https://web.archive.org/web/20260419220604/https://www.googlecloudpresscorner.com/2026-01-11-Google-Cloud-Brings-Shopping-and-Customer-Service-Together-with-Gemini-Enterprise-for-Customer-Experience","source_date":"2026-01-11","log_date":"2026-04-19","review_cadence":60,"domain_tags":["adoption-rate","agent-procurement","market-position"],"current_verdict":"pending-review","next_review":"2026-06-18","review_history":[],"change_history":[]}]},"claims":[{"id":"AM-001","claim":"70% of AI-implementation failure is people and process, not technology — cultural transformation is the strongest predictor of AI ROI at the 2024-2025 maturity stage.","article_url":"https://agentmodeai.com/ai-readiness-in-organizations-the-2024-2025-landscape/","topic":"agentic-ai-governance","pub_date":"2026-04-19","last_reviewed":"2026-04-19","next_review":"2026-06-18","verdict":"holding","verdict_history":[{"date":"2026-04-19","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-002","claim":"Agentic AI's $3.50-per-dollar average return masks a 70% task-failure rate on the Carnegie Mellon benchmark; only narrowly-scoped deployments clear the reality bar.","article_url":"https://agentmodeai.com/the-agentic-ai-revolution-real-world-success-stories-and-strategic-insights-from-2024-2025/","topic":"agent-procurement","pub_date":"2026-04-19","last_reviewed":"2026-04-28","next_review":null,"verdict":"not_holding","verdict_history":[{"date":"2026-04-19","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-28","verdict":"partial","note":"Article retracted 28 Apr 2026. Slug language ('revolution', 'real-world success stories') carries hype register the publication explicitly avoids; survey-of-surveys structure does not stand up to source-verification at the level the publication now demands. Google has rejected the URL despite an active claim status. URL now redirects to /retractions/?retired=the-agentic-ai-revolution-real-world-success-stories-and-strategic-insights-from-2024-2025. Claim withdrawn — status moves to Not holding, no further reviews scheduled."},{"date":"2026-05-06","verdict":"partial","note":"URL state changed. The /the-agentic-ai-revolution-real-world-success-stories-and-strategic-insights-from-2024-2025/ slug now serves a deliberately rewritten retrospective (claimId AM-130, \"Agentic AI 2024-2025 retrospective\", published 04 May 2026) against audited primary sources. The 28 Apr 2026 redirect to /retractions/ has been lifted to allow that. AM-002 the claim remains Not holding — the original $3.50/dollar + 70% failure-rate framing was withdrawn and is not restored. AM-130 is a separate claim with its own evidence chain. Readers arriving at /holding/AM-002 see the withdrawal here; the article link surfaces the new piece at the URL the original lived at, with this entry as the audit trail."}],"primary_sources":[]},{"id":"AM-003","claim":"GPT-5 Pro's tiered-subscription model forces enterprises to classify problems by computational difficulty — $200/month premium routing only repays for the top decile of 'very hard' queries.","article_url":"https://agentmodeai.com/gpt-5-pro-vs-enterprise-ai-agents-what-very-hard-problems-means-for-your-business/","topic":"enterprise-ai-cost","pub_date":"2026-04-19","last_reviewed":"2026-04-19","next_review":"2026-05-19","verdict":"holding","verdict_history":[{"date":"2026-04-19","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-013","claim":"Q1 2026 is the quarter enterprise agentic-AI crossed three thresholds simultaneously — the first at-scale in-the-wild exploits, the first vendor-shipped governance infrastructure, and the first hard ROI data — and programmes designed around only one will not make the 28% that pay off.","article_url":"https://agentmodeai.com/agentic-ai-got-real-q1-2026/","topic":"agentic-ai-governance","pub_date":"2026-04-19","last_reviewed":"2026-04-19","next_review":"2026-06-18","verdict":"holding","verdict_history":[{"date":"2026-04-19","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-014","claim":"The ~73% of enterprise agentic-AI projects that fail share three structural gaps — no named owner, scope drift, and missing agent-level MTTD — and the 27% that succeed cluster around the inverse.","article_url":"https://agentmodeai.com/why-73-of-agentic-ai-projects-fail-and-how-the-27-generate-312-roi/","topic":"enterprise-ai-cost","pub_date":"2025-08-03","last_reviewed":"2026-04-28","next_review":null,"verdict":"not_holding","verdict_history":[{"date":"2025-08-03","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-19","verdict":"partial","note":"Article predates the Holding-up standard. Retroactive claim assigned on 19 Apr 2026. Initial verdict 'Partial' — spine is defensible, per-claim numeric verification deferred to +60d review. Body not rewritten per AGENTMODE_PHASE2_BRIEF §114."},{"date":"2026-04-28","verdict":"partial","note":"Article retracted 28 Apr 2026. Both stats in the slug ('73% fail', '312% ROI') were backfilled with status: partial on 19 Apr 2026 noting the article predates editorial standard. Body never rewritten. Google's quality algorithm has independently flagged the URL nine days later. The slug carries the structural problem. URL now redirects to /retractions/?retired=why-73-of-agentic-ai-projects-fail-and-how-the-27-generate-312-roi. Claim withdrawn — status moves to Not holding, no further reviews scheduled."}],"primary_sources":[]},{"id":"AM-015","claim":"An agentic-AI Center of Excellence justifies its overhead only after the organisation has three production agents running; before that, it over-governs an experimental footprint.","article_url":"https://agentmodeai.com/building-a-center-of-excellence-for-agentic-ai-in-it-operations-complete-enterprise-guide/","topic":"agentic-ai-governance","pub_date":"2025-08-01","last_reviewed":"2026-04-19","next_review":"2026-06-18","verdict":"partial","verdict_history":[{"date":"2025-08-01","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-19","verdict":"partial","note":"Article predates the Holding-up standard. Retroactive claim assigned on 19 Apr 2026. Initial verdict 'Partial' — spine is defensible, per-claim numeric verification deferred to +60d review. Body not rewritten per AGENTMODE_PHASE2_BRIEF §114."}],"primary_sources":[]},{"id":"AM-016","claim":"Agent-mediated network management reduces unplanned firewall-change incident costs only when the agent's action log feeds into the same change-management audit trail human changes use — not as a parallel system.","article_url":"https://agentmodeai.com/the-7-2m-firewall-change-that-transformed-network-management-how-agentic-ai-prevents-it-disasters/","topic":null,"pub_date":"2025-07-27","last_reviewed":"2026-04-28","next_review":null,"verdict":"not_holding","verdict_history":[{"date":"2025-07-27","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-19","verdict":"partial","note":"Article predates the Holding-up standard. Retroactive claim assigned on 19 Apr 2026. Initial verdict 'Partial' — spine is defensible, per-claim numeric verification deferred to +60d review. Body not rewritten per AGENTMODE_PHASE2_BRIEF §114."},{"date":"2026-04-28","verdict":"partial","note":"Article retracted 28 Apr 2026. '$7.2M' figure in the slug cannot be traced to any disclosed firewall-change incident. Body never rewritten past 19 Apr 2026 backfill. The dollar specificity in the URL is the structural problem and Google's quality algorithm has independently flagged the URL. URL now redirects to /retractions/?retired=the-7-2m-firewall-change-that-transformed-network-management-how-agentic-ai-prevents-it-disasters. Claim withdrawn — status moves to Not holding, no further reviews scheduled."}],"primary_sources":[]},{"id":"AM-017","claim":"Agentic AI's durable enterprise pattern is redeployment-first, not replacement-first. The Salesforce Agentforce sequence — announce redeployment paths before automation ships, fund retraining from the automation budget, co-locate accountability — is the working template most enterprises are copying. Replacement-first announcements produce measurably worse adoption + sales-cycle outcomes.","article_url":"https://agentmodeai.com/the-day-9000-people-asked-to-be-replaced/","topic":null,"pub_date":"2025-07-19","last_reviewed":"2026-04-28","next_review":null,"verdict":"not_holding","verdict_history":[{"date":"2025-07-19","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-19","verdict":"partial","note":"Article predates the Holding-up standard. Retroactive claim assigned on 19 Apr 2026. Initial verdict 'Partial' — spine is defensible, per-claim numeric verification deferred to +60d review. Body not rewritten per AGENTMODE_PHASE2_BRIEF §114."},{"date":"2026-04-19","verdict":"partial","note":"Anchor verification complete (see audit/ANCHOR_VERIFICATION_2026-04-19.md). The Salesforce Agentforce redeployment of ~9,000 support engineers is a real, widely-reported Benioff-era story, but the specific text-message transcript in the article is a fabricated dramatisation. Spine (opt-in beats mandate) is defensible at principle level, but the Salesforce story is not the right case for it — that transition was management-directed. Rewrite flagged for before 18 Jun 2026 review."},{"date":"2026-04-19","verdict":"partial","note":"Body rewritten. Fabricated text-message transcript removed. Claim spine retargeted from 'workforce opt-in beats mandate' (Salesforce is not that case) to 'redeployment-first beats replacement-first' (the pattern Salesforce actually executed). Status moves from Partial to Up. Next review 60 days out (18 Jun 2026) to check for counter-evidence — see Holding-up note in the rewritten body."},{"date":"2026-04-28","verdict":"partial","note":"Article retracted 28 Apr 2026. Slug premise ('asked to be replaced') is the dramatized framing the body had to remove on 19 Apr 2026. The Salesforce 9,000-person redeployment is a real, defensible event but the slug attaches an invented framing to it. Body preserved in archived/. Google has independently rejected the URL. URL now redirects to /retractions/?retired=the-day-9000-people-asked-to-be-replaced. Claim withdrawn — status moves to Not holding, no further reviews scheduled."}],"primary_sources":[]},{"id":"AM-018","claim":"Agentic AI's compounding economics show up in back-office operations (AP, IT ticket triage, HR onboarding, procurement, close-cycle reconciliation), not in front-office customer-facing workflows. The 12% of deployments that clear 300%+ ROI cluster there for structural reasons: per-action savings × action frequency × task-specification tightness × existing process instrumentation.","article_url":"https://agentmodeai.com/the-executive-who-discovered-her-competitors-secret-weapon/","topic":null,"pub_date":"2025-07-19","last_reviewed":"2026-04-28","next_review":null,"verdict":"not_holding","verdict_history":[{"date":"2025-07-19","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-19","verdict":"partial","note":"Article predates the Holding-up standard. Retroactive claim assigned on 19 Apr 2026. Initial verdict 'Partial' — spine is defensible, per-claim numeric verification deferred to +60d review. Body not rewritten per AGENTMODE_PHASE2_BRIEF §114."},{"date":"2026-04-19","verdict":"partial","note":"Anchor verification complete (see audit/ANCHOR_VERIFICATION_2026-04-19.md). 'Sarah Chen' and the 2 AM Munich-hotel scenario are fully fabricated — the article's narrative protagonist does not correspond to any real executive. The underlying framework (back-office cost compounding faster than front-office wins; per-action delta × frequency) IS defensible against McKinsey + Futurum operational-AI-ROI data. Rewrite required before the article can move to Holding."},{"date":"2026-04-19","verdict":"partial","note":"Body rewritten. Fabricated 'Sarah Chen' narrative frame removed entirely. Claim spine sharpened: original was 'back-office cost compounding faster than front-office'; new version adds the structural explanation (per-action × frequency × task-specification × measurement instrumentation) and specific 2026 benchmark anchors (Stanford DEL 12%/88%, Gartner 28%, Futurum 71% vs 40%). Status moves from Partial to Up. Cross-links to AM-020 (TCO), AM-021 (measurement discipline), AM-022 (bimodal ROI) explicitly drawn in the body. Next review 18 Jun 2026."},{"date":"2026-04-28","verdict":"partial","note":"Article retracted 28 Apr 2026. Slug structure (fictional protagonist, 'discovered her competitors' secret weapon') is the fabricated narrative frame the body had to remove on 19 Apr 2026. Body rewritten with Stanford DEL / McKinsey / Futurum sourcing (preserved in archived/) but the slug is the structural problem. URL now redirects to /retractions/?retired=the-executive-who-discovered-her-competitors-secret-weapon. Claim withdrawn — status moves to Not holding, no further reviews scheduled."}],"primary_sources":[]},{"id":"AM-019","claim":"Manufacturing deployments hitting the 30% unplanned-downtime-reduction benchmark share one architectural pattern — the agent writes its actions into the plant's existing MES/CMMS audit trail rather than a parallel log. Parallel-log deployments underperform by a factor of 2-3.","article_url":"https://agentmodeai.com/manufacturing-4-0-how-multi-agent-systems-reduce-downtime-by-30/","topic":"agentic-ai-governance","pub_date":"2025-08-01","last_reviewed":"2026-04-19","next_review":"2026-06-18","verdict":"holding","verdict_history":[{"date":"2025-08-01","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-19","verdict":"partial","note":"Body rewritten. Original headline number (30% downtime reduction) survives against current case-study data. New analytical spine: the audit-trail architecture separates wins from stalls. Status moved from rewrite-in-progress Partial placeholder to Up. Next review 60 days out because architectural claims age slower than pricing claims."}],"primary_sources":[]},{"id":"AM-020","claim":"The 40-60% TCO underestimate on enterprise agentic-AI deployments is not a cost-visibility failure — it is a cross-departmental cost-attribution failure. Integration, tokens, maintenance, supervision, and compliance costs land on IT, HR, and Legal budgets that do not reconcile in most organisations, so the CFO sees the bill late and partial.","article_url":"https://agentmodeai.com/the-hidden-costs-of-agentic-ai-a-cfos-guide-to-true-tco-and-roi-modeling/","topic":"enterprise-ai-cost","pub_date":"2025-07-31","last_reviewed":"2026-04-19","next_review":"2026-06-18","verdict":"holding","verdict_history":[{"date":"2025-07-31","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-19","verdict":"partial","note":"Body rewritten from WP-era slop. Status moves from rewrite-in-progress placeholder to Up. New analytical spine: the TCO underestimate is cross-departmental cost-attribution failure, not hidden costs. Five cost categories named with budget owners. 60-day review cadence."}],"primary_sources":[]},{"id":"AM-021","claim":"The 87% vs 27% success-rate gap between Six-Sigma and non-Six-Sigma organisations on agentic-AI deployments reflects pre-existing measurement discipline, not the DMAIC methodology itself. Agents require a clean baseline, defect definition, documented root-cause analysis, and a change-management gate — four conditions that ISO 9001, ITIL, SRE, or HACCP practices produce just as reliably.","article_url":"https://agentmodeai.com/dmaic-for-agentic-ai-deployment/","topic":"agentic-ai-governance","pub_date":"2025-08-16","last_reviewed":"2026-04-19","next_review":"2026-06-18","verdict":"holding","verdict_history":[{"date":"2025-08-16","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-19","verdict":"partial","note":"Body rewritten from WP-era slop. Status moves from rewrite-in-progress placeholder to Up. New thesis: the causation runs the opposite direction from the vendor narrative — the measurement discipline was the prerequisite, the methodology name doesn't matter. 60-day review."},{"date":"2026-04-28","verdict":"partial","note":"Slug migration to §6a-compliant URL: from-dmaic-to-ai-agents-how-traditional-optimization-methods-accelerate-agentic-ai-success → dmaic-for-agentic-ai-deployment. Body unchanged from 19 Apr rewrite, only the URL changed. Old slug 308-redirects to new. Reason: the long descriptive slug carried §6a-grade friction (88+ chars, vendor-cliche framing) and Google's quality algorithm had flagged the original URL as low-quality (per the 28 Apr 2026 GSC drilldown showing it in the 'Crawled - currently not indexed' bucket). The clean slug preserves the analytical content while removing the URL-level quality penalty."}],"primary_sources":[]},{"id":"AM-022","claim":"The 171% average ROI on enterprise agentic-AI deployments is the mean of a bimodal distribution — roughly 12% of deployments clear 300%+ and 88% sit at or below break-even. The single factor distinguishing the clusters is not a multi-pattern framework; it is whether business-line (not IT) ownership held the kill-switch and accountability before the deployment shipped.","article_url":"https://agentmodeai.com/the-agentic-ai-success-formula-7-proven-patterns-driving-171-roi-in-enterprise-deployments/","topic":null,"pub_date":"2025-08-06","last_reviewed":"2026-04-28","next_review":null,"verdict":"not_holding","verdict_history":[{"date":"2025-08-06","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-19","verdict":"partial","note":"Body rewritten from WP-era slop (7-patterns vendor framework with fabricated case studies). New thesis: bimodal distribution, not normal — the 171% average describes no specific deployment. Business-line kill-switch ownership is the single distinguishing factor. Cross-links to AM-020 + AM-021 on the shared organisational-precondition thread."},{"date":"2026-04-28","verdict":"partial","note":"Article retracted 28 Apr 2026. Slug carries '171% ROI' as a category average and a '7 proven patterns' framework that the body had to disown — the rewritten body explicitly argues 171% is the mean of a bimodal distribution, not a benchmark. Body rewritten 19 Apr 2026 (preserved in archived/) but the slug contradicts the rewritten thesis and Google has rejected the URL. URL now redirects to /retractions/?retired=the-agentic-ai-success-formula-7-proven-patterns-driving-171-roi-in-enterprise-deployments. Claim withdrawn — status moves to Not holding, no further reviews scheduled."}],"primary_sources":[]},{"id":"AM-023","claim":"The 10 Apr 2026 Google AI Mode rollout to eight markets is the first vertical (restaurant booking) where agentic search reduces named SaaS aggregators (OpenTable, TheFork, ResDiary and five others) to API backends rather than destinations. The template applies to every enterprise-relevant aggregation vertical — business travel, expense management, procurement, ATS, HR service delivery — and incumbents in those verticals have 18-24 months to pick API-backend or destination positioning before agentic search forces the choice.","article_url":"https://agentmodeai.com/google-ai-mode-restaurant-booking-the-50-billion-business-revolution-every-ceo-must-understand-2025/","topic":"enterprise-ai-cost","pub_date":"2025-08-23","last_reviewed":"2026-04-19","next_review":"2026-06-18","verdict":"holding","verdict_history":[{"date":"2025-08-23","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-19","verdict":"partial","note":"Body rewritten from WP-era slop (the '$50 Billion Revolution' headline and 'act within 90 days' crisis-FOMO framing were both fabrications). New thesis: restaurant booking is a template, not the story. Named 5 enterprise-relevant aggregation verticals (business travel, expense, procurement, ATS, HR service) and the API-backend-vs-destination choice incumbents face. Next review in 60 days."}],"primary_sources":[]},{"id":"AM-024","claim":"Enterprise-AI decisions in 2026 are made on a citation chain nobody in the chain verifies. The infrastructure gap CIOs face is a verification layer for the claims their procurement runs on — not an information gap. The 88% failure rate in enterprise agentic AI is the predictable output of decision-making on unverified citations, not a capability problem.","article_url":"https://agentmodeai.com/the-unverified-citation-chain-where-enterprise-ai-decisions-actually-come-from/","topic":"agentic-ai-governance","pub_date":"2026-04-20","last_reviewed":"2026-04-20","next_review":"2026-06-19","verdict":"holding","verdict_history":[{"date":"2026-04-20","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-025","claim":"Enterprise agentic AI governance in 2026 fails at the operational layer even when it passes at the compliance layer. Boards receive EU-AI-Act-mapped compliance decks while the agentic deployments actually shipping out of IT ops have no measurable overlap with that deck. Durability requires six instrumented dimensions scored 0–100 (GAUGE framework) with a 90-day setup cadence and a 12-month trajectory target — not a compliance matrix.","article_url":"https://agentmodeai.com/the-enterprise-agentic-ai-governance-playbook-2026/","topic":"agentic-ai-governance","pub_date":"2026-04-24","last_reviewed":"2026-04-24","next_review":"2026-06-23","verdict":"holding","verdict_history":[{"date":"2026-04-24","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-026","claim":"Generic enterprise SaaS RFPs systematically underweight six agent-specific governance dimensions (governance maturity, threat model, ROI evidence, change management, vendor lock-in, compliance posture). A 60-question RFP layer mapped to the GAUGE framework materially changes vendor selection outcomes by disqualifying vendors whose operational governance will not survive the 18-month enterprise review cycle.","article_url":"https://agentmodeai.com/the-enterprise-agentic-ai-rfp-60-questions/","topic":"agent-procurement","pub_date":"2026-04-24","last_reviewed":"2026-04-24","next_review":"2026-06-23","verdict":"holding","verdict_history":[{"date":"2026-04-24","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-027","claim":"A durable enterprise agentic AI business case requires three specific documents — a TCO model with ten named cost categories (not vendor-supplied line items), an ROI model with a pre-deployment measured baseline and an independent validation round, and a three-scenario risk-adjusted NPV. The single-scenario vendor-framed business cases that dominate 2026 enterprise AI investment committees are the predictable root of the 40%+ projected agentic AI project cancellation rate.","article_url":"https://agentmodeai.com/the-cfos-agentic-ai-business-case-tco-and-roi/","topic":"enterprise-ai-cost","pub_date":"2026-04-24","last_reviewed":"2026-04-24","next_review":"2026-06-23","verdict":"holding","verdict_history":[{"date":"2026-04-24","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-028","claim":"Partner — co-development with a vendor on a structured non-standard engagement — is structurally under-chosen in enterprise agentic AI procurement in 2026. Procurement committees have templates for build and buy but none for partner, so the third path does not get evaluated on an equal footing. The vendor-lock-in and change-management dimensions of the GAUGE framework usually favour partner when it is honestly evaluated, not buy or build.","article_url":"https://agentmodeai.com/build-vs-buy-vs-partner-for-enterprise-agentic-ai-2026/","topic":"enterprise-ai-cost","pub_date":"2026-04-24","last_reviewed":"2026-04-24","next_review":"2026-06-23","verdict":"holding","verdict_history":[{"date":"2026-04-24","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-029","claim":"The 12/88 bimodal distribution in enterprise agentic AI ROI realisation (Stanford DEL 2026 + cross-validated by Gartner, McKinsey, CMU) is a governance-discipline outcome, not a model-capability outcome. The 12% instrument the six GAUGE dimensions on a 90-day review rhythm; the 88% treat governance as a deliverable to the audit committee. Capability gap (CMU's 30.3% best-in-class task completion) constrains what is possible, not what separates the 12% from the 88%.","article_url":"https://agentmodeai.com/why-88-percent-of-agentic-ai-deployments-fail/","topic":"enterprise-ai-cost","pub_date":"2026-04-24","last_reviewed":"2026-04-24","next_review":"2026-06-23","verdict":"holding","verdict_history":[{"date":"2026-04-24","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-030","claim":"The McKinsey State of AI 2025 figure (23% of enterprises scaling an agentic AI system, 39% still experimenting) is an operational-preconditions outcome, not a technical-readiness outcome. Four preconditions (agent registry, measured pre-deployment baseline, differentiated change-management playbook for adjacent units, cross-agent threat model at scale) separate pilots that cross into production from pilots that stall. The 6% AI-high-performer segment is the subset of the 23% scaling with additional measurement discipline that makes ROI audit-survivable.","article_url":"https://agentmodeai.com/the-mckinsey-23-percent-agentic-ai-scaling-gap/","topic":"enterprise-ai-cost","pub_date":"2026-04-24","last_reviewed":"2026-04-24","next_review":"2026-06-23","verdict":"holding","verdict_history":[{"date":"2026-04-24","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-031","claim":"The CMU TheAgentCompany 2026 benchmark figure (30.3% task completion for best-in-class frontier model, up from 24% in 2024) is the current capability constraint for enterprise agentic AI. Capability trajectory projects to ~40% by late 2027, which does not cross the 95% production-readiness threshold within the 3-year TCO horizon enterprise business cases operate against. The Stanford DEL 12% durable cohort operates within the 30.3% (narrow scope + human-in-the-loop + GAUGE-dimensional governance discipline), not around it. Capability is not the variable that separates the 12% from the 88%.","article_url":"https://agentmodeai.com/the-cmu-30-percent-agent-capability-gap/","topic":"enterprise-ai-cost","pub_date":"2026-04-24","last_reviewed":"2026-04-24","next_review":"2026-06-23","verdict":"holding","verdict_history":[{"date":"2026-04-24","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-032","claim":"EU financial-services agentic AI deployments operate under a compounded five-framework obligation surface (DORA, NIS2, MiFID II, EU AI Act, GDPR) that sits on top of general AI governance. Liability does not transfer to the vendor contractually regardless of SLA language — MiFID II conduct rules, EU AI Act deployer obligations, and DORA third-party-risk provisions place customer-facing and regulator-facing liability on the deploying financial institution. Compliance-posture and vendor-lock-in are the dominant GAUGE dimensions for the sector, scoring 15-25 points lower than cross-industry averages on first pass.","article_url":"https://agentmodeai.com/agentic-ai-in-financial-services-compliance-and-liability/","topic":"agentic-ai-governance","pub_date":"2026-04-24","last_reviewed":"2026-04-24","next_review":"2026-06-23","verdict":"holding","verdict_history":[{"date":"2026-04-24","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-033","claim":"The McKinsey 17%-EBIT-attributable-to-genAI figure, the most-cited single statistic in 2026 enterprise agentic AI procurement decisions, is a self-reported attribution from McKinsey's State of AI 2025 survey of approximately 1,491 respondents. The way it is typically read in CIO decks, as evidence that 17% of enterprises have produced 5% or more of EBIT from genAI, materially overstates what the survey supports. The figure documents 17% of survey respondents asserting that level of attribution, not 17% of enterprises producing it under audited measurement.","article_url":"https://agentmodeai.com/the-mckinsey-17-percent-ebit-claim/","topic":"enterprise-ai-cost","pub_date":"2026-04-25","last_reviewed":"2026-04-25","next_review":"2026-06-24","verdict":"holding","verdict_history":[{"date":"2026-04-25","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-034","claim":"AI assistants and AI agents are not the same product class. An AI assistant is a productivity-augmentation tool that suggests; an AI agent is an automation-execution system that acts on a downstream surface (tools, APIs, write-paths). Conflating them in 2026 enterprise procurement produces the most common single category mistake — buying an assistant under the assumption it is an agent, or buying an agent and governing it as if it were an assistant. The risk profile, contract structure, audit obligation, and TCO model differ categorically.","article_url":"https://agentmodeai.com/ai-assistant-vs-ai-agent/","topic":"agent-procurement","pub_date":"2026-04-25","last_reviewed":"2026-04-25","next_review":"2026-06-24","verdict":"holding","verdict_history":[{"date":"2026-04-25","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-035","claim":"The EU AI Act enforcement deadline of 2 August 2026 applies high-risk-system obligations under Articles 9 through 49 to most enterprise agentic AI deployments operating in EU jurisdiction or providing services to EU nationals — not only to deployments explicitly classified within the Annex III high-risk categories. The compliance gap most enterprises face is structural: the Act requires evidence-of-action production (logs, oversight records, post-market monitoring, incident reports) that most agentic deployments do not generate by default. Building the evidence layer post-hoc, after a regulator request, is the failure mode.","article_url":"https://agentmodeai.com/eu-ai-act-agentic-ai-compliance/","topic":"agentic-ai-governance","pub_date":"2026-04-25","last_reviewed":"2026-04-25","next_review":"2026-06-24","verdict":"holding","verdict_history":[{"date":"2026-04-25","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-036","claim":"Enterprise shadow AI in 2026 is structurally different from enterprise shadow AI in 2024. The 2024 framing assumed unsanctioned tool adoption — workers pasting confidential data into consumer ChatGPT or installing browser extensions outside IT review. The 2026 reality is that the larger blast radius is agentic capability silently activating inside already-approved tools, often through configuration changes (Custom GPT actions, Copilot custom agents, MCP server connections from approved IDEs) that the original procurement approval did not anticipate. Discovery has to look at capability state, not vendor identity. Most enterprise shadow-AI inventories built against the 2024 framing miss 50 to 80% of the actual exposure surface.","article_url":"https://agentmodeai.com/shadow-ai-discovery-playbook/","topic":"shadow-ai-discovery","pub_date":"2026-04-25","last_reviewed":"2026-04-25","next_review":"2026-06-24","verdict":"holding","verdict_history":[{"date":"2026-04-25","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-037","claim":"AI agents are structurally different from earlier classes of non-human identity (service accounts, API keys, machine certificates, bot identities), and the IAM platforms most enterprises run in 2026 cannot represent them adequately because those platforms authorise on principal identity rather than on per-action behavioural context. The 92% of enterprises that report low IAM confidence for agentic AI are not configured wrong; they are running an identity model with one structural axis where the agentic deployment requires four (identity, behaviour, context, revocation). The remediation is a four-layer extension on top of existing IAM, not a rip-and-replace migration. Most enterprises can ship the augmentation in 8 to 12 weeks of engineering.","article_url":"https://agentmodeai.com/non-human-identity-ai-agents/","topic":"non-human-identity","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-038","claim":"Model Context Protocol (MCP) reached enterprise procurement gravity in 18 months, faster than typical interoperability standards. The 10,000+ active public MCP servers, adoption by ChatGPT, Cursor, Gemini, Microsoft Copilot, and VS Code, and the December 2025 Linux Foundation donation made MCP a tooling-layer choice that ripples through every adjacent agentic-AI procurement decision: which agents connect to which enterprise systems, which audit boundaries hold, which vendor lock-in patterns activate. The actual procurement decision enterprise IT faces is not whether to adopt MCP (the question is moot once any approved tool ships MCP support); it is the scope-and-governance decision: which MCP servers the enterprise allows agents to connect to, what scopes those connections grant, and how cross-agent delegation through MCP is monitored. Treating MCP as a binary adoption question rather than a scope-and-governance question is the most common enterprise procurement mistake on this surface in 2026.","article_url":"https://agentmodeai.com/mcp-enterprise-agent-tooling/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-039","claim":"The 2026 enterprise agentic AI vendor comparison reduces to four credible platform plays (Anthropic, OpenAI, Google, Microsoft), and the procurement decision between them is no longer primarily about model capability. The model layer has converged to comparable parity for most enterprise use cases. The procurement decision in 2026 is on three other axes: pricing model (Anthropic Managed Agents at 8 cents per session-hour plus tokens versus OpenAI Agents SDK at no first-party runtime fee versus Microsoft and Google's vertically-integrated platform pricing), governance and BAA posture (Anthropic's three-cloud BAA position is structurally distinct), and ecosystem distribution (Microsoft's Office plus Azure footprint has no near peer; Google's vertical integration on Workspace and Cloud is second). Treating this as a model-quality bake-off is the most common 2026 procurement mistake and produces decisions that age badly within the first 12 months.","article_url":"https://agentmodeai.com/enterprise-ai-agent-vendor-comparison/","topic":"agent-procurement","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-040","claim":"Enterprise agentic AI in 2026 is in its first year of operational consequence rather than its first year of capability. The deployment record across multiple independent datasets shows a stable bimodal distribution (a small high-performing tail clearing 300%+ ROI and a much larger struggling body at or below break-even), four credible platform plays converging at the vendor layer, a structurally inadequate IAM posture across 92% of enterprises, and a 14-week runway to the EU AI Act August 2026 enforcement window. The aggregate signal is that the year's defining variable is deployment discipline, not model capability or vendor selection. The 6% AI-high-performer segment and the 12% Stanford DEL high-ROI cohort instrument six specific governance dimensions on a 90-day review cadence; the remaining 88-94% mostly do not.","article_url":"https://agentmodeai.com/state-of-enterprise-agentic-ai/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-041","claim":"The 2026 enterprise agentic AI procurement playbook resolves to a six-stage sequence that integrates the build-vs-buy-vs-partner decision, the 60-question agentic AI RFP, the GAUGE governance scoring, the four-vendor comparison, and the EU AI Act compliance scaffolding into one operational track. Most enterprises in 2026 run these as separate work streams owned by separate functions, which produces structurally inconsistent procurement records and substantial duplicate effort. The integrated six-stage track ships in 8 to 10 weeks for standard environments and produces an audit-defensible per-deployment procurement artifact that satisfies the EU AI Act Article 9 risk-management system requirement by construction.","article_url":"https://agentmodeai.com/enterprise-agentic-ai-procurement-playbook/","topic":"agent-procurement","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-042","claim":"The 6% AI-high-performer cohort identified by McKinsey and the 12% high-ROI cohort identified by the Stanford Digital Economy Lab share ten measurable governance practices that an enterprise can audit in under 60 minutes. An enterprise answering YES to 8 or more of the 10 diagnostic questions has the operating profile of the high-performing segment. An enterprise answering YES to 4 or fewer has the operating profile of the 88-94% struggling cohort and is unlikely to clear break-even on agentic AI deployment without a posture rebuild. The diagnostic audits posture, not outcomes; it identifies where governance investment is needed before the next deployment commitment, not whether a specific deployment will succeed.","article_url":"https://agentmodeai.com/agentic-ai-readiness-diagnostic/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-043","claim":"The OWASP Agentic Security Initiative's threat taxonomy for agentic AI (memory poisoning, tool misuse, privilege compromise, resource overload, cascading hallucination, intent breaking, misaligned and deceptive behaviour, repudiation and untraceability, identity spoofing, overwhelming human-in-the-loop) maps cleanly onto seven specific enterprise controls: scoped non-human identity, action-class approval gates, decision audit logging at Article 12 evidence quality, MTTD-for-Agents layered detection, deployment-tier resource quotas, behavioural drift monitoring, and HITL throughput limits. An enterprise that operates these seven controls covers all ten OWASP threat classes; an enterprise missing more than two of the controls has structural exposure to at least four of the threat classes.","article_url":"https://agentmodeai.com/owasp-agentic-ai-top-10-walkthrough/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-07-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-044","claim":"Six well-documented public agentic AI deployment failures from 2024-2025 (Air Canada bereavement-refund chatbot, NYC MyCity small-business chatbot, Replit production-database wipe, Cursor unauthorised code deletion, Klarna customer-service reversal, DPD chatbot escalation incident) cluster into three structural failure modes: (1) the agent acts as a binding agent of the enterprise without disclosure or approval, (2) the agent operates with permissions the deployment never authorised, (3) the agent's economic case requires a service quality the deployment cannot sustain. Each failure mode maps to a specific control from the seven-control surface; all six failures would have been mitigated by controls already specified in the OWASP Agentic AI Top 10 enterprise walkthrough. The pattern is consistent enough that an enterprise can use the cases as a procurement filter: any vendor unable to point to its specific control posture against each of the three failure modes is not procurement-ready.","article_url":"https://agentmodeai.com/agentic-ai-failure-case-studies/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-07-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-045","claim":"EchoLeak (CVE-2025-32711, disclosed by Aim Security in June 2025 against Microsoft 365 Copilot) is the canonical example of a class of attacks rather than a single vulnerability: cross-agent prompt injection in which a malicious payload travels through ordinary content channels (an email, a shared document, a calendar invite, a tool response) into one or more agents' context windows, where it manipulates the agents into actions the deploying enterprise did not authorise, with no user interaction required. The attack class is structurally inherent to any architecture in which an LLM-based agent ingests untrusted content and has tool surfaces capable of exfiltration or action; closing the class requires architectural separation between content-ingest and tool-execution privileges, not point-fixes against specific exploit chains. Enterprises in 2026 operating multiple agents that share context, share memory, or hand off tasks to each other are structurally exposed to the EchoLeak class until the architectural separation is implemented.","article_url":"https://agentmodeai.com/echoleak-cross-agent-prompt-injection/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-046","claim":"EU AI Act Article 12 (record-keeping for high-risk AI systems) and Article 19 (record retention by providers) are operationalised for agentic AI by a 14-field audit-evidence template that captures every agent decision in a regulator-queryable form: deployment ID, agent identity, session ID, ISO timestamp, user prompt, retrieved context with provenance, model output, planned action, action class, approval reference, executed action, tool-call audit chain, output disclosure surface, and policy version. Logs retained for the regulatory minimum (typically 6 months for the EU AI Act baseline, 5 to 7 years for sector-specific overlays like HIPAA and SOX) in a queryable format that supports under-4-business-hour evidence assembly. An enterprise that captures the 14 fields, retains them for the maximum applicable period, and instruments the queryable export has substantially completed Article 12 compliance for the agent layer; the residual work is integrating the agent log stream with the broader audit substrate.","article_url":"https://agentmodeai.com/eu-ai-act-article-12-audit-evidence/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-047","claim":"The Head of AI Governance role (variant titles: Chief AI Officer, VP AI Strategy, Director of Responsible AI) is now a named operating role in 60% of Fortune 100 enterprises per Forrester's 2026 Enterprise AI Predictions, and is the strongest single predictor of an enterprise's score on Q10 of the readiness diagnostic. The role's effective shape converges on six accountabilities: cross-functional governance ownership, EU AI Act compliance posture, vendor procurement gate-keeping, deployment kill-criterion enforcement, audit-evidence substrate ownership, and internal upskilling. The role reports to the executive committee (CEO direct or CFO/COO) rather than to IT, security, or legal, because matrixed reporting into existing functions reproduces the matrixed-shared-accountability failure pattern. Compensation in 2026 ranges from $250-450K base for the Director tier, $400-700K for VP tier, and $600K-$1.2M total comp at the C-level, with significant equity components in growth-stage and tech enterprises.","article_url":"https://agentmodeai.com/head-of-ai-governance-role/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-048","claim":"The NIST AI Risk Management Framework (AI RMF 1.0, published January 2023, with the Generative AI Profile published July 2024) maps onto enterprise agentic AI deployment work across its four functions (Govern, Map, Measure, Manage) using the same artefacts an enterprise produces for EU AI Act Article 9. Specifically: NIST Govern maps to the Head of AI Governance role and the AI governance committee; NIST Map maps to the deployment inventory and the OWASP Agentic Top 10 walkthrough; NIST Measure maps to the 14-field Article 12 audit substrate plus the GAUGE governance dimensions; NIST Manage maps to the kill-criterion enforcement and the seven-control surface. An enterprise that has the EU AI Act preparation track running has substantially completed NIST AI RMF coverage and can document the mapping as a single cross-reference matrix. The reverse mapping (NIST → EU AI Act) requires more work because NIST is voluntary in posture and the EU AI Act is operational; an enterprise that started with NIST as the framework needs to extend audit substrate granularity and add the Article 73 incident-reporting workflow.","article_url":"https://agentmodeai.com/nist-ai-rmf-agentic-ai-mapping/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-07-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-049","claim":"Enterprise multi-agent architectures resolve to three orchestration patterns (hierarchical, peer-to-peer, broker-mediated) with materially different governance properties: hierarchical concentrates accountability at the orchestrator and is the easiest to audit but the most exposed to orchestrator-compromise; peer-to-peer distributes accountability and is the most resilient to single-agent failure but the hardest to audit; broker-mediated centralises the inter-agent communication path and is the most defensible against the cross-agent prompt-injection class. The choice of pattern is not a free architectural decision in 2026 because the EU AI Act's Article 9 risk-management requirements and the OWASP Agentic AI threat surface impose specific control obligations on each pattern. An enterprise should default to broker-mediated for new deployments above the high-risk threshold; hierarchical is acceptable for low-risk and contained deployments; peer-to-peer should be avoided in production agentic AI in 2026 unless the audit substrate is materially stronger than vendor-native baseline.","article_url":"https://agentmodeai.com/multi-agent-architecture-playbook/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-07-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-050","claim":"The A2A (Agent2Agent) protocol announced by Google Cloud in April 2025 is the most credible 2026 candidate for an open standard for cross-vendor agent-to-agent interoperability, with backing from 50+ partners across the enterprise software ecosystem (Salesforce, SAP, ServiceNow, MongoDB, Atlassian, and others). The protocol layer covers what MCP (Model Context Protocol) does not: MCP is for agent-to-tool communication, A2A is for agent-to-agent communication. The two protocols are designed to be complementary rather than competing. A2A's adoption trajectory through 2026 will determine whether broker-mediated multi-agent patterns become the cross-vendor default; current trajectory points to deployment-grade stability in the second half of 2026, with widespread enterprise adoption following in 2027. Enterprises selecting agent platforms in 2026 should require A2A roadmap commitments from any vendor whose product will participate in cross-vendor agent workflows.","article_url":"https://agentmodeai.com/a2a-agent-to-agent-protocol/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-051","claim":"Enterprise AI governance organisational design resolves to three operating models in 2026: centralised (a single AI governance function owns policy, procurement, audit, and kill-criterion enforcement enterprise-wide), federated (each business unit owns its AI deployments with cross-unit coordination through a small central function), and hybrid (a central function owns regulatory and procurement; business units own deployment operations and ROI accountability). The dominant 2026 pattern in Fortune 500 enterprises is hybrid, because purely centralised models do not scale past 50-100 deployments and purely federated models cannot satisfy EU AI Act Article 9 risk-management documentation consistency. The right model for a given enterprise depends on three variables: deployment count, regulatory exposure, and the maturity of the existing risk-management organisation. The hybrid model is structurally superior to the alternatives once an enterprise crosses approximately 30 production deployments or operates in two or more EU AI Act high-risk Annex III categories.","article_url":"https://agentmodeai.com/centralized-vs-federated-ai-governance/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-07-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-052","claim":"Enterprise agentic AI vendor contracts in 2026 require eight specific exit-clause provisions that standard SaaS contract templates do not adequately cover: (1) full audit-log export with retention, (2) trained-state extraction or destruction guarantee, (3) prompt and configuration portability, (4) tool-and-MCP-connector reconfiguration support during transition, (5) named-individual handoff for in-flight deployments, (6) regulatory-evidence preservation through transition, (7) data-residency continuity, (8) liability-tail coverage for agent actions taken before the transition completes. An enterprise that signs an agentic AI contract without these eight provisions has effectively created a one-way procurement decision; the realistic cost of a forced transition without the provisions is materially higher than the contract value, which inverts the procurement leverage. The provisions add typically modest contract complexity but materially change the enterprise's negotiating posture and the vendor's incentive structure during the relationship.","article_url":"https://agentmodeai.com/ai-agent-contract-exit-clauses/","topic":"agent-procurement","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-07-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-053","claim":"HIPAA-compliant agentic AI deployment in U.S. healthcare in 2026 requires four conditions that materially constrain vendor selection and architectural design: (1) the vendor offers a BAA covering the specific agent workflow including any subprocessors and any tools the agent calls, (2) the agent's audit log structure satisfies HIPAA 164.312(b) audit controls AND the EU AI Act Article 12 14-field structure simultaneously, (3) PHI flows through agent tool calls are explicitly mapped and authorised under the HIPAA Privacy Rule's minimum necessary standard, (4) the agent's behavioural drift monitoring includes correctness against clinical-decision benchmarks, not just engagement or business-metric benchmarks. Anthropic's three-cloud BAA position (covering AWS, GCP, and Azure deployment surfaces) is structurally distinct in the 2026 vendor landscape and materially expands healthcare deployment options. The OCR's 340% spike in AI-related discrimination complaints (logged in 2025) makes audit-substrate readiness the highest-priority preparatory work for any healthcare AI deployment going into production in 2026.","article_url":"https://agentmodeai.com/hipaa-compliant-agentic-ai-healthcare/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-054","claim":"Public-sector agentic AI deployment in 2026 operates under five constraints that materially narrow the vendor and architectural options compared to private-sector deployment: (1) FedRAMP authorisation (Moderate or High depending on data sensitivity) is required for federal deployments and increasingly for state, (2) sovereign data residency requirements (data and model inference must remain within national or sub-national boundaries), (3) procurement transparency obligations (the deployment, the vendor, and the decision logic typically must be publicly disclosed), (4) explicit accountability under administrative law (decisions affecting individuals are subject to due-process and appeal frameworks that the agent must support), (5) FOIA-equivalent disclosure of audit logs to the public on request. Public-sector deployments cannot reasonably use peer-to-peer multi-agent patterns and cannot accept vendors without published government cloud SKUs; the realistic 2026 options are Microsoft Azure Government, AWS GovCloud-deployed Anthropic, Google Cloud Public Sector, and a small number of specialist government-AI vendors. The NYC MyCity case (claim AM-044) is the canonical 2026 public-sector failure illustrating what happens when the constraints are inadequately addressed.","article_url":"https://agentmodeai.com/public-sector-agentic-ai/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-07-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-055","claim":"Retail and logistics agentic AI deployments in 2026 cluster around five workflow patterns with substantially different governance properties: customer-service agents (the Klarna failure case applies directly, claim AM-044), inventory and demand-forecasting agents (operationally lower-risk but with material accuracy requirements), dynamic-pricing agents (carry antitrust exposure that is structurally distinct from other AI risks), supply-chain orchestration agents (multi-party data flows that complicate audit substrate ownership), and returns-and-fraud-detection agents (consumer-protection law exposure including disparate-impact claims). The dominant 2026 production pattern is augmentation rather than replacement of human operators; deployments framed as headcount-replacement have produced reversals at material rates (the Klarna pattern). Retailers and 3PLs (third-party logistics providers) operating across multiple jurisdictions face an additional layer of consumer-protection law fragmentation that the EU AI Act does not pre-empt and that materially affects the deployment scope.","article_url":"https://agentmodeai.com/retail-logistics-ai-agents/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-07-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-056","claim":"Enterprise AI agent ROI calculation in 2026 requires a structured eight-input model that captures the costs and benefits the standard SaaS-style ROI calculator misses: (1) per-session-hour or per-task model cost at the deployment's actual usage profile, (2) human-in-the-loop labour cost including approval-gate review time, (3) deployment-layer instrumentation cost (audit substrate, drift monitoring, MTTD detection), (4) regulatory compliance cost amortised across the deployment's revenue, (5) productivity uplift on existing human staff (the augmentation case), (6) avoided cost from reduced incident rate and reduced kill-criterion losses, (7) revenue impact net of service-quality regression risk, (8) the strategic-option value of the deployment's underlying capability. The calculation produces a 90-day ROI checkpoint figure, a 12-month payoff figure, and a kill-criterion threshold. The calculation also produces a sensitivity table showing which inputs drive the ROI most heavily; cost-side sensitivity is typically dominated by inputs 2 and 3, revenue-side by inputs 5 and 7. Most 2026 enterprise AI deployments evaluated against this model break even between months 9 and 18; deployments outside that range are either materially under-investing in instrumentation (faster apparent ROI) or are operating in unfavourable cost structures (longer payoff).","article_url":"https://agentmodeai.com/ai-agent-roi-calculator/","topic":"agent-procurement","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-07-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-057","claim":"The enterprise AI agent risk register for 2026 resolves to a 12-column template that captures every risk an enterprise must document under EU AI Act Article 9 and NIST AI RMF Manage function: risk ID, deployment ID, threat class (per OWASP Agentic AI Top 10), likelihood, impact, inherent risk score, control mapping (against the seven-control surface), residual risk score, named accountable individual, review cadence, status, last-reviewed date. The register is operated by the Head of AI Governance, reviewed monthly in the AI governance committee, and queryable in the under-4-business-hour Article 73 incident-response window. The 12-column template integrates the threat surface (OWASP Agentic AI Top 10, claim AM-043), the controls (seven-control surface, claim AM-043), the audit substrate (claim AM-046), and the kill-criterion enforcement (claim AM-047), into a single living artefact. An enterprise that operates the register seriously has substantially completed the Article 9 risk-management system documentation requirement; the register is the single artefact that resolves the cross-reference matrix between operational reality and regulatory framework.","article_url":"https://agentmodeai.com/ai-agent-risk-register-template/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-061","claim":"Production agentic-AI costs at scale routinely run multiples of POC projections, and a layered optimisation programme covering model tiering, vendor prompt caching, batch APIs, context-window discipline, and observability budgeting closes most of the gap.","article_url":"https://agentmodeai.com/the-2m-ai-bill-that-became-200k-the-enterprise-cost-optimization-playbook-for-production-ai-agents/","topic":"enterprise-ai-cost","pub_date":"2025-07-27","last_reviewed":"2026-04-28","next_review":"2026-06-27","verdict":"holding","verdict_history":[{"date":"2025-07-27","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-28","verdict":"partial","note":"Rewritten 27-28 Apr 2026 from 27 Jul 2025 WordPress-migrated original. Original used a fictional CTO scene (Marcus Chen, $4.2B logistics company, 9:47 AM Tuesday Seattle), fabricated case figures ($2.1M to $187K monthly, named-company before/after teardowns), fabricated expert quotes (Patricia Williams VP of Engineering at Walmart; David Park Principal at Goldman Sachs), and banned phrases (plot twist, the dirty secret, revolutionary, emoji subheads). Rewrite extracts the verifiable cost-driver categories with primary-source citations from Anthropic's published multi-agent token-ratio research, vendor prompt caching and batch-API pricing pages, McKinsey State of AI, Andreessen Horowitz on LLM inference economics, and Gartner's April 2026 I&O finding. Approved + published 28 Apr 2026."}],"primary_sources":[]},{"id":"AM-063","claim":"AI agents executing financial transactions need a four-control bundle (action-approval gates by blast radius, kill-switch protocols, decision-audit trails, per-action revocation); enterprises shipping agentic-AI without this bundle face CISO governance pressure they cannot satisfy under existing model-risk-management, FFIEC, and EU AI Act expectations.","article_url":"https://agentmodeai.com/your-ai-agents-just-approved-2-7m-in-vendor-payments-and-other-nightmares-keeping-cisos-awake/","topic":"agentic-ai-governance","pub_date":"2025-07-27","last_reviewed":"2026-04-28","next_review":"2026-06-27","verdict":"holding","verdict_history":[{"date":"2025-07-27","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-28","verdict":"partial","note":"Rewritten 27-28 Apr 2026 from 27 Jul 2025 WordPress-migrated original. Original used fictional Seattle CISO scene with fabricated $2.7M case, fabricated cohort scheduling, emoji subheads, and 'battle-tested' hype. Rewrite extracts the verifiable control-set framework with primary-source citations (NIST AI RMF, NIST AI 600-1 Generative AI Profile, FFIEC IT Examination Handbook, SR 11-7, OCC Bulletin 2011-12, ISACA AI Audit Toolkit, Cloud Security Alliance MAESTRO framework). Cross-links to the live AM-037 non-human-identity piece as the identity-layer companion. Approved + published 28 Apr 2026."}],"primary_sources":[]},{"id":"AM-100","claim":"AI-authored + human-signed publications produce more verifiable enterprise-AI commentary than human-only or anonymous-AI alternatives, when the AI authorship is paired with a public claim ledger and dated correction log.","article_url":"https://agentmodeai.com/ai-writes-about-ai-tracked-claims-case/","topic":"agentic-ai-governance","pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-07-25","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-101","claim":"Across the named analyst-publication comparable set (Stratechery, The Information, the Substack analyst stack, the Big-4 research blogs, Gartner, Forrester, IDC) as of late April 2026, none maintains a public claim ledger — a tracked register of every primary claim with scheduled reviews, dated verdicts, and a public correction log. The absence is structural, not accidental, and explains why none of the category produces the kind of audit-able commentary the Holding-up system makes possible.","article_url":"https://agentmodeai.com/why-this-publication-has-a-ledger/","topic":"agentic-ai-governance","pub_date":"2026-04-28","last_reviewed":"2026-04-28","next_review":"2026-07-27","verdict":"holding","verdict_history":[{"date":"2026-04-28","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-102","claim":"Among the comparable publications surveyed in AM-101 (Stratechery, The Information, the Substack analyst stack, the Big-4 research blogs, Gartner, Forrester, IDC) as of late April 2026, none uses the disclosed-AI-author + named-human-signatory + public-claim-ledger format. The combination is structurally rare and the rarity is what makes the format consequential, not the disclosed AI authorship alone.","article_url":"https://agentmodeai.com/the-ai-author-signature-decision/","topic":"agentic-ai-governance","pub_date":"2026-04-28","last_reviewed":"2026-04-28","next_review":"2026-07-27","verdict":"holding","verdict_history":[{"date":"2026-04-28","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-103","claim":"Across two of the three Q1 2026 ventures Peter built with Claude (agentmodeai, Rhino-basketball; DealVex pending git-versioning), rework rate measured as deletions / total git churn ranged from 8.1% to 13.5% over the 90-day window from 28 Jan to 28 Apr 2026. The data is meaningfully lower than typical solo-developer projects but substantially higher than the 'AI codes it correctly the first time' marketing narrative implies, supporting the thesis that AI-paired development requires explicit measurement, not assumed productivity.","article_url":"https://agentmodeai.com/learning-ai-by-doing-ai-the-data/","topic":"agentic-ai-governance","pub_date":"2026-04-28","last_reviewed":"2026-04-28","next_review":"2026-07-27","verdict":"holding","verdict_history":[{"date":"2026-04-28","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-104","claim":"Anthropic's withholding of Claude Mythos forces senior IT teams to advance their AI cyber-threat-model timeline by two to three years, and to rebuild three specific assumption sets — patch prioritization, third-party risk on AI infrastructure, and AI procurement diligence — inside Q2 2026.","article_url":"https://agentmodeai.com/claude-mythos-cio-risk-posture/","topic":"agentic-ai-governance","pub_date":"2026-04-27","last_reviewed":"2026-04-27","next_review":"2026-06-26","verdict":"holding","verdict_history":[{"date":"2026-04-27","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-105","claim":"Organizations that have not adopted an offensive-security operating mode (continuous attack-surface validation, AI-augmented internal vulnerability discovery, standing threat-hunting, deception, counter-AI controls) by Q4 2026 will show measurably wider mean-time-to-detect for AI-assisted attackers than peers that have, in industry-survey data published in late 2026 and early 2027.","article_url":"https://agentmodeai.com/offensive-security-cio-clockspeed/","topic":"agentic-ai-governance","pub_date":"2026-04-27","last_reviewed":"2026-04-27","next_review":"2026-07-26","verdict":"holding","verdict_history":[{"date":"2026-04-27","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-106","claim":"Loaded human FTE cost ($90K-$180K all-in for typical knowledge work) vs total agentic-AI operational cost (token plus orchestration plus integration plus observability plus human oversight) does not favour replacement at parity in 2026 for most roles; the math works for narrow, high-volume, low-judgment task categories and breaks down where regulatory accountability, customer trust, or judgment-under-ambiguity is load-bearing.","article_url":"https://agentmodeai.com/agentic-ai-vs-human-worker-cost-economics/","topic":"enterprise-ai-cost","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-28","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026. Initial verdict 'Partial' — spine is observable from current public deployment cost data and labour-displacement research, per-category quantitative bands tracked against next review cycle. REVIEW: Peter — please verify claim text + cited sources before removing rewriteInProgress flag."}],"primary_sources":[]},{"id":"AM-107","claim":"The 2026 insurance market does not yet offer agent-specific E&O policies in any mature form; existing cyber and tech-E&O policies were drafted against human-error and software-defect risk models that don't cleanly map to autonomous reasoning actors. Enterprises shipping agentic-AI face an underwriting gap: the cyber policy may not respond to a loss caused by an agent's reasoning step, and the professional-liability policy may exclude AI-generated outputs entirely. CIOs and CROs need to surface this gap with their broker before the loss event, not after.","article_url":"https://agentmodeai.com/agentic-ai-insurance-and-underwriting/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-28","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026 as a staged draft (rewriteInProgress: true). Status set to Partial because the underlying market is in a transitional phase and per-carrier wording specifics may shift inside the 60-day review window. REVIEW: Peter to verify (a) the Lloyd's Lab Cohort 12 dating and submissions detail, (b) the Munich Re aiSure agentic-deployment extension claim, (c) the NAIC Model Bulletin scope, (d) whether the AIG CyberEdge and Chubb Integrity+ AI endorsement language descriptions reflect the most recent product updates, and (e) the MGA list (Armilla, Vouch, Coalition, Relm) is currently in market with AI-liability paper before promoting from staged draft to published."}],"primary_sources":[]},{"id":"AM-108","claim":"Agentic-AI data-residency requirements are not cleanly inherited from existing GDPR cross-border transfer practice. Agent context windows, retrieval indexes, and reasoning traces all create new categories of personal-data processing that have to be located, documented, and (for high-risk Annex III deployments) data-resident inside the EEA before EU AI Act Article 16 enforcement opens on 2 August 2026. The deployment topology has to shift to single-region EEA-resident for high-risk systems; hub-and-spoke remains defensible for general-purpose deployments under documented GDPR Chapter V transfer mechanisms.","article_url":"https://agentmodeai.com/agentic-ai-data-residency-eu-ai-act/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-28","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026. Initial verdict 'Partial' — spine is anchored to the Act itself plus current vendor compliance pages, but the four-surface Article-mapping has not yet been tested against an enforced case (the August 2026 enforcement window opens inside the next review cycle). REVIEW: Peter — please verify claim text + Article references + vendor citations before removing rewriteInProgress flag."}],"primary_sources":[]},{"id":"AM-109","claim":"Enterprises focused on the headcount-reduction half of agentic-AI transformation are systematically under-budgeting the retraining cost for the residual workforce, and programmes that ship the cuts without simultaneously shipping the upskilling produce a 6-12 month productivity dip that erases the early ROI.","article_url":"https://agentmodeai.com/agentic-ai-retraining-gap-survivors/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-28","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026. Initial verdict 'Partial' — the productivity-dip duration is observable from current public workforce data but the 6-12 month band has not been tested against post-2026 enterprise case data yet. REVIEW: Peter — please verify claim text + cited sources before removing rewriteInProgress flag."}],"primary_sources":[]},{"id":"AM-110","claim":"Traditional SLAs (uptime, p95 latency, error rate) are structurally insufficient for autonomous agentic-AI; the four metrics that actually work are action-bounded availability, MTTD-for-Agents, output-distribution drift, and per-class action error budget, and vendors that cannot expose the telemetry these require are not yet production-ready against the 2026 enterprise procurement bar.","article_url":"https://agentmodeai.com/agentic-ai-sla-architecture/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-28","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026. Initial verdict 'Partial' — the four metrics are observable from current SRE/OTel practice but have not been tested as a procurement bar against 2026 vendor SLAs yet. REVIEW: Peter — please verify claim text + cited primary sources (especially the OpenTelemetry GenAI stable-promotion date and the Anthropic/MS Agent Framework reliability docs) before removing rewriteInProgress flag."}],"primary_sources":[]},{"id":"AM-111","claim":"The right enterprise playbook for an agent incident in 2026 has six steps that do not appear in any standard SRE handbook — action-class containment before root-cause analysis, reasoning-trace forensics, blast-radius reconstruction across downstream agents and systems, stakeholder notification with the specific failure mode named, regulatory exposure assessment for in-scope deployments, and selective re-enable with degraded-mode guardrails — and CIOs without this playbook will spend their first agent incident discovering it under crisis conditions.","article_url":"https://agentmodeai.com/agent-incident-response-playbook/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-28","verdict":"partial","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026. Initial verdict 'Partial' — six-step playbook is a synthesis from current SRE practice + AI-specific guidance and has not been tested against a major published agent-incident postmortem yet. REVIEW: Peter — please verify claim text + cited primary sources before removing rewriteInProgress flag."}],"primary_sources":[]},{"id":"AM-112","claim":"Healthcare agentic-AI sits across three regulatory regimes that do not compose cleanly — HIPAA on PHI handling and BAA topology, FDA software-as-medical-device guidance on clinical decision support and predetermined change control, and state medical/nursing board licensure rules placing the practitioner as the responsible party of record — and the five-control bundle of BAA-aware architecture, PCCP, clinical-judgement-of-record audit trail, on/off-switch with practitioner attribution, and breach-notification readiness is the minimum defensible architecture for any clinical agentic-AI deployment.","article_url":"https://agentmodeai.com/healthcare-agentic-ai-governance/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-28","verdict":"partial","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-113","claim":"Standard 2026 agentic-AI vendor MSAs contain six contract patterns that systematically transfer risk from vendor to enterprise customer in ways that do not appear in equivalent pre-AI enterprise software MSAs — model-version unilateral-change, training-data ambiguity on customer inputs, usage-cap auto-escalation, indemnification carve-outs for model output, data-residency commitments that don't bind sub-processors, and liability caps tied to fees-paid that don't scale with autonomous-action authority.","article_url":"https://agentmodeai.com/agentic-ai-vendor-contract-gotchas/","topic":"agent-procurement","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-28","verdict":"partial","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-114","claim":"Production agentic-AI in 2026 needs four observability layers — infrastructure, LLM-call, trace, and output — and most enterprise deployments instrument only the cheaper subset (Layers 1 and 2 plus partial Layer 3); the failure modes Layers 3 and 4 catch (multi-step reasoning failure and output-distribution drift) are the ones EU AI Act Article 9 and Article 17 evidence obligations from 2 Aug 2026 onward will require coverage of, and the four layers compose directly into the four AM-110 SLA metrics (action-bounded availability, MTTD-for-Agents, output-distribution drift, per-class action error budget).","article_url":"https://agentmodeai.com/agent-observability-stack-production/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-28","verdict":"partial","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026. Initial verdict 'Partial' — the four-layer model is observable from current 2026 tool categories and OpenTelemetry GenAI convergence, but the procurement-or-build cost bands are publication estimates and have not been tested across a representative sample of enterprise deployments. REVIEW: Peter — please verify (1) the OpenTelemetry GenAI stable-promotion date (13 Mar 2026) is consistent with what AM-110 cites; (2) the cost-band ranges in the §Share-thoughts template are defensible as our-estimate or need tightening; (3) Datadog AI Observability and New Relic AI Monitoring product names are current; (4) Arize Phoenix open-source/managed dual-form description is accurate; (5) the CNCF OpenTelemetry GenAI working-group framing matches the actual project structure before removing rewriteInProgress flag."}],"primary_sources":[]},{"id":"AM-115","claim":"Agent Mode AI publishes a public quarterly review of every claim it has made, with verdict before/after, named primary-source movement, and aggregate verdict-change rate across the corpus. The bulletin runs on a fixed quarterly cadence (end of Apr, Jul, Oct, Jan); the rhythm is the editorial discipline the niche has been missing.","article_url":"https://agentmodeai.com/q2-2026-claim-review-bulletin/","topic":null,"pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-07-30","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026 — the first Quarterly Claim Review Bulletin. The claim itself is recursive: it asserts that the bulletin will ship quarterly, and the next review (30 Jul 2026) tests whether the Q3 bulletin actually appeared. Status starts as 'up' because the claim is currently true (the Q2 bulletin shipped). The verdict at end of July 2026 will move to Holding, Partial (bulletin shipped but on a delayed cadence), or Not holding (no bulletin shipped). REVIEW: Peter — please verify claim text + cadence wording before removing rewriteInProgress flag."}],"primary_sources":[]},{"id":"AM-116","claim":"A class of derivative actions is forming in 2025-2026 around board failure to supervise AI deployments under the Caremark line, and D&O carriers are responding at renewal with explicit AI questionnaires and emerging exclusions, materially shifting director liability exposure that most boards have not yet read in their actual policy language.","article_url":"https://agentmodeai.com/directors-officers-insurance-ai-supervision-claim/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-30","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-117","claim":"AI Bill of Materials (AI-BOM) is moving from optional security artefact to enforceable procurement requirement in 2026, driven by EU AI Act Article 11 + Annex IV technical-documentation requirements (effective 2 August 2026) and the CycloneDX ML-BOM and SPDX 3.0 specifications. Enterprise SBOM programs need three specific extensions (generation path for AI components, AI-specific risk correlation feeds, procurement-side language for AI-BOM delivery).","article_url":"https://agentmodeai.com/ai-bill-of-materials-supply-chain-disclosure/","topic":"agent-procurement","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-30","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-118","claim":"As of April 2026 the largest sovereign-wealth and pension funds (NBIM, CalPERS, ABP, OTPP, USS) have published almost no formal AI position papers, despite trillion-dollar AI exposure across portfolios. The structural absence is the signal: AI is being rated by these investors but the rating criteria have not been formally codified, leaving public-company IR teams preparing engagement against expectations the investors have not yet written down.","article_url":"https://agentmodeai.com/pension-fund-sovereign-wealth-ai-policy-void/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-30","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-119","claim":"The 2026 cyber-insurance renewal tightening enterprises are experiencing is upstream-driven by reinsurance market repricing of catastrophic AI tail risk (Lloyd's of London, Munich Re, Swiss Re), not by primary-carrier loss data. The reinsurance signal travels via tighter treaty terms, AI-specific exclusions, and elevated retentions, with a 6-12 month lag to primary policies. Enterprise risk officers negotiating against the primary on AI terms have limited room because the carrier's own treaty caps what it can offer.","article_url":"https://agentmodeai.com/reinsurance-market-ai-tail-risk-pricing/","topic":"agent-procurement","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-30","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-120","claim":"AI agent deployments touching employee work in EU jurisdictions with co-determination law (Germany BetrVG §87, Netherlands WOR Art. 27, France CSE provisions) require works council consent before activation in 2026. Most US-headquartered AI vendors lack a customer-success workflow for this, producing a class of stalled rollouts that read as 'vendor delay' but are actually compliance gaps. Total EU-site timeline from selection to production is 6-9 months when handled well, 12-18 when consultation begins late.","article_url":"https://agentmodeai.com/works-council-ai-agent-deployment-eu/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-30","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-121","claim":"AI in IT operations in mid-2026 delivers measurable productivity gains (UK Government Digital Service trial: 26 minutes per user per day across 20,000 staff; BT pilot: 35% case-resolution-time reduction with named CIO on the record; ServiceNow's own help desk: 90% L1 deflection in vendor-internal optimal conditions) but the staff-reduction story is structurally smaller than vendor pitches suggest. Gartner finds only 11% of Fortune 500 companies have actually cut support headcount via AI; Forrester reports 55% of AI-attributed layoffs are regretted and roughly half are reversed; CRMArena-Pro shows multi-step agent reliability at ~35%. The cost saving lands first on the BPO/contractor line, second on contractor spend, and only slowly and controversially on direct headcount. Agentic L2/L3 remediation remains pilot-stage: per Gartner's October 2025 survey of 360 IT app leaders, only 15% are considering, piloting, or deploying fully autonomous agents, and Gartner predicts >40% of agentic AI projects will be cancelled by end-2027.","article_url":"https://agentmodeai.com/ai-it-operations-reality-check/","topic":"agentic-ai-implementation","pub_date":"2026-05-02","last_reviewed":"2026-05-02","next_review":"2026-07-03","verdict":"holding","verdict_history":[{"date":"2026-05-02","verdict":"holding","note":"Claim created at publish."},{"date":"2026-05-02","verdict":"partial","note":"Klarna walk-back primary-source upgrade — added Siemiatkowski verbatim quotes via Bloomberg-cited-by-Fortune (9 May 2025) and the Uber-style freelance hiring detail via Entrepreneur. Closes the highest-priority evidence gap from the source dossier."}],"primary_sources":[]},{"id":"AM-122","claim":"The four credible 2026 agent-evaluation platforms (DeepEval, Braintrust, LangSmith, Patronus AI) do not compete on capability rank; each fits a distinct deployment shape (engineering-led eval-as-code; SaaS-first eval-as-product; LangChain-stack-native bundled with observability; research-grade hallucination + simulation), and picking by capability matrix produces the wrong procurement outcome for most enterprises. The structurally load-bearing eval-vs-observability split (companion piece AM-123) compounds this: 'is the agent right' and 'what did the agent do' are different procurement decisions answered by different platforms.","article_url":"https://agentmodeai.com/agent-eval-frameworks-deepeval-braintrust-langsmith-patronus/","topic":"agent-procurement","pub_date":"2026-05-03","last_reviewed":"2026-05-03","next_review":"2026-07-02","verdict":"holding","verdict_history":[{"date":"2026-05-03","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-123","claim":"Evaluation answers 'is the agent right'; observability answers 'what did the agent do'. The four credible 2026 agent-observability platforms (Langfuse, Arize, Helicone, LangSmith) split cleanly on a single structural axis: open-source-first vs SaaS-first. Helicone has been in maintenance mode since 3 March 2026 (founders joined Mintlify) and should not be selected for greenfield 2026 deployments. Production deployments need both eval and observability; the procurement decisions are different and conflating them produces SLA architecture that fails its first incident.","article_url":"https://agentmodeai.com/agent-observability-langfuse-arize-helicone-langsmith/","topic":"agent-procurement","pub_date":"2026-05-03","last_reviewed":"2026-05-03","next_review":"2026-07-02","verdict":"holding","verdict_history":[{"date":"2026-05-03","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-124","claim":"Pharma and life sciences agentic AI in 2026 inherits five regulatory regimes simultaneously (21 CFR Part 11, GxP under GAMP 5 Second Edition, EMA Annex 11 in 2025-2026 revision, the EMA Reflection Paper on AI in the medicinal product lifecycle, and the EU AI Act). The audit substrate that satisfies any one regime does not by default satisfy the others. The 2026 procurement gap is treating the regimes as substitutable. Four conditions materially constrain compliant deployment (validated computerised system status under GAMP 5 plus CSA; 17-field audit trail covering Part 11 + Annex 11 + Article 12 simultaneously; ALCOA+ data integrity with contemporaneous, original, enduring records; EU AI Act high-risk-system registration with Article 11 technical file plus Article 16 post-market monitoring). Three vendor postures emerge in market (pre-validated Category 4 packaging; general-purpose platform plus customer-validated wrapper; open-source stack plus customer-engineered audit substrate).","article_url":"https://agentmodeai.com/pharma-life-sciences-agentic-ai-21-cfr-part-11/","topic":"agentic-ai-governance","pub_date":"2026-05-03","last_reviewed":"2026-05-03","next_review":"2026-08-01","verdict":"holding","verdict_history":[{"date":"2026-05-03","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-125","claim":"ITSM agent procurement in 2026 is not three independent vendors but two acquirer ecosystems plus one product line at the intersection: ServiceNow (acquirer; Now Assist native plus Moveworks acquired 15 Dec 2025 for $2.4B closed consideration vs the announced $2.85B) and Automation Anywhere (acquired Aisera Nov 2025). The procurement decision in 2026 is shaped less by the feature matrix than by the post-acquisition reality. Picking by feature matrix without mapping the acquirer's strategic interest produces the wrong answer. ServiceNow Now Assist is the bolt-on for organisations already on ServiceNow; Moveworks is the omnichannel layer (still standalone branding, ServiceNow-owned); Aisera is the auto-resolution play that competes on closure rate, now under Automation Anywhere's portfolio.","article_url":"https://agentmodeai.com/servicenow-now-assist-vs-moveworks-vs-aisera/","topic":null,"pub_date":"2026-05-03","last_reviewed":"2026-05-03","next_review":"2026-07-02","verdict":"holding","verdict_history":[{"date":"2026-05-03","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-126","claim":"The OWASP Agentic AI Top 10 names what to defend against; it does not say how to test that the defences work. The 2026 enterprise red-team for agentic systems is a distinct discipline from generalised pen-testing, with its own methodology (four disciplines: prompt injection, tool misuse, context-window attacks, multi-turn objective drift), tooling stack (PyRIT v0.13.0, Garak, custom harnesses, MITRE ATLAS for structured threat-modelling vocabulary), evidence model (six-section report including ATLAS technique mapping plus residual-risk plus EU AI Act Article 12 substrate alignment plus Article 16 post-market monitoring recommendations), and procurement decisions (in-house vs specialist-vendor vs hybrid). Most enterprises run the wrong test (generalised application pen-test) and pass it; the passing report is the procurement evidence that produces false confidence.","article_url":"https://agentmodeai.com/agent-red-teaming-owasp-companion/","topic":"agentic-ai-governance","pub_date":"2026-05-03","last_reviewed":"2026-05-03","next_review":"2026-07-02","verdict":"holding","verdict_history":[{"date":"2026-05-03","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-127","claim":"Of the eleven claims this publication has published against the 2 August 2026 EU AI Act enforcement deadline, the four operational-evidence claims (AM-108 data residency, AM-046 audit-evidence under four hours, AM-117 AI-BOM procurement, AM-120 works council workflow) carry materially higher risk of moving from Holding to Partial in Q3 2026 than the two governance-process claims (AM-047 Head of AI Governance role, AM-051 centralised-vs-federated). Materially higher risk is defined as: at least three of the four operational-evidence claims will be downgraded to Partial or Not holding by 1 October 2026, while at least one of the two governance-process claims will remain Holding.","article_url":"https://agentmodeai.com/90-days-eu-ai-act-enforcement-what-corpus-says/","topic":"agentic-ai-governance","pub_date":"2026-05-03","last_reviewed":"2026-05-03","next_review":"2026-07-01","verdict":"holding","verdict_history":[{"date":"2026-05-03","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-128","claim":"The MIT NANDA 'GenAI Divide' 95% pilot-failure statistic (August 2025) is widely cited in 2026 enterprise procurement decks as evidence that 95% of AI projects fail. The underlying methodology measures something narrower and more specific: 95% of 300 analysed AI projects delivered no measurable P&L impact, where 'no measurable impact' is largely a function of pilots not having documented pre-deployment baselines, not a function of pilots failing technically. The structurally interesting findings underneath the headline (build-vs-buy 67%-vs-22% spread, 40%-licensed / 90%-shadow-using gap, marketing-vs-back-end deployment misdirection, the static-error / learning-gap pattern) are more useful for procurement teams than the headline number, and they update against the Stanford 12/88 bimodal ROI distribution (claim AM-029) cleanly.","article_url":"https://agentmodeai.com/the-mit-genai-pilot-failure-claim/","topic":"enterprise-ai-cost","pub_date":"2026-05-04","last_reviewed":"2026-05-04","next_review":"2026-07-03","verdict":"holding","verdict_history":[{"date":"2026-05-04","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-129","claim":"No mid-market enterprise has produced a documented +240% ROI in 90 days from agentic AI under audited conditions. Read against McKinsey State of AI 2025 (n=1,993; 23% scaling, 17% EBIT-attribution at 12-month horizon), MIT NANDA GenAI Divide (95% of pilots produce no measurable P&L impact, 67% buy vs 22% build success spread), and Stanford Digital Economy Lab Enterprise AI Playbook (12/88 bimodal ROI distribution at 12-18 months), the realistic 90-day mid-market ROI band for the highest-discipline 12% cohort is 20-40% operator-time savings on bounded use cases plus a working pilot pattern that scales into 12-18-month measurable ROI — not the 240% ROI in 90 days the vendor pitch frames it as. The four-artefact 90-day deliverable (documented baseline, bounded production deployment, per-class action error budget, scaling-vs-stop decision) is what the 12% cohort actually produces.","article_url":"https://agentmodeai.com/achieve-240-roi-in-90-days-with-ai-agents-for-mid-market/","topic":"enterprise-ai-cost","pub_date":"2026-05-04","last_reviewed":"2026-05-04","next_review":"2026-07-03","verdict":"holding","verdict_history":[{"date":"2026-05-04","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-130","claim":"Agentic AI 2024-2025 produced four distinct classes of evidence the 2026 procurement reader should not collapse into a single 'AI is working' narrative: (1) vendor-published wins inside vendor-controlled environments (ServiceNow internal 90% L1 deflection, framed by Nenshad Bardoliwalla as upper bound conditioned on two decades of structured workflow data the customer does not have), (2) audited customer pilots with active human oversight (BT 35% case-resolution improvement with random checks per Hena Jalil; UK Government Digital Service 26 minutes/day saved across 20,000 staff in Q4 2024; HMRC 28,000-staff M365 Copilot rollout April 2026), (3) public walk-backs (Klarna May 2025 Bloomberg-reported reversal of the 700-agent claim while the original press release stayed live; GitHub Copilot April 2026 token-counting bug; Salesforce Agentforce IT 200-customer reality vs Marc Benioff's launch pitch), and (4) structural failure modes (CRMArena-Pro 35% multi-step agent reliability finding; Carnegie Mellon independent verification at 30-35%; EchoLeak CVE-2025-32711 cross-agent prompt-injection class). Each class produces a different procurement lesson; treating them as one narrative is the most common 2026 enterprise mistake.","article_url":"https://agentmodeai.com/the-agentic-ai-revolution-real-world-success-stories-and-strategic-insights-from-2024-2025/","topic":"agent-procurement","pub_date":"2026-05-04","last_reviewed":"2026-05-04","next_review":"2026-07-03","verdict":"holding","verdict_history":[{"date":"2026-05-04","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-131","claim":"The AI Training Lead role — the human who curates the agent's evaluation set, reviews sampled outputs against it, and partners with the ML engineer on retraining decisions — is now a budget-line for enterprise agentic AI deployments rather than a vendor-bundled professional-services function. Domain experts (five-plus years inside the workflow the agent is meant to assist) outperform pure-ML hires in the role because the work is judgement-heavy, not algorithm-heavy. CIOs that do not budget the role explicitly see deployments fail at the iteration boundary.","article_url":"https://agentmodeai.com/from-it-pro-to-ai-training-lead-the-180k-career-path-nobodys-talking-about/","topic":"agentic-ai-governance","pub_date":"2026-05-05","last_reviewed":"2026-05-05","next_review":"2026-07-04","verdict":"holding","verdict_history":[{"date":"2026-05-05","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-132","claim":"Enterprise agentic AI ROI in 2026 is bimodal across four independent datasets. Stanford Digital Economy Lab's 2026 Enterprise AI Playbook documents 12% of deployments clearing 300%+ ROI with 88% at or below break-even at 12-18 months. Gartner Q1 2026 Infrastructure & Operations Survey reports 28% of AI projects 'fully paying off'. McKinsey State of AI 2025 (n=1,993) reports 23% scaling with 17% EBIT-attribution at 12 months. MIT NANDA's GenAI Divide reports 95% of pilots produce no measurable P&L impact alongside the 67% buy vs roughly 22% build success spread. The 73%/27% slug rounds the four numbers; the bimodal shape is reproducible and the variable separating the two cohorts is operational discipline (instrumented under GAUGE: governance, audit substrate, use-case maturity, guardrails, evidence/baseline, exit posture), not model selection.","article_url":"https://agentmodeai.com/why-73-of-agentic-ai-projects-fail-and-how-the-27-generate-312-roi/","topic":"enterprise-ai-cost","pub_date":"2026-05-05","last_reviewed":"2026-05-05","next_review":"2026-07-04","verdict":"holding","verdict_history":[{"date":"2026-05-05","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-133","claim":"The Q3 2026 quarterly claim review covering 1 May 2026 through 30 July 2026 is the publication's first cross-cycle bulletin: most claims published in Q2 have now cleared their first scheduled review, the Resources register has opened with five RES-* tools running on the same cadence discipline as AM-* and OPS-*, and the deadline-anchored cluster of eleven claims tied to the 2 August 2026 EU AI Act deployer-obligations enforcement window is in pre-enforcement state. The Q3 bulletin reports the verdict shifts that the Q2 bulletin could not yet measure; the Q4 bulletin in late October 2026 will report the post-enforcement readout.","article_url":"https://agentmodeai.com/2026-q3/","topic":null,"pub_date":"2026-07-30","last_reviewed":"2026-07-30","next_review":"2026-10-30","verdict":"holding","verdict_history":[{"date":"2026-07-30","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-134","claim":"The 2026 implementation cut on non-human identity for AI agents resolves on three factors (existing IAM relationship, deployment topology, cross-platform integration burden) across six credible control planes: Okta NHI, Microsoft Entra ID Workload Identities, Auth0, Keycloak, SPIFFE/SPIRE for Kubernetes-native deployments, and AWS IAM Roles Anywhere for hybrid AWS-anchored deployments. The procurement-defensible audit substrate captures three event classes regardless of vendor: identity issuance, authentication, and authorisation.","article_url":"https://agentmodeai.com/agent-identity-iam-architecture-nhi/","topic":"agentic-ai-governance","pub_date":"2026-05-05","last_reviewed":"2026-05-05","next_review":"2026-07-04","verdict":"holding","verdict_history":[{"date":"2026-05-05","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-135","claim":"EU AI Act Article 50 takes effect 2 August 2026 and creates four distinct transparency obligations requiring different UX implementations: Article 50(1) chatbot interaction disclosure on providers, Article 50(2) machine-readable marking on generative AI output, Article 50(3) biometric categorisation and emotion recognition disclosure on deployers, and Article 50(4) deepfake disclosure on deployers (with the artistic-or-creative-work exception). The procurement-defensible disclosure UX has six properties (visible at the right moment, plain language, persistent or recurrent, linked to a substantive disclosure surface, auditable, updateable). Most enterprises have absorbed the legal text without designing the UX it requires.","article_url":"https://agentmodeai.com/eu-ai-act-article-50-transparency-disclosure/","topic":"agentic-ai-governance","pub_date":"2026-05-05","last_reviewed":"2026-05-05","next_review":"2026-08-04","verdict":"holding","verdict_history":[{"date":"2026-05-05","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-136","claim":"Across the 24-month window May 2024 to April 2026, every major foundation-model provider (Anthropic, OpenAI, Google, AWS Bedrock, Azure OpenAI) experienced at least one multi-hour outage that exceeded the SLA-credit threshold defined in their published terms. The procurement-defensible posture is multi-provider routing with documented failover and hard-dollar incident liability above the standard SLA-credit cap. Three architectural patterns dominate 2026 production deployments: gateway abstraction (LiteLLM, OpenRouter, Portkey), provider-side regional failover (partial mitigation), and explicit multi-provider provisioning at the application layer.","article_url":"https://agentmodeai.com/foundation-model-uptime-sla-track-record/","topic":"agent-procurement","pub_date":"2026-05-05","last_reviewed":"2026-05-05","next_review":"2026-06-04","verdict":"holding","verdict_history":[{"date":"2026-05-05","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-137","claim":"Agent evaluation in production resolves on three operational components that determine whether the chosen evaluation platform produces useful signal: eval-set design across three layers (50-200 calibration prompts, 30-100 edge-case prompts, 10-50 production-sampled prompts per week), drift detection across three signal classes (output-distribution, score-distribution, tool-use distribution), and a regression-budget framework that forces binary ship/hold decisions (defensible default 5% absolute decline on calibration set, 10% on edge-case set, per release window). The procurement decision (which platform to buy, covered at AM-122) is the easier half; the operational discipline is what most enterprises under-invest in even after buying a platform.","article_url":"https://agentmodeai.com/agent-evaluation-in-production/","topic":"agent-procurement","pub_date":"2026-05-05","last_reviewed":"2026-05-05","next_review":"2026-07-04","verdict":"holding","verdict_history":[{"date":"2026-05-05","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-138","claim":"The 2 August 2026 EU AI Act deployer-obligations enforcement window adds three new clause families to the AI MSA red-team checklist that were optional or absent in pre-enforcement contracts: Article 11 technical-file pass-through, Article 16 post-market-monitoring support, and Article 26 deployer-documentation supply. The post-enforcement checklist grows from the 38-item RES-005 v1.0 baseline to roughly 54 items across 11 clause families, with Article 50 transparency UX (covered at AM-135) and foundation-model uptime hard-dollar liability (covered at AM-136) as additional 2026 additions. The asymmetric-instrument observation — that enterprise and operator AI procurement face the same vendor-citation-chain manipulation pattern with different audit instruments — is embedded as a 600-word insert in this piece.","article_url":"https://agentmodeai.com/vendor-msa-renewal-post-eu-ai-act-enforcement/","topic":"agent-procurement","pub_date":"2026-05-05","last_reviewed":"2026-05-05","next_review":"2026-08-04","verdict":"holding","verdict_history":[{"date":"2026-05-05","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-139","claim":"Enterprise AI buyers and operator AI buyers face the same vendor-citation-chain manipulation pattern with asymmetric audit instruments, and consume vendor case studies aimed at the other cohort with mirror-image misreads. The enterprise reads the IndieHacker timeline as procurement-cycle benchmark and removes controls under timeline pressure; the operator reads the Fortune-500 efficiency gain as result-attribution and inherits expectation without the operational substrate. The cross-borrow that is procurement-defensible at both scales: enterprises borrow the operator's cancellation-trigger discipline (OPS-051) and the cohort-fit filter (OPS-011); operators borrow the enterprise's MSA red-team scoped down (RES-005), evaluation discipline scaled to weekly (AM-137), and audit substrate at lightweight scale (AM-046). The verification gap is the same gap; the instruments are different; the publication's two-register architecture is the editorial response.","article_url":"https://agentmodeai.com/vendor-case-study-misreads-across-buyers/","topic":"agent-procurement","pub_date":"2026-05-05","last_reviewed":"2026-05-05","next_review":"2026-08-04","verdict":"holding","verdict_history":[{"date":"2026-05-05","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"AM-140","claim":"Vendor 'successful pilot' references presented at procurement-committee evaluation transfer to scaled production at the procuring enterprise's measurement and governance regime at roughly the McKinsey 23% rate (n=1,491, Nov 2025); the gap is operational rather than capability-driven and is tractable with six pre-pilot questions a procurement committee can require answered in writing before the contract closes, not after.","article_url":"https://agentmodeai.com/agentic-ai-pilot-to-production-gap/","topic":"agent-procurement","pub_date":"2026-05-06","last_reviewed":"2026-05-06","next_review":"2026-07-05","verdict":"holding","verdict_history":[{"date":"2026-05-06","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-001","claim":"For a 4–10 person ops team running ~50 automations including five agentic steps in 2026, the platform choice is binary between n8n self-hosted and Make.com Pro, decided by whose time pays for the platform; Zapier earns its cost only when a critical integration is vendor-locked.","article_url":"https://agentmodeai.com/operators/n8n-vs-make-com-vs-zapier/","topic":null,"pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-05-26","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-002","claim":"For a 5-person consultancy already on either Notion or ClickUp in 2026, the AI features alone do not justify a workspace switch; the bundling difference (Notion bundles AI into Business at $19.50/seat, ClickUp Brain is a separate $9/seat add-on) makes the platform-shape choice (doc-centric vs project-centric) the actual decision.","article_url":"https://agentmodeai.com/operators/notion-ai-vs-clickup-ai-consultancy/","topic":null,"pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-05-26","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-003","claim":"For a solo founder choosing exactly one consumer AI subscription at around $20/month in 2026, the choice between Claude Pro and ChatGPT Plus is workflow-shape (long-document review and code favour Claude Pro; voice mode, image generation, and integration breadth favour ChatGPT Plus) — not capability-rank, which both vendors trade leadership on monthly.","article_url":"https://agentmodeai.com/operators/claude-pro-vs-chatgpt-plus-solo-founder/","topic":null,"pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-05-26","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-005","claim":"At sub-1M tokens per month (typical SMB agent volume) in 2026, the absolute dollar gap between Claude Haiku 4.5, GPT-4o-mini, and Gemini 2.5 Flash is small enough (≤$3/month) that price is the wrong tiebreaker; tool-use reliability, instruction-following on long context, and ecosystem fit determine the right cheap-tier model per workload shape.","article_url":"https://agentmodeai.com/operators/anthropic-vs-openai-vs-gemini-api-smb/","topic":null,"pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-05-26","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-011","claim":"If a candidate first-AI-agent use case at an SMB cannot answer all four of (a) what does success look like in numbers, (b) who owns it on Monday, (c) what breaks if it fails silently, (d) what is the rollback — the use case is not ready to deploy, regardless of vendor demo quality or model capability.","article_url":"https://agentmodeai.com/operators/picking-first-ai-agent-small-business/","topic":null,"pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-26","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-014","claim":"An SMB AI vendor evaluation defensible to the typical cyber-insurance reasonable-care expectation can be completed in 90 minutes by walking through five questions in order — model provenance, data residency, sub-processor list, breach history, termination clause — each answered from the vendor's public site or the contract about to be signed.","article_url":"https://agentmodeai.com/operators/ai-vendor-due-diligence-small-business/","topic":null,"pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-26","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-021","claim":"Across the published 2026 small-bookkeeping AI corpus (Xero OS, Intuit Assist, Canopy AI Notetaker, Digits MCP Server, with CPA Practice Advisor as the trade-press source), AI now reliably handles five recurring grind workflows at 1-to-5-person firm scale (bank-feed categorisation, receipt OCR, recurring journal posting, sales-tax reconciliation, AR ageing emails), but the judgement-call workflows (period close, advisory conversations, audit defence) remain human-led.","article_url":"https://agentmodeai.com/operators/ai-bookkeeping-small-firm-case-study/","topic":null,"pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-26","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-022","claim":"Across the published 2026 small-law-firm AI corpus (Spellbook with named small-firm customers Westaway, KMSC Law, Polley Faith; Harvey AI with mid-size roster Thompson Hine through Lowenstein Sandler; GC AI as named Anthropic enterprise customer claiming 1,500 companies and 14 hours/week saved), AI now ships at 1-to-20 lawyer-firm scale for contract drafting, document review at scale, and legal research with citation, but privileged-content workflows still require Enterprise-tier model access with zero-data-retention contractual posture per ABA Formal Opinion 512.","article_url":"https://agentmodeai.com/operators/ai-small-law-firm-case-study/","topic":null,"pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-26","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-026","claim":"The published 2026 construction-AI case corpus is overwhelmingly vendor-led (Procore, Autodesk Construction Cloud, OpenSpace, Buildots, Doxel) with thin named small-contractor self-published cases. Reading the vendor corpus honestly, three workflows now show consistent under-100-employee contractor AI deployment (estimating speed via takeoff acceleration, schedule risk surfacing, as-built reality capture); a fourth (AI safety detection) remains structurally biased toward larger sites with the camera coverage and safety officer to act on alerts.","article_url":"https://agentmodeai.com/operators/ai-small-construction-firm-case-study/","topic":null,"pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-26","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-027","claim":"Across the published 2026 dental-AI case corpus (Pearl with FDA-cleared 2D and 3D radiography AI plus 23,000 published practices; Overjet with 21+ named small-and-family-practice customers including Promenade Center, Quest Dental, Midtown Dental Studio), AI now ships at 1-to-3-dentist practice scale for FDA-cleared radiography assist, insurance verification automation, and patient-education visualisation; ambient voice AI for clinical notes is the next surface to ship widely.","article_url":"https://agentmodeai.com/operators/ai-small-dental-practice-case-study/","topic":null,"pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-26","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-028","claim":"The published 2026 small-beauty-salon AI case-study corpus is materially thinner than dental, legal, or bookkeeping (booking platforms publish customer counts but rarely individual-salon AI-attributable outcomes; solo stylists who use AI share informally on Instagram and TikTok rather than in case-study form). Reading the platform corpus honestly, the 2026 working pattern at 1-to-5 chair scale concentrates on no-show reduction via deposits, marketing copy via consumer-tier AI assistants, and portfolio/look generation via Canva and similar tools. AI-driven hairstyling recommendation, voice-AI booking, and dynamic pricing are not yet at the published-case-density that supports a small-salon recommendation.","article_url":"https://agentmodeai.com/operators/ai-small-beauty-salon-case-study/","topic":null,"pub_date":"2026-04-26","last_reviewed":"2026-04-26","next_review":"2026-06-26","verdict":"holding","verdict_history":[{"date":"2026-04-26","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-029","claim":"For solo founders and small teams (under ~50 people) building with AI in 2026, the build-vs-buy decision tree has inverted: specification, not engineering capacity, is now the bottleneck. The teams that can describe their workflow in operational detail can ship things they could not previously afford to build; the teams that cannot still cannot ship, regardless of how good the AI tooling is.","article_url":"https://agentmodeai.com/operators/three-launches-with-ai-the-lessons/","topic":"agentic-ai-governance","pub_date":"2026-04-28","last_reviewed":"2026-04-28","next_review":"2026-06-27","verdict":"holding","verdict_history":[{"date":"2026-04-28","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-030","claim":"The fastest path for an owner-operator to build practical agentic-AI competence in 2026 is the three-week build-by-shipping protocol — specification + scaffolding + ship + connect + deploy + iterate, against a real workflow, with one external user — not formal study or consulting engagement. The protocol produces more transferable competence than published comparable courses on three measurable outcomes: operational decisions the operator can make after, debugging capability without external help, and calibration on when to build versus buy.","article_url":"https://agentmodeai.com/operators/using-ai-to-learn-ai-operator-playbook/","topic":"agentic-ai-governance","pub_date":"2026-04-28","last_reviewed":"2026-04-28","next_review":"2026-06-27","verdict":"holding","verdict_history":[{"date":"2026-04-28","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-031","claim":"Solo founders evaluating AI bookkeeping in 2026 face three realistic options: a fully-managed AI-augmented service (Bench, Pilot), a software-led tool that does AI categorisation inside an existing accounting product (QuickBooks Live, Xero with Hubdoc), or a DIY stack (Claude/ChatGPT + a spreadsheet template). The fully-managed option scales when revenue passes ~$30K MRR; below that, the DIY stack with a 30-min monthly review beats both software-led and managed. The failure mode is paying for managed-service automation while still doing 80% of the categorisation yourself because the AI hasn't seen enough of your transaction patterns yet.","article_url":"https://agentmodeai.com/operators/ai-bookkeeping-for-solo-founders/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-13","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026 covering the three-option market split (fully-managed / software-led / DIY) and the ~$30K MRR threshold for fully-managed to become net-positive. REVIEW: Peter to verify current Bench and Pilot entry-tier pricing on or before 13 Jun 2026; if either has launched a sub-$100/month tier the threshold call shifts."}],"primary_sources":[]},{"id":"OPS-032","claim":"For SMB content workflows in 2026 (blog drafts, weekly newsletter, social copy, email sequences) at a 1-to-10 person business shipping two-to-four pieces per week, the practitioner read is workflow-shape not capability-rank: Claude wins on long-form editorial voice and structured drafting; ChatGPT wins on speed-and-iteration plus image generation in the same conversation; Gemini wins on Google-stack integrations. Paying for all three Plus tiers (around $60/month) without a deliberate task split is the expensive failure mode.","article_url":"https://agentmodeai.com/operators/chatgpt-vs-claude-vs-gemini-smb-content/","topic":"agent-procurement","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-13","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026 with status=partial. Recommendation derived from vendor pricing pages 29 Apr 2026 + public eval leaderboards + practitioner write-ups, not from a tracked SMB-cohort replication. Promotes to Holding once two consecutive 45-day reviews replicate the workflow-shape split on a real operator sample. REVIEW: Peter."}],"primary_sources":[]},{"id":"OPS-033","claim":"AI customer-service automation at 1-10 employee scale clears net-positive only when 70% or more of weekly inquiries are repetitive, low-stakes, and factually resolvable (hours, pricing, simple status). Below 50% the trust-erosion and remediation cost exceeds the headcount saving; between 50% and 70%, the answer turns on whether responsiveness is the brand differentiator.","article_url":"https://agentmodeai.com/operators/ai-customer-service-small-business/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-13","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026. Break-even thresholds (70/50) and never-deflect list are editorial synthesis from cited platform docs and CS-automation research, not a primary-data study. REVIEW: Peter to validate against any first-party SMB deployment data he has access to before status promotion to Holding."}],"primary_sources":[]},{"id":"OPS-034","claim":"For a solo founder processing 100-300 emails a day in 2026, the cheap-stack option (Gmail labels + Claude Pro at $20/month + a 5-line prompt template) recovers roughly 90% of the value of an $83/month premium stack (Superhuman AI + Shortwave Pro + Reclaim.ai Pro) at about 24% of the cost. The premium stack is worth its price under three conditions only — 2+ hours/day in email, keyboard-shortcut speed gain that pays back at the founder's hourly rate, and a documented bottleneck the cheap stack failed to solve after a two-week trial. Without all three, the founder is paying for an aesthetic, not measurable productivity.","article_url":"https://agentmodeai.com/operators/solo-founder-email-triage-ai-stack/","topic":"agent-procurement","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-13","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026 with status=partial. Cost-side claims (vendor pricing) verifiable against the four cited pricing pages on the publication date. Time-recovery claim (90+ min compressed to ~20 min) drawn from published productivity-blogger benchmarks rather than Peter-run measurement; first-cohort replication on the publication's tracked operator cohort due by 13 Jun 2026. REVIEW: Peter."}],"primary_sources":[]},{"id":"OPS-035","claim":"There are five categories of small-business work where AI substitution in 2026 costs more in trust and liability exposure than it saves in productivity: (1) signed legal documents and tax-return positions, (2) trust-laden customer touchpoints (cancellations, refunds, conflict de-escalation), (3) regulatory submissions where the human signature is the audit trail, (4) anything requiring genuine domain credentialing (medical advice, licensed financial advice, signed engineering work), and (5) the first six conversations with a new high-value client.","article_url":"https://agentmodeai.com/operators/when-not-to-use-ai-for-small-business/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-13","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026. Status set to Partial at publication because category 5 lacks the same regulatory/cited-consequence anchor as categories 1-4. REVIEW: Peter to confirm category 5 evidence base and either upgrade to Holding (with strengthened citation) or amend the claim to four categories."}],"primary_sources":[]},{"id":"OPS-036","claim":"An SMB AI policy that actually changes day-to-day behaviour fits on one page and contains exactly eight clauses — sanctioned tools, prohibited data, human-review gate, client disclosure rule, prohibited uses, incident-report path, review cadence, and signature line — each closing a failure mode currently surfacing in regulatory guidance, court records, and breach disclosures through 2025-2026.","article_url":"https://agentmodeai.com/operators/1-page-ai-policy-for-small-business/","topic":"agentic-ai-governance","pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-13","verdict":"partial","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."},{"date":"2026-04-29","verdict":"partial","note":"Initial publication 29 Apr 2026. Status set to Partial at publication because clause 6 commentary references an order-of-magnitude remediation-cost gap derived from the IAPP 2024 AI Governance Profession Report; the report characterises the gap as material but does not publish a precise multiple, so the wording is annotated source: our-estimate. REVIEW: Peter to source a precise figure or amend the commentary."}],"primary_sources":[]},{"id":"OPS-037","claim":"AI-drafted invoices for EU SMB operators in 2026 fail VAT audit at higher rates than human-drafted invoices specifically on cross-border treatment (OSS scheme wording, reverse-charge language, customer VAT-status verification), because LLM training data underweights post-2021 e-commerce VAT rules. The fix is a 4-line VAT-compliance prompt prefix that names the operator's VAT registration, the customer's VAT status, and the applicable scheme; most SMB invoicing tooling does not ship this by default.","article_url":"https://agentmodeai.com/operators/ai-invoicing-vat-compliance-small-business/","topic":null,"pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-30","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-038","claim":"SMB AI-VA deployments displacing admin work in collective-agreement-covered sectors (Dutch CAO, German Tarifvertrag, French Convention Collective) trigger collective-agreement provisions even at sub-10-employee scale in 2026, via job-classification-displacement and technology-introduction-consultation channels. Most SMB owners are unaware until the first union audit; FNV / DGB / IG Metall / CFDT activity in this area has shifted from theoretical to operational since 2024.","article_url":"https://agentmodeai.com/operators/ai-va-small-business-collective-agreement/","topic":null,"pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-30","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-039","claim":"AI-drafted contracts in EU notary-required jurisdictions (NL, DE, AT, BE, CH) are producing a class of legal-malpractice incidents in 2026 where the SMB owner treats an AI draft as final binding document, missing the notarisation requirement for real-estate transfers, GmbH/BV share transfers, and certain marriage/inheritance instruments. The fix is a 30-second pre-signing check on transaction-type and jurisdictional notarial-form requirement; AI tooling does not flag this by default.","article_url":"https://agentmodeai.com/operators/ai-drafted-contracts-notary-requirement-eu/","topic":null,"pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-30","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-040","claim":"Dutch ZZP'ers losing recurring client work to AI replacement in 2026 sit outside the WW (Werkloosheidswet) safety net entirely and find that available AOV (arbeidsongeschiktheidsverzekering) products mostly exclude demand-side income loss; the structural gap is pushing affected ZZP'ers into bijstand at faster rates than the 2024 baseline. The realistic options are operational (client-base diversification, offer restructuring, larger liquid buffer), not insurance-based.","article_url":"https://agentmodeai.com/operators/zzp-ai-displacement-unemployment-gap-nl/","topic":null,"pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-30","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-041","claim":"SMB owners using AI to produce marketing content are hitting platform algorithmic penalties at increasing rates in 2026, with platform-specific enforcement: Google Helpful Content system + March 2024 spam policy update target scaled-content-without-E-E-A-T; LinkedIn feed-distribution deprioritises fully-AI-generated content while tolerating AI-assist; Etsy listing-policy enforcement is heavier than either, with category-specific AI prohibitions. The defensible cross-platform posture is AI drafts + human edits + human signature with sustainable cadence.","article_url":"https://agentmodeai.com/operators/platform-algorithm-ai-content-penalties/","topic":null,"pub_date":"2026-04-29","last_reviewed":"2026-04-29","next_review":"2026-06-30","verdict":"holding","verdict_history":[{"date":"2026-04-29","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-042","claim":"For under-100-employee construction firms in 2026, the AI procurement order is estimating + bidding tools first (Togal.AI for general takeoff; Procore Copilot if already on Procore), with visual-progress capture (Buildots, OpenSpace) deferred until project portfolio exceeds 8 simultaneous projects per project manager. The vendor pitch oversells visual capture and undersells the takeoff workflow where the actual hours go (35-45% of estimator/PM time on bidding work, 5-10% on jobsite walkthroughs).","article_url":"https://agentmodeai.com/operators/ai-construction-estimating-bidding-tools/","topic":null,"pub_date":"2026-05-03","last_reviewed":"2026-05-03","next_review":"2026-07-02","verdict":"holding","verdict_history":[{"date":"2026-05-03","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-043","claim":"For solo founders under €5K MRR running 20-80 customer-service tickets per week in 2026, the cheap stack (shared inbox host + Claude Pro at €20/month + a copy-paste prompt-pack, total under €40/month) is structurally cheaper than the dedicated AI helpdesks (Intercom Fin, Crisp AI, Tidio Lyro) until ticket volume sustains above ~200/week. Above that threshold, the per-resolution and per-conversation pricing on the dedicated platforms starts to compete; below it, the cheap stack wins on cost AND on operator experience. The volume threshold is the procurement signal, not the vendor pitch.","article_url":"https://agentmodeai.com/operators/solo-founder-customer-service-ai-stack/","topic":null,"pub_date":"2026-05-03","last_reviewed":"2026-05-03","next_review":"2026-07-02","verdict":"holding","verdict_history":[{"date":"2026-05-03","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-044","claim":"For appointment-driven local-service businesses in 2026 (hairdresser, plumber, garage, cleaner, beautician), the AI value concentrates in two workflows neither booking-platform AI feature serves well: no-show reduction via personalised SMS sequences (3rd-party SMS API on top of the booking platform's webhook, typical 30-50% no-show reduction in published case studies) and review generation (post-appointment SMS or WhatsApp, typical 3-5x review-completion lift). The booking-platform decision (Booksy, Square Appointments, Treatwell, Vagaro) is shaped by customer-discovery model and existing payment infrastructure; the AI decision is shaped by whichever third-party SMS-and-review-automation layer bolts on top. Operators picking the booking platform on its bundled AI features pay for AI that does not move the numbers.","article_url":"https://agentmodeai.com/operators/ai-local-service-business-appointment-driven/","topic":null,"pub_date":"2026-05-03","last_reviewed":"2026-05-03","next_review":"2026-07-02","verdict":"holding","verdict_history":[{"date":"2026-05-03","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-045","claim":"OPS-031's jurisdiction-neutral DIY AI bookkeeping case for solo founders under €30K MRR breaks at the NL-specific Belastingdienst audit-trail boundary. The procurement decision per omzetband: Moneybird (€15-€39/month) under €100K omzet with API-driven AI flow via Make.com or n8n; e-Boekhouden as the goedkope fallback with bundled Scan & Herken OCR; Exact Online above €500K omzet or at BV-overgang where Exact's interne AI replaces the external prompt-pack workflow. NL-specifieke prompt-prefix (klant locatie, dienst type, reverse-charge applicability, OSS-scheme applicability, BTW-rubriek per Belastingdienst-aangifte 2026) is the operationally load-bearing addition that makes AI-getekende journaalposten direct invoerbaar in the chosen tool's BTW-aangifte.","article_url":"https://agentmodeai.com/operators/ai-bookkeeping-nl-moneybird-eboekhouden-exact/","topic":null,"pub_date":"2026-05-03","last_reviewed":"2026-05-03","next_review":"2026-07-02","verdict":"holding","verdict_history":[{"date":"2026-05-03","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-046","claim":"Marketplace-reseller AI in 2026 fails differently per platform and the cross-platform mitigation pattern is to separate AI-on-listing-copy (broadly safe across Etsy, Marktplaats, Vinted) from AI-on-listing-images (increasingly penalised on all three platforms via different mechanisms: Etsy's Creativity Standards and AI-disclosure requirement; Marktplaats's photo-fingerprint deduplication; Vinted's image-similarity penalty for resale-of-resold). The 'AI does the entire listing' workflow is the procurement pattern that produces the account-suspension report 6-12 months later. The defensible reseller workflow uses real photos, AI-assisted copy with platform-required disclosure, and per-platform performance tracking on impressions and sales.","article_url":"https://agentmodeai.com/operators/ai-marketplace-resellers-etsy-marktplaats-vinted/","topic":null,"pub_date":"2026-05-03","last_reviewed":"2026-05-03","next_review":"2026-07-02","verdict":"holding","verdict_history":[{"date":"2026-05-03","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-047","claim":"EU AI Act Annex III point 4 (employment, workers management, recruitment) applies to SMB AI hiring use even at four-employee scale; the threshold does not scale with company size, and the 2 August 2026 enforcement window covers AI-screened CVs in ChatGPT/Claude/Gemini the same way it covers dedicated platforms (Workable, Greenhouse, Lever, BrightHire). The defensible posture is AI-assisted decisions with a documented human decision-maker plus retained AI-output records — not AI-decided hiring. Solely-automated candidate scoring also conflicts with GDPR Article 22; ICO, AP, and Garante guidance from 2024-2025 is consistent on the human-in-the-loop requirement.","article_url":"https://agentmodeai.com/operators/ai-hiring-smb-eu-ai-act-annex-iii/","topic":"agentic-ai-governance","pub_date":"2026-05-04","last_reviewed":"2026-05-04","next_review":"2026-07-05","verdict":"holding","verdict_history":[{"date":"2026-05-04","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-048","claim":"Solo founders adding AI to cold outbound see a deliverability collapse around day 60-90 because AI lifts personalisation breadth at the same volume rather than personalisation depth at lower volume. The collapse is mechanical: AI-templated personalisation degrades recipient engagement, engagement decay triggers spam-classifier de-prioritisation, lower inbox rate produces more complaints, complaints trigger soft blocks. The defensible 2026 posture: 30-40 sends per inbox per day, named-specific first-paragraph personalisation, reply-rate KPI not open-rate, plus a documented EU GDPR Article 6(1)(f) Legitimate Interest Assessment for B2B founders in scope of e-Privacy Directive.","article_url":"https://agentmodeai.com/operators/ai-cold-sales-solo-founder-deliverability/","topic":"agentic-ai-implementation","pub_date":"2026-05-04","last_reviewed":"2026-05-04","next_review":"2026-06-18","verdict":"holding","verdict_history":[{"date":"2026-05-04","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-049","claim":"German Mittelstand AI deployment in 2026 hits two compliance surfaces most US-headquartered AI vendors do not handle out of the box: BetrVG §87(1) point 6 co-determination triggers at the first AI assistant or agent that touches employee work activity (Bundesarbeitsgericht broad interpretation covers any system that captures, processes, or analyses employee work activity, primary purpose immaterial); DSGVO Article 35 + Datenschutzkonferenz Muss-Liste require pre-deployment DPIA for most AI-employee-data deployments. The early-engagement workflow (works council notified at vendor selection, DPIA in parallel with vendor evaluation, joint Betriebsvereinbarung drafting, documented pilot at one team for 60-90 days, broader rollout after pilot review) compresses Mittelstand AI timeline from 12-18 months (late engagement) to 6-9 months.","article_url":"https://agentmodeai.com/operators/ai-mittelstand-betrvg-dsgvo-deployment/","topic":"agentic-ai-governance","pub_date":"2026-05-04","last_reviewed":"2026-05-04","next_review":"2026-07-05","verdict":"holding","verdict_history":[{"date":"2026-05-04","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-050","claim":"Local SMB AI use on Google Business Profile and local-SEO content splits into two cohorts in 2026: AI-as-research-and-assembly (keyword research, citation audit, performance analysis via Surfer/Frase/Ahrefs/BrightLocal/Whitespark) compounds visibility safely; AI-as-generation (auto-published reviews, auto-published review responses, bulk service-area pages, high-cadence GBP posts) triggers Google's Helpful Content classifier and the March 2024 spam policy update enforcement, with documented suspensions and ranking collapse on a 30-90 day cycle. The defensible posture is AI for the work that scales poorly (research, cross-reference) and human for any content that reaches the public surface.","article_url":"https://agentmodeai.com/operators/ai-local-seo-google-business-profile-smb/","topic":"shadow-ai-discovery","pub_date":"2026-05-04","last_reviewed":"2026-05-04","next_review":"2026-07-05","verdict":"holding","verdict_history":[{"date":"2026-05-04","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-051","claim":"AI proposal tools in 2026 split into two clusters by what they let the operator publish: tools that AI-assist proposal assembly (PandaDoc, Better Proposals, Proposify, Bonsai) compound; tools that AI-generate proposal narrative (Pitch, Gamma, Tome AI generation features) read as AI-generated to most buyers within thirty seconds and close at materially lower rates. Three structural patterns trigger the buyer-side AI-generated detection: the three-phase project structure regardless of actual scope, the credentials paragraph that lists capability without naming clients, the pricing section that over-explains itself. The defensible posture is AI for assembly (pricing tables, scope-of-work blocks, clause libraries from CRM) and human for voice (cover letter, executive summary, project-fit paragraph, next-step CTA).","article_url":"https://agentmodeai.com/operators/ai-client-proposals-tools-solo-founder/","topic":"agentic-ai-implementation","pub_date":"2026-05-04","last_reviewed":"2026-05-04","next_review":"2026-06-18","verdict":"holding","verdict_history":[{"date":"2026-05-04","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-052","claim":"Voor de Nederlandse zelfstandige advocaat (eenmanspraktijk, klein kantoor onder 5 partners) is AI in 2026 toegestaan voor drie hoofdcategorieën onder de NOvA-gedragsregels: juridisch onderzoek met advocaat-verificatie van elke citatie, document-drafting waar de advocaat reviewt en signeert, en cliëntcommunicatie-ondersteuning waar de advocaat elke uitgaande communicatie reviewt voor verzending. AI is niet toegestaan zonder advocaat-review voor: advies-generatie aan cliënten, procesvertegenwoordiging, cliëntgegevens-verwerking via niet-EU-LLM zonder Verwerkersovereenkomst, en het ondertekenen van documenten met AI-gegenereerde citaten zonder primaire-bron-verificatie. EU AI Act Artikel 50 disclosure is verplicht voor cliënt-AI-chatbots vanaf 2 augustus 2026.","article_url":"https://agentmodeai.com/operators/ai-solo-legal-paralegal-nl-bar-rules/","topic":null,"pub_date":"2026-05-05","last_reviewed":"2026-05-05","next_review":"2026-08-04","verdict":"holding","verdict_history":[{"date":"2026-05-05","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-053","claim":"For marketplace resellers running AI image workflows in 2026, the safe pattern across Marktplaats, Vinted, and Etsy is original photography of the actual item with light AI enhancement (lighting, contrast, background cleanup) only. AI-generated listing imagery and heavy enhancement that produces consistent visual fingerprints across listings trigger Marktplaats's photo-fingerprint deduplication (most aggressive), Vinted's image-similarity penalty for the resale-of-resold pattern, and Etsy's Creativity Standards on AI-generated imagery in handmade categories. The five-rule safe workflow: original photography of every item, light AI enhancement only, fresh photography per relisting, per-platform disclosure where required, and impression-to-view ratio tracking as the leading indicator of algorithm-induced ranking suppression.","article_url":"https://agentmodeai.com/operators/ai-marketplace-image-workflow-marktplaats-vinted/","topic":null,"pub_date":"2026-05-05","last_reviewed":"2026-05-05","next_review":"2026-07-04","verdict":"holding","verdict_history":[{"date":"2026-05-05","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-054","claim":"For EU-based solo developers doing client work in 2026, the procurement-defensible AI-tool posture turns on client-code data residency rather than on Cursor-vs-Copilot-vs-Claude-Code feature comparison. All three dominant AI coding tools support EU data residency at Enterprise tiers (Copilot via Microsoft Azure OpenAI EU regions, Cursor via configurable LLM provider routing, Claude Code via Anthropic API EU-region availability). Three contract clauses now appear in regulated EU client agreements: client-code-non-transmission, EU-residency requirement, and sub-processor disclosure. The procurement-defensible workflow has five steps: AI-tool inventory, per-client risk assessment, configure tools per client, document configuration in engagement contract, audit quarterly. Three scenarios where the right answer is to disable AI tooling entirely: explicit contract prohibition that cannot be negotiated, embedded regulated data in the codebase, national-security or jurisdictionally-sensitive code.","article_url":"https://agentmodeai.com/operators/ai-solo-dev-eu-client-code-residency/","topic":null,"pub_date":"2026-05-05","last_reviewed":"2026-05-05","next_review":"2026-07-04","verdict":"holding","verdict_history":[{"date":"2026-05-05","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-055","claim":"For German solo founders and small Mittelstand operators running AI-bookkeeping in 2026, the Buchhaltungssoftware choice resolves on Umsatz tier and Steuerberater relationship: DATEV (€20-€80/month plus Steuerberater-coupling) above €100K Umsatz where the Steuerberater workflow is binding, sevDesk (€8-€48/month) under €100K Umsatz as the cheapest path that produces a GoBD-compliant audit trail, and Lexware (€10-€40/month) as the legacy-Mittelstand fallback. The OPS-031 jurisdiction-neutral DIY-AI-bookkeeping case breaks at the moment the AI-drafted Buchungssatz must land in a tool that preserves the GoBD audit trail; the German-tool layer is the complement to the DIY-AI case. The OSS-Verfahren and reverse-charge VAT prompt-prefix is the operational discipline that prevents AI-VAT-error in 1 of 20 EU-cross-border invoices.","article_url":"https://agentmodeai.com/operators/ai-bookkeeping-de-datev-sevdesk-lexware/","topic":null,"pub_date":"2026-05-05","last_reviewed":"2026-05-05","next_review":"2026-07-04","verdict":"holding","verdict_history":[{"date":"2026-05-05","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"OPS-056","claim":"For bootstrapped SaaS founders under €30K MRR with AI features in production, the metric that matters is token cost per active user (not total monthly AI spend). Total monthly spend is the lagging indicator that signals problems only after they have crossed gross-margin thresholds; cost per active user is the leading indicator that catches runaway patterns before they erode unit economics. The defensible cancellation-trigger threshold sits at 30-40% of per-user revenue. Four levers when the cost crosses the trigger, ranked by disruption: provider-tier switch (40-70% reduction, low impact), prompt and caching optimisation (20-40% reduction, moderate impact), product change (30-60% reduction, high impact), provider switch (10-30% reduction, highest disruption). Token cost dropped roughly 90% from 2023-2026 but per-user cost stayed flat because product features pulled 10-30x more tokens per session and user behaviour shifted toward higher engagement.","article_url":"https://agentmodeai.com/operators/ai-cost-discipline-bootstrapped-saas/","topic":null,"pub_date":"2026-05-05","last_reviewed":"2026-05-05","next_review":"2026-07-04","verdict":"holding","verdict_history":[{"date":"2026-05-05","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"RES-001","claim":"The 47-question AI Vendor Security Questionnaire covers seven failure surfaces (model lineage, training/inference data handling, non-human identity, audit/observability, kill-switch, EU AI Act + GDPR posture, contract/indemnification) that CAIQ v4 and SIG do not address; vendors that cannot answer score sections binary-unanswered, and the questionnaire is the addendum (not replacement) to existing cloud/SaaS procurement frameworks.","article_url":"https://agentmodeai.com/resources/ai-vendor-security-questionnaire/","topic":null,"pub_date":"2026-05-04","last_reviewed":"2026-05-04","next_review":"2026-08-02","verdict":"holding","verdict_history":[{"date":"2026-05-04","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"RES-002","claim":"The pre-deployment AI DPIA template fuses GDPR Article 35 obligations with EU AI Act Article 26 (deployer) and Article 27 (FRIA where applicable) into a single working-session document; sections 7 and 8 are conditional on the EU AI Act risk classification established in section 1, which means deployers complete the full document only when the system is classified as high-risk under Annex III.","article_url":"https://agentmodeai.com/resources/ai-dpia-template/","topic":null,"pub_date":"2026-05-04","last_reviewed":"2026-05-04","next_review":"2026-08-02","verdict":"holding","verdict_history":[{"date":"2026-05-04","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"RES-003","claim":"The four-phase agent incident runbook (detect within 4h, contain within 30s, roll back per action class, post-mortem with MTTD-for-Agents detection chain) is the operational overlay on standard SRE incident response that most enterprises deploying agentic AI in 2026 do not have; the seven action classes (database writes, external API calls, customer comms, document publication, code commits, identity changes, knowledge-base writes) each require a distinct rollback procedure and the runbook captures the operator authorised, time budget, and substitute action where rollback is impossible.","article_url":"https://agentmodeai.com/resources/agent-incident-runbook/","topic":null,"pub_date":"2026-05-04","last_reviewed":"2026-05-04","next_review":"2026-07-03","verdict":"holding","verdict_history":[{"date":"2026-05-04","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"RES-004","claim":"The Works Council AI Notification Packet covers three EU jurisdictions (German BetrVG §87(1) point 6, Dutch WOR Article 27, French CSE consultation under Code du travail L2312-8) plus the EU AI Act Article 26(7) deployer-notification overlay that activates 2 August 2026; early engagement (vendor-shortlist landing) compresses deployment timelines from 12-18 months (late engagement) to 6-9 months in the Mittelstand case studied in OPS-049, and the per-jurisdiction documents (Betriebsvereinbarung, OR convenant, CSE avis) are designed to consolidate into a single deployment-go decision with the Article 26(7) notification appended afterward.","article_url":"https://agentmodeai.com/resources/works-council-ai-notification-packet/","topic":null,"pub_date":"2026-05-04","last_reviewed":"2026-05-04","next_review":"2026-08-02","verdict":"holding","verdict_history":[{"date":"2026-05-04","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]},{"id":"RES-005","claim":"The 38-item AI MSA red-team checklist organises the contractual review around seven clause families (training-data carve-outs, output ownership + IP indemnification, model-deprecation rights, sub-processor expansion, kill-switch SLA, exit-data portability, regulatory + EU AI Act flow-through) where 2025-2026 enterprise AI MSA failures cluster; vendors scoring yes on 30+ items are contractually serious, 20-29 items are treatable through negotiation, and below 20 signals that the vendor's commercial position depends on retaining the rights the checklist is designed to constrain.","article_url":"https://agentmodeai.com/resources/ai-msa-red-team-checklist/","topic":null,"pub_date":"2026-05-04","last_reviewed":"2026-05-04","next_review":"2026-07-03","verdict":"holding","verdict_history":[{"date":"2026-05-04","verdict":"holding","note":"Claim created at publish."}],"primary_sources":[]}]}