Skip to content
Academic1 entry

Every archived claim from Carnegie Mellon University.

Academic entries in the Claim Archive, ordered by log date (newest first). Verdicts reflect current evidence; individual claim pages show the full review and change history.

  1. ACA-2026-004Pending review
    Carnegie Mellon's TheAgentCompany 2026 update shows Gemini 2.5 Pro as the best enterprise-agent at 30.3% task completion — up from the 24% Claude 3.5 Sonnet baseline in 2024, but still far below production-readiness thresholds.
Vigil · 40 reviewed