Published December 2, 2025 | Version v1
Publication Open

Does Agentic AI Exist? A Cross-Vendor Investigation (Evans Law v6.0)

  • 1. PatternPulse AI

Description

The AI industry has rapidly adopted the term "agentic AI" to describe large language models that can autonomously execute multi-step tasks, maintain state across extended interactions, and operate with minimal human supervision. This paper presents evidence that autonomous agentic capability, as marketed, does not currently exist in any major commercial LLM system.

Through systematic cross-vendor testing of five leading AI platforms—OpenAI's GPT, Anthropic's Claude, xAI's Grok, DeepSeek, and Google's Gemini—we demonstrate that what is marketed as "agentic AI" is either (1) sophisticated tool-use within supervised conversational sessions, or (2) traditional software systems with LLM components, where the engineering infrastructure, not the AI, provides the agentic behavior. We argue that "agentic AI" represents a category error: conflating LLM capability with complete system capability.

We introduce three theoretical tools: (1) Evans' Law for Agentic AI (Cₐ = L × S(t,e) × U(v)), which predicts collapse at discrete operational thresholds—Launch (83% failure rate), Sustain (memory-free drift), and Upgrade (100% failure rate); (2) The Evans Ratio (E = Cp/Cd), which quantifies the balance between probabilistic intelligence and deterministic control; and (3) The Brock Threshold (E = 1), which defines the boundary between sophisticated automation and true agency.

Analysis of production deployments, including Booking.com's customer service agent, Klarna's support automation, and Salesforce's Agentforce, reveals that successful "agentic" systems achieve reliability through extensive code-based constraints that limit LLM operation to narrow, carefully bounded tasks. These systems consistently measure E < 0.3 on the Evans Ratio (E = Cp/Cd), indicating that deterministic scaffolding contributes 3-10x more to autonomous behavior than the probabilistic core - all operating far below the Brock Threshold (E = 1), the boundary where true agency would emerge.

We conclude with a call for evidence-based terminology and propose evaluating LLMs as components within engineered systems rather than as autonomous agents. The question is not whether LLMs will become truly autonomous agents in the future, but whether we can be precise about what they are today.

Files

Agentic2EvansLaw6.0FINAL.pdf

Files (7.5 MB)

Name Size Download all
md5:af366edefe0f18977a99b1a4ab0812bb
612.2 kB Preview Download
md5:e4a101e8cf0e38c576c6cc8c30151aaa
4.9 MB Preview Download
md5:705780cc5c7cb597b4e8ae1fa28d04fb
241.4 kB Preview Download
md5:76eb42b85a550e9deffc7da96b22587e
673.9 kB Preview Download
md5:5cfd287ca30ce76a9b603c0babe7e409
1.0 MB Preview Download

Additional details

Dates

Available
2025-12-02
This paper presents the first cross-vendor empirical analysis of "agentic AI" claims, demonstrating that no commercially available LLM exhibits autonomous operational capability. Using Evans' Law (Agentic Extension), the Evans Ratio, and the Brock Threshold, we show that all observed agentic behavior arises from deterministic scaffolding rather than model-driven autonomy. The study unifies conversational and operational collapse into a single framework and provides practical guidance for enterprises deploying LLM-integrated systems.