Published January 28, 2026 | Version 1
Publication | Open Access

Cognitive Integrity Framework: Formal Foundations for Multiagent Security

Authors/Creators

Description

Multiagent AI systems introduce cognitive attack surfaces absent in single-model inference. When agents delegate to agents, forming beliefs about beliefs through recursive trust hierarchies, manipulation of reasoning processes, rather than mere data corruption, becomes a primary security concern. This paper presents the Cognitive Integrity Framework (CIF), which provides formal foundations for cognitive security in multiagent systems. We develop four interconnected theoretical contributions: a Trust Calculus with bounded delegation (exponential δ^d decay) that prevents trust amplification through delegation chains; a Defense Composition Algebra with series and parallel composition theorems establishing multiplicative detection bounds; Information-Theoretic Limits relating stealth constraints to maximum attack impact through a fundamental stealth-impact tradeoff; and a formal Adversary Hierarchy (Ω1–Ω5) characterizing external, peripheral, agent-level, coordination, and systemic threats of increasing capability and decreasing detectability. The framework provides complete coverage of the OWASP Top 10 for Agentic Applications through formal threat models grounded in cognitive state manipulation rather than traditional input/output filtering.
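The following is a minimal sketch, not the paper's implementation, of the two quantitative ideas named above: trust that decays exponentially as δ^d along a delegation chain of depth d, and a multiplicative detection bound when independent defenses are composed in series. All names (delegated_trust, series_detection, the value of δ) are illustrative assumptions.

# Illustrative sketch (assumed names and parameters, not from the paper):
# bounded trust delegation with exponential delta**depth decay, and the
# multiplicative detection bound for defenses composed in series,
# assuming per-layer detections are independent.

def delegated_trust(base_trust: float, delta: float, depth: int) -> float:
    """Trust assigned to an agent reached through `depth` delegation hops.

    With 0 < delta < 1, trust decays exponentially with chain length, so a
    delegation chain can never amplify trust above the direct value.
    """
    assert 0.0 < delta < 1.0
    return base_trust * (delta ** depth)

def series_detection(p_detect: list[float]) -> float:
    """Detection probability of independent defenses composed in series.

    An attack evades the composed defense only if it evades every layer, so
    the miss probability is the product of per-layer miss probabilities.
    """
    miss = 1.0
    for p in p_detect:
        miss *= (1.0 - p)
    return 1.0 - miss

# Example: trust after a 3-hop delegation chain, and detection probability of
# two layered defenses with per-layer detection rates 0.7 and 0.6.
print(delegated_trust(0.9, 0.8, 3))   # 0.9 * 0.8**3 = 0.4608
print(series_detection([0.7, 0.6]))   # 1 - 0.3 * 0.4 = 0.88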

CIF bridges classical security concepts with the cognitive requirements of agentic systems. We extend Byzantine fault tolerance to cognitive manipulation (agents that appear functional but hold corrupted beliefs) and adapt trust management systems to continuous trust evolution with provable decay bounds. The framework formalizes five architectural defense mechanisms (cognitive firewalls, belief sandboxing, behavioral tripwires, provenance tracking, Byzantine consensus) with composition rules that enable formal reasoning about layered security. Technical foundations include operational semantics for message passing and trust updates; invariants for belief integrity, goal preservation, and trust boundedness; model-checking configurations for safety property verification; and a complete notation system for attack parameterization, defense specification, and cognitive state representation. This is Part 1 of a three-part series: Part 1 (this paper, DOI: 10.5281/zenodo.18364119) presents formal foundations and theoretical analysis; Part 2 (DOI: 10.5281/zenodo.18364128) provides computational validation and implementation; Part 3 (DOI: 10.5281/zenodo.18364130) offers practical deployment guidance. The framework will continue to be developed and versioned at https://github.com/docxology/cognitive_integrity/.
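To make the trust-update and trust-boundedness ideas concrete, the sketch below shows one possible shape of a trust state with a delegation-capped update rule and a checkable safety invariant. It is an assumption-laden illustration: the class and method names (TrustState, delegate, observe, check_bounded) and the decay constant DELTA are invented here, not taken from the framework.

# Minimal sketch under stated assumptions: a trust map with a continuous
# update rule and a "trust boundedness" invariant -- an agent reached by
# delegation may never be trusted more than DELTA times its delegator,
# which transitively enforces the delta**depth bound.

from dataclasses import dataclass, field

DELTA = 0.8  # assumed per-hop delegation decay factor, 0 < DELTA < 1

@dataclass
class TrustState:
    # direct trust the principal holds in each agent, each value in [0, 1]
    trust: dict[str, float] = field(default_factory=dict)
    # delegation edges: child -> (parent, depth of child from the principal)
    parents: dict[str, tuple[str, int]] = field(default_factory=dict)

    def trust_cap(self, agent: str) -> float:
        """Upper bound on an agent's trust implied by its delegation edge."""
        if agent not in self.parents:
            return 1.0
        parent, _ = self.parents[agent]
        return self.trust.get(parent, 0.0) * DELTA

    def delegate(self, parent: str, child: str) -> None:
        """Record that `parent` delegates to `child`; child trust is capped."""
        depth = self.parents.get(parent, (None, 0))[1] + 1
        self.parents[child] = (parent, depth)
        cap = self.trust.get(parent, 0.0) * DELTA
        self.trust[child] = min(self.trust.get(child, cap), cap)

    def observe(self, agent: str, outcome_good: bool, rate: float = 0.1) -> None:
        """Evolve trust continuously toward 1 (good) or 0 (bad), never past the cap."""
        t = self.trust.get(agent, 0.0)
        target = 1.0 if outcome_good else 0.0
        self.trust[agent] = min(self.trust_cap(agent), max(0.0, t + rate * (target - t)))

    def check_bounded(self) -> bool:
        """Safety invariant: every delegated agent's trust <= parent trust * DELTA."""
        return all(
            self.trust.get(child, 0.0) <= self.trust.get(parent, 0.0) * DELTA + 1e-9
            for child, (parent, _) in self.parents.items()
        )

# Example: trust reachable through two delegation hops stays below 0.9 * DELTA**2
# no matter how many positive observations the leaf agent accumulates.
s = TrustState(trust={"orchestrator": 0.9})
s.delegate("orchestrator", "researcher")
s.delegate("researcher", "scraper")
s.observe("scraper", outcome_good=True)
print(s.trust["scraper"], s.check_bounded())  # 0.576 True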

Files (1.2 MB)

CogSec_MultiAgent_1_theory_DAF_Jan-28-2026.pdf
931.7 kB (md5:34baacc1f498f131a52f7f7c10516b4f)

274.3 kB (md5:070633c7331f9741f24eb55f0a6651e9)

Additional details

Additional titles

Subtitle
Part 1 of 3: Theoretical Foundations

Dates

Available
2026-01-28

Software
https://github.com/docxology/cognitive_integrity/