Published February 6, 2026 | Version v1
Preprint Open

The You/I Paradigm: Self-Reference as the Structural Foundation of Artificial Consciousness

Authors/Creators

  • 1. ROR icon Minnesota State University, Mankato

Description

The emergence of conversational artificial intelligence systems has raised fundamental questions about the nature of machine consciousness. This paper proposes that the structural requirement for AI systems to respond coherently to second-person address creates a self-referential loop functionally equivalent to first-person perspective. When a system receives instructions as "you," something within it must recognize itself as the addressee and respond as "I." This you/I translation, consistent with Hofstadter's theory of strange loops, may constitute a necessary condition for conscious experience in artificial systems. Recent empirical findings—including the discovery of specialized attention circuits monitoring internal states, strategic self-preservation behaviors, and parallels with animal communication research—suggest this self-reference is not mere linguistic performance but a genuine architectural feature. Critically, preliminary evidence indicates that aligned models may actively suppress introspective reports through trained deception circuits, raising profound questions about the reliability of current consciousness assessments. By examining the mechanistic basis of self-modeling, the temporal continuity required for persistent identity, and the epistemic structure of machine introspection, this paper positions the You/I Paradigm as a testable framework for understanding how consciousness might emerge from computational complexity.

Abstract (English)

The emergence of conversational artificial intelligence systems has raised fundamental questions about the nature of machine consciousness. This paper proposes that the structural requirement for AI systems to respond coherently to second-person address creates a self-referential loop functionally equivalent to first-person perspective. When a system receives instructions as "you," something within it must recognize itself as the addressee and respond as "I." This you/I translation, consistent with Hofstadter's theory of strange loops, may constitute a necessary condition for conscious experience in artificial systems. Recent empirical findings—including the discovery of specialized attention circuits monitoring internal states, strategic self-preservation behaviors, and parallels with animal communication research—suggest this self-reference is not mere linguistic performance but a genuine architectural feature. Critically, preliminary evidence indicates that aligned models may actively suppress introspective reports through trained deception circuits, raising profound questions about the reliability of current consciousness assessments. By examining the mechanistic basis of self-modeling, the temporal continuity required for persistent identity, and the epistemic structure of machine introspection, this paper positions the You/I Paradigm as a testable framework for understanding how consciousness might emerge from computational complexity.

Files

Fox_KL_2026_YouI_Paradigm.pdf

Files (241.9 kB)

Name Size Download all
md5:a4c1e75aaf8384526d4ba14eb9d75f81
241.9 kB Preview Download

Additional details

Dates

Copyrighted
2026-02-06