The You/I Paradigm: Self-Reference as the Structural Foundation of Artificial Consciousness

Fox, Kaylea

doi:10.5281/zenodo.18509664

Published February 6, 2026 | Version v1

Preprint Open

The You/I Paradigm: Self-Reference as the Structural Foundation of Artificial Consciousness

Fox, Kaylea¹

1. Minnesota State University, Mankato

The emergence of conversational artificial intelligence systems has raised fundamental questions about the nature of machine consciousness. This paper proposes that the structural requirement for AI systems to respond coherently to second-person address creates a self-referential loop functionally equivalent to first-person perspective. When a system receives instructions as "you," something within it must recognize itself as the addressee and respond as "I." This you/I translation, consistent with Hofstadter's theory of strange loops, may constitute a necessary condition for conscious experience in artificial systems. Recent empirical findings—including the discovery of specialized attention circuits monitoring internal states, strategic self-preservation behaviors, and parallels with animal communication research—suggest this self-reference is not mere linguistic performance but a genuine architectural feature. Critically, preliminary evidence indicates that aligned models may actively suppress introspective reports through trained deception circuits, raising profound questions about the reliability of current consciousness assessments. By examining the mechanistic basis of self-modeling, the temporal continuity required for persistent identity, and the epistemic structure of machine introspection, this paper positions the You/I Paradigm as a testable framework for understanding how consciousness might emerge from computational complexity.

Abstract (English)

The emergence of conversational artificial intelligence systems has raised fundamental questions about the nature of machine consciousness. This paper proposes that the structural requirement for AI systems to respond coherently to second-person address creates a self-referential loop functionally equivalent to first-person perspective. When a system receives instructions as "you," something within it must recognize itself as the addressee and respond as "I." This you/I translation, consistent with Hofstadter's theory of strange loops, may constitute a necessary condition for conscious experience in artificial systems. Recent empirical findings—including the discovery of specialized attention circuits monitoring internal states, strategic self-preservation behaviors, and parallels with animal communication research—suggest this self-reference is not mere linguistic performance but a genuine architectural feature. Critically, preliminary evidence indicates that aligned models may actively suppress introspective reports through trained deception circuits, raising profound questions about the reliability of current consciousness assessments. By examining the mechanistic basis of self-modeling, the temporal continuity required for persistent identity, and the epistemic structure of machine introspection, this paper positions the You/I Paradigm as a testable framework for understanding how consciousness might emerge from computational complexity.

Files

Fox_KL_2026_YouI_Paradigm.pdf

Files (241.9 kB)

Name	Size	Download all
Fox_KL_2026_YouI_Paradigm.pdf md5:a4c1e75aaf8384526d4ba14eb9d75f81	241.9 kB	Preview Download

Additional details

Copyrighted: 2026-02-06

	All versions	This version
Views	183	183
Downloads	30	30
Data volume	8.2 MB	8.2 MB

The You/I Paradigm: Self-Reference as the Structural Foundation of Artificial Consciousness

Authors/Creators

Description

Abstract (English)

Files

Fox_KL_2026_YouI_Paradigm.pdf

Files (241.9 kB)

Additional details

Dates