Published May 13, 2026 | Version 5
Preprint Open

ConsciOS: A Viable Systems Architecture for Human and AI Alignment

Description

ConsciOS proposes a formal systems architecture for studying human and AI alignment as a property of nested control structure rather than only post-hoc behavioral correction. The paper models consciousness and self-regulation as a testable control architecture composed of an Embodied Controller for short-horizon perception-action loops, a Supervisory Controller for policy/frame selection, and a Meta-Controller for long-horizon priors and governance constraints.

The v5 manuscript is prepared as a journal-submission revision for a Hypothesis and Theory article. It sharpens the AI-alignment framing, removes nonessential metaphor and branding language, and clarifies the distinction between formal proposal, illustrative instrumentation, and future empirical validation. The core mechanisms remain: a Resonance Engine that combines expected utility, coherence, and cost; an Interoceptive Control Signal (ICS) as a proposed fast feedback channel; and Time-Integrated Coherence (TIC) as a proposed resource for policy complexity and option-availability.

The manuscript situates ConsciOS within systems theory, the Viable System Model, active inference, affect science, hierarchical reinforcement learning, and human-in-the-loop AI alignment. It provides formal definitions, algorithmic sketches, falsifiable hypotheses, proposed human-subjects and simulation protocols, governance considerations, and a reproducible toy instrumentation demo. The toy code is included to show how selector variables can be logged and visualized; it is not presented as empirical validation of the architecture.

Keywords: consciousness architecture; AI alignment; viable systems model; active inference; hierarchical reinforcement learning; interoception; coherence-based control; human-AI hybrids; cybernetics; systems engineering

Notes (English)

Version 5:

Major journal-submission revision. Core architecture, mathematical structure, and proposed research program remain continuous with v4.

Reframed the manuscript for journal submission as a Hypothesis and Theory article; sharpened AI-alignment scope; added journal-style data availability, ethics, author contribution, funding, conflict of interest, and generative AI statements; removed nonessential metaphors from the operational glossary; clarified that the toy simulation is illustrative instrumentation rather than empirical validation; updated Figure 3 terminology and prior-update labeling; refreshed repository metadata, citation metadata, license/contact details, and build outputs. 

Version 4:

Core architecture unchanged. Replaced internal metaphors with standard control-theoretic vocabulary. Aligned variable names and definitions with active inference and systems engineering literature. The underlying mathematical logic, control topology, and architectural specifications remain identical to previous versions.

Version 3:

Core architecture unchanged. Expanded comparative analysis with standard Hierarchical Reinforcement Learning (HRL). Added discussion on "coherent but misaligned" failure modes and computational feasibility in high-dimensional spaces.

Version 2:

Core architecture unchanged. Refined formatting for academic neutrality.

Files

ConsciOS_v5_preprint.pdf

Files (4.3 MB)

Name Size Download all
md5:937640a1792138024eec7670afcd91d1
2.7 MB Preview Download
md5:700017b751d92e2e7c12e2147f8f6ed7
1.6 MB Preview Download

Additional details

Related works

Is cited by
Book: 10.5281/zenodo.17898213 (DOI)

Dates

Updated
2026-05-13
Version 5 manuscript revision

Software

Repository URL
https://github.com/Sistemist/consciOS-paper
Programming language
Python , Shell , TeX
Development Status
Active