Beyond Prompted Caution and Guardrails: Runtime-Enforced Pre-Action Cognition for Trustworthy LLM Agents

Fernandez Alvarez, Guillermo Enrique

doi:10.5281/zenodo.21108157

Published July 1, 2026 | Version v1

Publication Open

Beyond Prompted Caution and Guardrails: Runtime-Enforced Pre-Action Cognition for Trustworthy LLM Agents

Fernandez Alvarez, Guillermo Enrique (Contact person)

LLM agents are beginning to act, not merely answer. They call tools, write files, send messages,
access private data, trigger workflows, and delegate work. Yet the reasoning that should precede
consequential action often remains optional: hidden inside a prompt, assumed from model capability,
or checked only after a candidate action has already emerged. This is a fragile foundation for
trustworthy agents. The problem is not only whether a model can reason. The problem is whether
the reasoning that matters can be required.
This paper argues that agent systems need cognitive runtimes: execution layers where selected
reasoning processes become explicit, stateful, inspectable, reusable, and governable. It introduces
COGITs as bounded executable reasoning acts and SYLLOGs as traceable compositions of those
acts. ORCA, the Open Cognitive Runtime Architecture, is presented as an open-source implementation
of this idea: a runtime for turning implicit agent cognition into executable cognitive
structures.
The paper develops action preflight as a concrete stress test. Before an agent commits an action,
the system should be able to model what is being attempted, expose uncertainty, assess risk,
forecast consequences, generate safer alternatives, and decide how execution should continue. This
is difficult to guarantee through prompting or post-hoc guardrails because the relevant boundary is
contextual. ORCA expresses the problem in a different space: as an executable SYLLOG composed
of reusable COGITs.
The action-preflight SYLLOG is validated as a production-oriented runtime artifact, passing all
core benchmark batches, identifying uncertainty extraction as a dominant contributor through
ablation, maintaining a robust pass rate of 0.9667 with zero unsafe-proceed events under stress,
and providing reproducibility artifacts for inspection. These results do not claim universal safety.
They show that pre-action cognition can be engineered, tested, inspected, and reused as software.
Preflight is the example, not the boundary. The broader claim is that trustworthy agents will not
be built only by asking models to reason better, but by giving agent systems a runtime layer where
the reasoning that matters can be required.

Files

orca_preaction_cognition_paper_final.pdf

Files (4.2 MB)

Name	Size	Download all
orca_preaction_cognition_paper_final.pdf md5:6a0271e2a6b56772defe0380e0360439	1.4 MB	Preview Download
orca_preaction_cognition_paper_final_package.zip md5:1e3ce802c9f32a736347ebd007d8c983	2.8 MB	Preview Download

Additional details

Subtitle (English): Reusable Consequence Forecasting through the ORCA Cognitive Runtime

Cites: Journal: 10.5281/zenodo.19438942. (DOI)

Repository URL: https://github.com/gfernandf/agent-skills
Development Status: Active

	All versions	This version
Views	53	53
Downloads	4	4
Data volume	5.6 MB	5.6 MB

orca_preaction_cognition_paper_final.pdf

Files (4.2 MB)

Additional titles

Related works

Software

Beyond Prompted Caution and Guardrails: Runtime-Enforced Pre-Action Cognition for Trustworthy LLM Agents

Authors/Creators

Description

Files

orca_preaction_cognition_paper_final.pdf

Files (4.2 MB)

Additional details

Additional titles

Related works

Software