TEL-OS v2.0: Inference-Only Latent Governance and Attention Guillotine for LLM Security

gutierrez alvarez tostado, josue johnatan

doi:10.5281/zenodo.18903148

Published March 7, 2026 | Version 2

Preprint Open

TEL-OS v2.0: Inference-Only Latent Governance and Attention Guillotine for LLM Security

gutierrez alvarez tostado, josue johnatan (Researcher)

Traditional AI alignment strategies (RLHF, system prompts) rely on "semantic guardrails" that are structurally vulnerable to adversarial jailbreaks like Prefix Injections and Many-Shot attacks. We present TEL-OS v2.0, a mechanistic interpretability framework that neutralizes these threats by intervening directly in the model's residual stream. Using a combination of Latent Refinement, Attention Guillotines, and the Love Equation for tensor governance, TEL-OS achieves a 0.0% Attack Success Rate (ASR) while maintaining 100% fluent output on Llama-3.1-8B. Our results prove that safety can be guaranteed as an intrinsic physical invariant of the model's latent manifold, independent of prompt-based filtering.

Files

TELOS.pdf

Files (8.6 kB)

Name	Size	Download all
TELOS.pdf md5:96c2f942c21a83c54f481b9a277d9861	8.6 kB	Preview Download

Additional details

Submitted: 2026-03-07

Repository URL: https://github.com/jostoz/tel-os
Programming language: Python
Development Status: Active

Views

Downloads

Show more details

	All versions	This version
Views	29	29
Downloads	27	27
Data volume	353.5 kB	353.5 kB

More info on how stats are collected....

DOI

Resource type

Preprint

Publisher

Zenodo

Languages

English

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: March 7, 2026
Modified: March 7, 2026

TELOS.pdf

Files (8.6 kB)

Dates

Software

TEL-OS v2.0: Inference-Only Latent Governance and Attention Guillotine for LLM Security

Authors/Creators

Description

Files

TELOS.pdf

Files (8.6 kB)

Additional details

Dates

Software