Validity Mirage: Context Compression Failure Modes in LLMs
Description
This archive presents five working papers on context compression failure modes in large language models.
The central finding is the validity mirage: naive context compression can preserve surface-level answer correctness while silently substituting the governing hypothesis, causing a model to answer confidently about the wrong task. We develop a tropical semiring algebra (max-plus over ℝ ∪ {−∞}) for measuring context health under compression, and show that structurally guarded retention policies eliminate pivot drift where recency-based baselines fail completely.
Empirical validation spans five open-weight model architectures (Llama 3.1 8B, Mistral 7B v0.3, Gemma 2 9B, Phi-3 Medium 14B, Qwen 2.5 14B) across 11,400+ boundary instances and 4,200+ streaming trials, with additional testing against 13 real incident graphs (12 NTSB aviation investigations and the Knight Capital 2012 trading failure). A production MCP server implementation is available separately.
Included papers:
Paper 00: Continuous Control and Structural Regularization in Multi-Agent Narrative Extraction
Paper 01: Absorbing States in Greedy Search
Paper 02: Streaming Oscillation Traps in Endogenous-Pivot Sequential Extraction
Paper 03: The Validity Mirage: Context Algebra for Endogenous Semantics under Memory Compression
Paper I: Tropical Algebra of Endogenous-Pivot Semantics
Reproducible validation artifacts and benchmark outputs are included in the results/ directory. All papers are working paper first drafts distributed under CC-BY 4.0.
Files
Files
(1.8 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:008c968fd7c13fd53c7b9c3120ee3661
|
1.8 MB | Download |
Additional details
Related works
- Is supplement to
- Software: https://github.com/jack-chaudier/dreams (URL)
- Software: https://github.com/jack-chaudier/tropical-mcp (URL)
- Presentation: https://dreams-dun.vercel.app (URL)
Dates
- Collected
-
2025-09-01 / 2026-02-26Research period