Published June 12, 2026 | Version Perturbation Draft PD v2.0
Preprint Open

Counting What You Have Not Yet Said

Authors/Creators

  • 1. Srødingr ApS

Contributors

Project member:

Description

A family of viral tests circulates as proof that large language models do not understand language. The canonical instance asks for a translation of the Italian Solo 3 parole: non sei solo; the model renders it Just 3 words: you are not alone, failing to notice that the English phrase contains four words and that the sentence's meta-linguistic claim must therefore update. This paper accepts the failure as real and rejects the inference drawn from it. The test conflates four distinct capabilities: detection that a numeral predicates the sentence's own form rather than world content; recomputation of form-predicates under a form-altering transformation; counting, which is trivially within capability when posed as a task; and output-commitment ordering, the autoregressive constraint that the count is emitted before the counted span exists. Only the conjunction fails; each component dissociates, and the dissociation pattern is diagnostic of architecture and pipeline, not of understanding. The paper makes three further moves. First, it identifies the self-referential count as a structural inversion of the frame-before-content principle: the orderings that sequential comprehension rewards become impossible commitments when the frame is a predicate over content that does not yet exist, which is why human translators solve the problem in revision and why single-pass machine output is being compared, unmarked, to post-edit human output. Second, it prices the remedy. An always-on whole-response verification pass by a Haiku-class model costs on the order of six hundredths of a cent per response and roughly two hundred million dollars annually at frontier-platform scale, but its binding cost is architectural: a verifier that must see the complete response kills streaming. A pipelined per-sentence verifier preserves streaming at sentence granularity, leaves steady-state throughput untouched because small verifiers outrun large generators, and exposes the design space's general law, here named the revision horizon: a streaming system can repair only what it has not yet released, so the release quantum sets the upper bound on the scope of self-reference a pipeline can enforce; a scope-adaptive vanguard makes the quantum a variable, widening holdback to match detected predicate scope and falling back to predicate postposition when scope exceeds the buffer. Gated triggering reduces the cost by three to four orders of magnitude, which establishes that the layer's absence is an engineering choice rather than an economic necessity. The architecture turns out to converge with sentence-buffered streaming guardrails developed independently and contemporaneously in the safety literature; the paper discloses the prior art it found between its own first and second drafts, and reads convergence from two unrelated problem domains as evidence that the revision horizon is a general law of streaming systems. Third, it asks what the layer would buy, and answers: mostly benchmark optics. A verifier tuned to viral test families is measurement colonising the phenomenon; a general meta-linguistic verifier inherits the detection problem it was meant to solve, because deciding that a numeral is a form-predicate is the self-reference problem. The paper closes with three falsifiable predictions, the strongest of which is that pass rates on such tests will track revision access and training contamination rather than model scale, making the test self-defeating as evidence on roughly the timescale of its own virality. An appendix records the paper's own production latency, hours from provocation to preprint, and asks what scholarship at conversation speed does to the architecture of publishing, including what it did for this paper: a post-completion literature search that caught a two-week-old overlapping result inside the deposit window and was answered the same day.

Files

Counting_What_You_Have_Not_Yet_SaidPD2.pdf

Files (407.2 kB)

Name Size Download all
md5:5295c591ecfdd43feed58d885c99aff3
407.2 kB Preview Download

Additional details

Additional titles

Subtitle
Self-Referential Invariants, the Missing Edit Pass, and the Price of Checking

References

  • Anthropic. (2026). Claude Haiku 4.5 pricing. Published API rates, verified 12 June 2026. https://www.anthropic.com/claude/haiku
  • Yu, Jiaqi, Xin Wang, Yixu Wang, Jie Li, Yan Teng, Xingjun Ma, and Yingchun Wang. (2026). SentGuard: Sentence-Level Streaming Guardrails for Large Language Models. arXiv preprint arXiv:2606.02041. https://arxiv.org/abs/2606.02041
  • NVIDIA. (2025). Stream Smarter and Safer: How NeMo Guardrails Enhance LLM Output Streaming. NVIDIA Technical Blog and NeMo Guardrails documentation. https://developer.nvidia.com/blog/stream-smarter-and-safer-learn-how-nvidia-nemo-guardrails-enhance-llm-output-streaming/
  • Temte, Storm Bjørn. (2026a). Binary Bias: The Foundational Cognitive Constraint That Precedes and Shapes All Other Biases. Srødingr ApS preprint.
  • Temte, Storm Bjørn. (2026e). The Perturbation Window: Why Early Publication Maximises Intellectual Robustness. Srødingr ApS preprint.
  • Temte, Storm Bjørn. (2026f). LLMs as Perturbation Partners: Adversarial Prompting, Multi-Model Triangulation, and the Future of AI-Augmented Scholarship. Srødingr ApS preprint.
  • Temte, Storm Bjørn. (2026g). Optimal Release Timing: A Decision-Theoretic Framework for Perturbation Drafts. Srødingr ApS preprint.
  • Temte, Storm Bjørn. (2026n). Convergent Ergonomics: How a Narrative Format for Hyperphantasic Readers and a Transformer Architectural Limitation Reveal the Same Information-Ordering Principle. Srødingr ApS preprint, Zenodo.
  • Temte, Storm Bjørn. (2026p). The Causal Mask Problem: How an Unexamined Architectural Choice Degrades Comprehension in Head-Final Languages. Srødingr ApS preprint, Zenodo.
  • Temte, Storm Bjørn. (2026q). Superpositional Text: Against Premature Collapse in Human and Machine Language. Srødingr ApS preprint, Zenodo, PD v1.1. https://doi.org/10.5281/zenodo.20489581
  • Temte, Storm Bjørn. (2026as). Translation as Probability Cloud: Against Premature Collapse in Cross-Linguistic Semantic Recovery. Srødingr ApS preprint, Zenodo.
  • Temte, Storm Bjørn. (forthcoming). Exceltricity and the Mummified Metric. Citation home unconfirmed; verify against the canonical spine before deposit.