Published February 7, 2026 | Version v3
Preprint Open

The Voynich Protocol v3.0: Statistical Analysis of MS408 as a Formal Technical System

Authors/Creators

Description

Abstract

This paper presents a comprehensive statistical analysis of the Voynich Manuscript (MS408), proposing its interpretation as technical documentation of a formal system rather than encrypted natural language. Through rigorous quantitative analysis of 116,755 words from the complete EVA transcription, we demonstrate that MS408 exhibits statistical properties inconsistent with natural language but strongly aligned with formal technical protocols.

Key Statistical Findings:

  • Type-Token Ratio (TTR): 0.130 — Places MS408 strictly within the range of formal languages (0.1–0.3), effectively ruling out natural language encryption (typically 0.4–0.6).

  • Zipf’s Law: $\alpha = 1.27 \pm 0.07$ — Matches technical documentation and server logs, distinct from natural narrative ($\alpha \approx 1.0$).

  • Structural Formality: 38.7% of the corpus is concentrated in the top-10 bigram patterns.

  • Sequential Determinism: 47% of word-pair sequences show deterministic repetition, characteristic of procedural protocols.

Conclusion:

Using Bayesian inference, we assess the "Systemic Protocol" hypothesis at 85% probability. We propose a paradigm shift from cryptanalytic to systems-engineering approaches in Voynich research. This document (v3.0) supersedes previous hypotheses regarding the "R&D Logbook".

Files

Voynich_Protocol_v3_0.pdf

Files (1.1 MB)

Name Size Download all
md5:effd5d8648b3d0463baa123b4099323b
814.9 kB Preview Download
md5:4c65a36eb7207367809910175a2c0fa0
328.3 kB Preview Download