Published February 11, 2026
| Version v4
Preprint
Open
The Voynich Manuscript Deciphered: A Phonetic Transcription of Spoken Elu-Sinhala
Description
The Voynich Manuscript (Beinecke MS 408, carbon-dated 1404-1438 CE) is identified as a 15th-century Elu-Sinhala pharmaceutical text - a teaching manual recording a physician's spoken instructions for Ayurvedic preparations. The writing system is a bespoke abugida mapping 27 EVA characters to 14 Sinhala phonemes.
Key results: 90.7% of 35,916 tokens glossable in English; 99.3% matching Sinhala dictionary; only 0.7% truly unknown.
v4 changes - Adversarial fairness audit:
- Equalized pharmaceutical vocabulary: 115 concepts x 6 languages, max/min ratio 1.25
- Fair cross-language test: Sinhala matches 13x more pharmaceutical terms than any other language (Z=1.66, honestly below 2.0 significance threshold)
- Panchavidha Z=10.5 downgraded to CIRCULAR: 3/6 terms not in Sinhala dictionary. Non-circular test: external pharma Z=3.5
- 8 new validation scripts with full random-decoder comparisons
- Failed tests documented: folio clustering (Z=0.2), recipe sequencing (all metrics failed)
- 3 STRONG PASS pillars: SOV syntax (Z=7.04), pharmaceutical collocations (0/10 random), external pharma vocabulary (Z=3.5)
- 3 cross-modal convergences (inherently random-proof)
Notes
Files
main.pdf
Additional details
Related works
- Is supplemented by
- https://github.com/kamb-code/Voynich (URL)