Skip to the research ↓
Atmospheric illustration of a medieval scribe at work, artistic rendering, not a manuscript page
A mystery you can measure

A 600-year-old book
no one can read.

Sometime in the early 1400s, someone filled 240 pages of vellum with a script that has no parallel and a language no one has matched. Six centuries of scholars, cryptographers, and codebreakers have tried, and failed, to read it.

We cannot read it either. But we can measure it. Scroll down.

Atmosphere only · artistic rendering, not a real folio
Voynich Manuscript folio f33r, an impossible plant, Beinecke MS 408, Yale
The strangeness

Unknown script.
Impossible plants.
Unreadable stars.

The plants do not grow anywhere on Earth. The star charts match no sky. The script repeats and flows like writing, yet every attempt to read it dissolves under inspection.

These are real pages from the manuscript, held at Yale's Beinecke Library. Look closely, and the writing almost makes sense. Almost.

MS 408 · f33r · Yale Beinecke · Public domain
Voynich Manuscript folio f67r, an astronomical or cosmological diagram, Beinecke MS 408, Yale
The same hand, page after page

Whoever wrote it
was not improvising.

The text is consistent, fluent, and patterned. That consistency is exactly what makes the manuscript so hard to dismiss as nonsense, and exactly what we can put numbers to.

MS 408 · f67r · Yale Beinecke · Public domain
What the numbers show

It behaves like a real language
on some axes. Not on others.

Measure the word frequencies and they follow Zipf's law, the same lopsided curve every human language obeys. Measure the per-character entropy and it lands inside the range of real languages. By those tests, this is not random noise. But measure how rigidly the words are built, and the manuscript pulls away from every natural language we have checked.

Random gibberish Natural language
Random gibberish

No structure

Flat word frequencies, no repeating internal patterns, no Zipf curve. The Voynich text is clearly not this.

The Voynich sits here

Intermediate Structural Regime

More organized than gibberish, yet more rigidly patterned than any natural language we measured. It looks language-like on Zipf and entropy, and anomalous on word-internal structure.

Natural language

Flexible structure

Zipf curve, language-range entropy, and a looser, more varied internal word structure than the Voynich shows.

That gap is the real puzzle. Not a hidden message waiting to be unlocked, but a structure that is too organized to be noise and too rigid to be an ordinary language.

Two hands

It was not written
in one voice.

Decades ago, Prescott Currier noticed the manuscript splits into two writing systems, which he called Language A and Language B. They use the same script but behave differently. With a reproducible structural measure, that split is not a hunch. It is statistical.

d = −1.01
Separation between Currier A and B · p < 0.001
Currier A

One set of habits

The earlier sections share a consistent statistical fingerprint in how words are built and combined.

Currier B

A measurably different one

The later sections separate cleanly from A on the same metric, a large, reliable effect, not noise.

See how the two hands separate →

Voynich Manuscript folio f78r, a biological section illustration, Beinecke MS 408, Yale
Why no one has read it

Every claimed solution
fails the same test.

Proto-Romance. A medieval health manual. An elaborate hoax. "AI cracked it." Dozens of announcements, across a century. Each one fits a few words, declares victory, and never produces coherent, repeatable text that holds up on pages the author never used.

See every claim, and exactly why each failed →

MS 408 · f78r · Yale Beinecke · Public domain
The invitation

A mystery you can measure.

We do not claim to read the Voynich Manuscript. We claim something narrower and verifiable: we can measure its structure, and what we measure is reproducible by anyone. Here is where to go next.

A note on honesty: this page measures structure, it does not read the text. Nothing here is a decipherment or a translation. The value is in what can be checked: numbers any researcher can reproduce, and a puzzle stated plainly instead of solved by assertion.