★ Start Here The Manuscript Library What It Isn't Findings Curiosities Methodology Press Collaborate Resources
What the community disagrees about

Open Debates

The questions where serious researchers genuinely disagree. No false consensus.

Debate 01

Is Currier A/B binary or multi-dimensional?

The traditional view: two distinct languages, or dialects, corresponding to distinct sections of the manuscript. The A/B distinction has anchored Voynich linguistics since Currier first articulated it in the 1970s, and it remains one of the few structural facts the community accepts without serious contest.

Parisel (2026) challenges the framing: A/B is described as a "low-resolution projection of a three-layer generative structure" — a boolean folio switch combined with a template system plus a gradient d/l dimension. On this model, calling it two languages mistakes a projection of a richer structure for the structure itself.

What we don't know: do more dimensions exist beyond these three? Are the "layers" linguistically meaningful — reflecting dialect, register, topic — or purely statistical artifacts of a generative procedure with no semantic correlate? And if the boolean switch is a scribal choice, does it reveal something about how the manuscript was produced or simply about who produced it?

Current positions

Parisel (2026): three layers confirmed computationally. Community: broadly accepts A/B as real; split on whether the multi-dimensional reframing adds explanatory power or complexity without resolution.

Genuinely open
Debate 02

Does the VMML signal reflect morphological structure or statistical artifact?

Our finding: Voynich BPE Mean Morpheme Length (VMML) of 5.918 exceeds the alphabetic ceiling established across 63 language corpora. Among tested natural languages, no alphabetic script reaches this value. The signal appears structurally significant.

The counter-argument is worth taking seriously: BPE VMML could be inflated by consistent character n-gram patterns that aren't morphological — for example, a cipher with fixed-length blocks, or a script with systematic digraph conventions, could produce elevated VMML without any underlying morphological structure. The metric measures compression efficiency, not meaning.

Our response: the character shuffle test shows a 22% VMML drop for Voynich versus 13–14% for Tagalog under positional randomization. This asymmetry suggests genuine positional structure — something is concentrated at specific positions in the token stream. But it doesn't definitively resolve whether that structure is morphological (prefixes, suffixes, roots) or ciphered (block boundaries, padding conventions).

Open question

Is VMML discriminating between natural language and cipher, or between high-entropy and low-entropy systems more broadly? The shuffle test narrows this but doesn't close it.

Partially constrained, not resolved
Debate 03

The Rugg hoax question — still open?

Rugg (2004) demonstrated that a Cardan grille — a simple mechanical device using a perforated card slid over a table of syllables — could generate text with Voynich-like statistical properties. The paper was widely cited as evidence that the manuscript might be an elaborate, content-free hoax.

Montemurro and Zanette (2013) showed that the Voynich exhibits semantic clustering — high-information words distributed non-randomly across sections in a way inconsistent with random hoax generation. Their method, borrowed from natural language processing, suggests the text carries semantic structure even if we can't read it.

The question remains: could a skilled hoaxer deliberately engineer semantic-like clustering? Bax (2014)'s word identifications — tentative but specific — suggest particular tokens carry meanings consistent with illustrations, which is also difficult to explain as coincidence in a hoax. Current state: the hoax hypothesis is statistically disfavored by multiple independent analyses, but has not been formally ruled out. No mathematical proof of impossibility exists.

Current state

Statistically disfavored. Not ruled out. The community leans toward genuine script but acknowledges the hoax cannot be definitively eliminated without decipherment.

Disfavored but not closed
Debate 04

What does the Tagalog result actually mean?

Among natural languages tested, Tagalog produced the closest VMML match to the Voynich on one metric — an elevated value consistent with agglutinative morphology. This result attracted attention as the first natural-language near-match on that dimension.

The alternative interpretation cuts directly: both Tagalog texts tested were authored by José Rizal. This tests intra-author stability within a single writer's style, not cross-author language stability. Before concluding anything about Tagalog as a language, more texts from different authors, periods, and registers are needed.

We agree this is a genuine limitation. The finding stands as "Tagalog VMML is elevated in the tested Rizal corpus" — not as "Tagalog structurally resembles the Voynich." The two statements are very different claims with very different implications. We have been explicit about this in our reporting and intend to extend testing when suitable corpora are available.

Status

We acknowledge the limitation. The Tagalog finding is preliminary and author-specific. It is reported honestly, not as a strong claim about the language.

Self-acknowledged limitation
Debate 05

Multiple scribes — what do they imply for the structure?

Davis (2020) identifies at least five distinct scribal hands in the manuscript, correlated imperfectly with the Currier A/B sections but not reducible to them. The relationship between scribal identity and statistical signature is not one-to-one.

If two scribes produce two statistical profiles, the most natural interpretation is two dialects, registers, or language variants — consistent with a living language used across a community. If five scribes produce only two statistical profiles, that requires explanation: why do three additional hands leave no statistical trace?

Parisel's boolean switch could reflect scribal convention, topic, register, or something else entirely. The possibility that a single scribe could switch statistical regimes deliberately — as a cipher operator selecting between two encoding modes — is also not excluded. The scribal hand evidence is real; its interpretation for the linguistic hypothesis is contested.

Open question

Does scribal hand correlate with language/dialect, encoding mode, or neither? Davis's identification of five hands with two statistical profiles needs a satisfying explanation from any serious hypothesis.

Genuinely open

"These debates are active in the research community. The most sustained public discussion of Voynich linguistics, codicology, and hypothesis testing happens at voynich.ninja — the serious end of online Voynich scholarship."

Follow the discussion at voynich.ninja →
↑ Back to top