Twenty documented facts about the Voynich Manuscript — strange, verified, and stranger for being true. No speculation. No "could be." These are things we actually know.
X-ray fluorescence (XRF) analysis of the manuscript's ink composition revealed iron-gall and copper-based pigments consistent with 15th-century European production. No modern synthetic compounds were found. The result rules out any 19th- or 20th-century forgery of the writing itself.
Emperor Rudolf II of Bohemia reportedly purchased the manuscript for 600 gold ducats, according to a letter written by Johannes Marcus Marci to Athanasius Kircher in 1665. Rudolf was an obsessive collector of curiosities, automata, and esoteric objects. The price suggests he believed it was the genuine work of Roger Bacon.
Radiocarbon dating performed in 2009 at the University of Arizona established that the parchment was made between approximately 1404 and 1438, with 95% confidence. This makes the manuscript early 15th century — older than many had assumed, and definitively pre-Columbian.
During and after World War II, some of the world's best codebreakers — including analysts at the NSA — attempted to crack the Voynich script. Declassified NSA documents released in 2016 confirm internal studies were conducted, all inconclusive. William Friedman, who broke the Japanese "Purple" cipher, called it "the most mysterious manuscript in the world."
By conservative count, the Voynich Manuscript has been "solved" more than fifty times since the early 20th century. Proposed languages include Hebrew, Arabic, Latin, Nahuatl, Ukrainian, proto-Romance, and invented ciphers. Not one decipherment has produced a coherent, independently translatable text verified by linguists outside the proposer's circle.
Unlike virtually every manuscript of similar length, the Voynich text shows almost no crossing-out, overwriting, or erasure. Scholars have proposed three explanations: the scribe copied from a pre-composed text and was extremely skilled; the text was produced using some mechanical or systematic method; or the absence of corrections is itself a signal that the text is not composed the way natural language is.
The "herbal" section depicts what appear to be medicinal plants. They have roots, stems, leaves, and flowers that closely resemble real botanical structures — but don't correspond to any identified species. Botanists have proposed they are composites, invented species, or highly stylized versions of real plants. No systematic botanical identification has succeeded across the full herbal section.
One diagram in the astronomical section shows a starburst pattern that some researchers have proposed could represent the Crab Nebula supernova (SN 1054), which was visible from Earth and recorded by Chinese astronomers. The identification is intriguing but unverified — the manuscript's date range makes it temporally plausible only if the illustration was copied from an older source.
The Voynich Manuscript has dedicated Wikipedia entries in more than 40 languages, including Esperanto, Basque, and Swahili. It is one of the most internationally documented mysteries in the history of written artefacts. The English article alone runs to over 15,000 words and has been cited in academic publications.
Ethel Lilian Voynich — wife of Wilfrid Voynich, who bought the manuscript in 1912 — was the author of The Gadfly (1897), a political adventure novel that became a massive bestseller, especially in Russia and China, where it sold tens of millions of copies in the 20th century. The couple's lives were themselves stranger than most fiction.
Yale University's Beinecke Rare Book and Manuscript Library digitized the complete manuscript in high resolution and made all images freely available online. Every folio, every detail, every stain and fold is publicly accessible. This has democratized research and enabled computational analysis that would have been impossible with physical access alone.
The manuscript passed through the Jesuit College at Mondragone, was hidden during the 18th-century suppression of the Society of Jesus, survived both world wars while stored in European collections, was sold privately in 1912, donated to Yale in 1969, and digitized in 2004. Through all of it, not a single line has been translated. It has outlasted every attempt to understand it.
Before 1998, every Voynich researcher used a different notation for the manuscript's unknown characters. Comparing results across teams was nearly impossible. Gabriel Landini and René Zandbergen created EVA — the European Voynich Alphabet — as a standardized ASCII-compatible transcription system that could be processed by computers and shared across institutions.
EVA assigns a short alphabetic code to each Voynich glyph, enabling researchers worldwide to work on the same dataset. Without it, computational studies like entropy analysis, Zipf distributions, and BPE clustering would have been far harder to coordinate.
Zipf's Law states that in any natural language, the most common word appears roughly twice as often as the second most common, three times as often as the third, and so on — producing a characteristic power-law distribution. A 2014 analysis confirmed that Voynichese obeys this law with high fidelity.
This finding is significant because pure random text does not follow Zipf's Law. It rules out gibberish or purely stochastic generation and is one of the strongest statistical arguments that the manuscript encodes something structured — whether a language, a cipher, or a highly systematic invented system.
Hans Peter Kraus, the rare book dealer who bought the manuscript from Anne Nill (Voynich's secretary) for $24,500 in 1961, spent years trying to sell it for $160,000. When he failed to find a buyer, he tried to donate it to the Library of Congress. They declined — citing lack of budget to acquire it properly and insufficient expertise to study it.
Yale University's Beinecke Rare Book and Manuscript Library accepted the donation in 1969. It has been there ever since, catalogued as MS 408. The Library of Congress's refusal is now one of the more quietly famous decisions in the history of American rare books.
Gordon Rugg, a cognitive scientist at Keele University, demonstrated in 2003 that a Cardan grille — a card with cut-out holes slid over a grid of syllables — could generate text with statistical properties resembling Voynich's. The experiment reignited the hoax debate and received wide press coverage.
However, Rugg's demonstration only showed that such a device could produce superficially similar text — not that it did, or that the full statistical profile of the manuscript (including its semantic clustering, Zipf compliance, and boundary concentration patterns) can be replicated this way. Subsequent computational tests found the match incomplete.
The Voynich Manuscript has inspired at least three published novels, two feature films, a graphic novel series, and multiple video game puzzles. Writers and filmmakers are drawn to its combination of visual strangeness and complete opacity — a mystery with no solution to spoil.
The most cited fictional connection is Umberto Eco's The Name of the Rose (1980), which features an encrypted medieval manuscript. Eco, however, explicitly denied the Voynich was his inspiration — he cited other sources. The association persists anyway, which is itself something of a Voynich phenomenon: things attach to it regardless of evidence.
Marcelo Montemurro and Damián Zanette (2013) applied information-theoretic analysis to the Voynich text and found that high-entropy "keyword" words — the kind that carry topical meaning in natural language — are distributed across sections in ways consistent with semantic organisation. The herbal section has different keywords from the astronomical section, which differs from the biological section.
This pattern is extremely difficult to produce by chance or by simple mechanical text generation. It implies that whatever the manuscript encodes, it is not random — and it is organized around content, not just form. This is one of the most technically rigorous results in Voynich research.
The "Biological" section of the manuscript — sometimes called the balneological section — depicts dozens of nude female figures bathing in interconnected pools and tubes rendered in green ink. It is the most photographed and most cited section in popular coverage of the manuscript, and the most resistant to interpretation.
Leading hypotheses include medical illustrations of circulatory or reproductive systems, alchemical diagrams representing transformation processes, a medieval spa or therapeutic bathing manual, and purely symbolic cosmological imagery. No hypothesis has been adopted by more than a small number of researchers, and none has connected the imagery to any known illustrated tradition.
Stephen Bax, a professor of applied linguistics at the University of Bedfordshire, published a working paper in 2014 proposing the identification of 14 specific words in the Voynich text. His method was comparative iconography: finding illustrations in the manuscript that closely resembled labeled images in other medieval manuscripts (stars, plants, constellations) and then reading the Voynich labels against the known names.
Among his proposed identifications was the word for Taurus, matched to the constellation diagram. The method is legitimate in principle — it is how Linear B was partially cracked — but requires that the labels actually name the illustrated objects, which is an assumption for Voynich. His proposal has not been independently replicated, but has also not been definitively refuted.