The Characters That Haunt the Dictionary
Every writing system has its oddities. English has silent letters. French has accents that seem to exist purely out of spite. But Japanese has something far stranger: ghost kanji — characters that were officially registered in the national standard, printed in dictionaries, coded into computers, and yet no one knows what they mean.
They have no confirmed readings. No established definitions. No traceable history of ever being used in any document, inscription, or text. They simply appeared one day in the official record, as if summoned from nowhere — and they have refused to leave.
In Japanese, they are called 幽霊漢字 (yūrei kanji), literally "ghost characters." And their story is one of the most fascinating, and funniest, footnotes in the history of writing itself.
How Do You Accidentally Create a Character?
The story begins in 1978, when Japan's Ministry of Economy, Trade and Industry undertook an enormous project: the creation of JIS漢字, the Japanese Industrial Standard character set for computing. The digital age was arriving, and Japan needed to decide which kanji would live inside its machines.
The task was monumental. Japanese uses thousands of kanji in daily life, but the compilers wanted to be thorough. They drew from municipal records, regional name registries, place-name databases, and existing dictionaries. Lists were compiled from dozens of sources, cross-referenced, and merged into a master set. In 1978, the first JIS standard (JIS C 6226) encoded 6,349 characters.
And buried among them were ghosts.
- It was included in the JIS standard (1978 or later revisions)
- No one can identify a confirmed, historical usage of the character
- No established reading (pronunciation) or meaning can be verified
- It likely originated from a transcription error, misprint, or misreading during the compilation process
The problem was deceptively simple. When compilers gathered kanji from handwritten municipal documents — records of place names, family registries, land surveys — they sometimes encountered characters that were smudged, poorly written, or unique to a single local source. A brushstroke that bled into another. A radical that was slightly off. A clerk's personal shorthand mistaken for an actual character.
Nobody double-checked whether these characters had ever existed in mainstream use. They were dutifully added to the list, coded into the standard, and released into the digital wild. Once inside JIS, they propagated into every computer, every font, every dictionary that used the standard as its foundation.
Meet the Ghosts
In the early 2000s, a researcher named 笹原宏之 (Sasahara Hiroyuki), a professor of linguistics at Waseda University, conducted a landmark investigation into these spectral characters. His findings were both meticulous and delightful.
One of the most famous ghost kanji is 妛. This character appears in the JIS set, but no dictionary published before 1978 contains it. Sasahara traced its origin to a place-name registry where the character 𡚴 (a rare but real kanji) had been printed with a defect — the top portion broke apart, and the vertical stroke became detached, creating what looked like an entirely new character: 山 (mountain) sitting atop 女 (woman). The printing artifact was faithfully transcribed as a "new" kanji and entered the standard.
Another ghost, 彁, is perhaps the purest phantom of them all. It has no known reading, no known meaning, and no confirmed source. Multiple researchers have attempted to trace its origin and failed. It sits in the JIS code chart like an uninvited guest at a banquet — present, silent, and utterly inexplicable.
The character 碵 appears to have been born when someone misread 碯 (a variant of 瑪, as in agate) during transcription. A single stroke went astray, and a ghost was born.
- 妛 — Likely a printing defect of another character. Mountain + woman, but nobody knows why.
- 彁 — The ultimate ghost. No reading, no meaning, no source. Nothing.
- 碵 — Probably a misreading of 碯. A stone radical paired with the wrong right half.
- 蟐 — Suspected transcription error from a regional insect name registry.
- 袮 — Likely confused with 祢, the shrine-related character, due to a radical swap.
Why the Ghosts Won't Leave
Here is where the story becomes uniquely Japanese. You might expect that, once discovered, these phantom characters would simply be deleted. Problem identified, problem solved. But that is not what happened.
Once a character enters the JIS standard, removing it would break backward compatibility. Every database, every archived document, every system that references that code point depends on its continued existence. If you delete a ghost kanji, you create a hole in the digital fabric. Software that encounters the now-empty code point might crash, display errors, or corrupt data.
So the ghosts stay. They are maintained, updated, and carried forward through every revision of the standard. They exist in the Unicode Consortium's master list. They are encoded in the fonts on the device you are reading this on right now. Somewhere in your computer's memory, 彁 is sitting quietly, waiting to be called upon — for a purpose that does not exist.
There is something profoundly poetic about this. In a culture shaped by もったいない (mottainai, the ethic of not wasting) and 供養 (kuyō, memorial rites for objects and even concepts), the ghost kanji are treated with a kind of institutional compassion. They cannot be explained, but they will not be discarded. They have been given a place, and that place will be honored.
What the Ghosts Reveal
The ghost kanji are funny. They are the bureaucratic equivalent of a typo that became law. But they also illuminate something deeper about the nature of writing systems.
Kanji are not fixed, eternal objects. They are living artifacts — shaped by the hands that write them, the eyes that read them, the machines that encode them. Every kanji that exists today was, at some point in history, invented by someone. The line between a "real" character and a "ghost" is thinner than we like to admit. If 妛 had been used in a novel, cited in a poem, or adopted by a community, it would have crossed from phantom to legitimate in a generation.
Writing is an act of collective agreement. A character is real because we agree it is real. The ghosts are simply characters for which no one ever signed the agreement — but the system filed them anyway.
And so they persist: silent, meaningless, officially recognized, and immortal. The kanji that no one can read, living forever in machines that will never forget them.
- Sasahara Hiroyuki's research papers and his book 『日本の漢字』 explore ghost kanji in academic depth.
- The JIS X 0208 standard document lists all characters, including ghosts, with their code points.
- The Unicode Consortium's CJK Unified Ideographs block preserves these characters for global computing.
A Final Haunting
Next time you marvel at the complexity of Japanese — its three scripts, its thousands of kanji, its endless layers of meaning — remember that even the system itself doesn't fully know what it contains. Somewhere in the vast architecture of the written language, there are doors that open onto empty rooms. Characters with no voice, no story, no reason to exist.
And yet they are there. Because in Japan, even a ghost deserves a place to stay.
Comments (0)
No comments yet. Be the first to share your thoughts!
Leave a Comment