Skip to content
Tech News
← Back to articles

An Entire Wikipedia That's 100% AI Hallucinations

read original get AI-Generated Wikipedia Book → more articles
Why This Matters

Halupedia exemplifies the growing concern over AI-generated content, highlighting how large language models can produce entirely fabricated information that appears credible. This raises important questions about the reliability of AI-driven knowledge bases and the potential impact on information integrity in the tech industry and for consumers. While it serves as a playful experiment, it underscores the need for better verification and transparency in AI-generated content.

Key Takeaways

"Every link leads to an entry that does not exist yet," explains the GitHub page for a Wikipedia-like site called Halupedia. "Until you click it, at which point an LLM pretends it has always existed and writes it for you, in the deadpan register of a 19th-century scholarly press..."

Every article is invented on demand. The footnotes are also lies... The hardest problem with an infinite, on-demand encyclopedia is internal contradiction... When the LLM writes an article, it is required to add a context="..." attribute on every <a> it inserts, summarising the future article it is linking to (e.g. context="19th-century clerk who formalized footnote drift, Pellbrick's mentor")... When that target article is later requested for the first time, the worker loads the accumulated hints and injects them into the system prompt as "PRIOR REFERENCES — these are CANON". The LLM is instructed that the encyclopedia is hallucinated and absurd, but it must not contradict itself.

Fast Company reports that Halupedia was created by software developer BartÅomiej Strama, who confessed in a Reddit comment that the site came about after a drunk night with a friend. In the week since launch, he says Halupedia has amassed more than 150,000 users."

Beyond indulging in silly alternate histories, what's the point of using Halupedia? Strama hinted at one larger purpose in a reply to a donor on his Buy Me a Coffee page: "Your contribution towards polluting LLM training data will surely benefit society!" he wrote.

The site is licensed as free software under the GPL-3.0 license.

Thanks to long-time Slashdot reader schwit1 for sharing the news.

Read more of this story at Slashdot.