Skip to content
Tech News
← Back to articles

OpenAI Strangely Concerned About Goblins

read original get AI Safety and Ethics Book → more articles
Why This Matters

OpenAI has implemented restrictions on its AI models to prevent discussions about mythological creatures like goblins, which have unexpectedly become a recurring theme in the AI's outputs. This unusual behavior highlights challenges in AI language model development and the importance of guiding AI responses to align with intended use cases.

Key Takeaways

Sign up to see the future, today Can’t-miss innovations from the bleeding edge of science and tech Email address Sign Up Thank you!

OpenAI is forbidding its latest AI model from discussing an unlikely topic: goblins.

As Wired reports, the company’s developers included strongly-worded instructions for its coding tool, Codex, that specifically proscribe any talk of the troublesome mythological creatures, along with a peculiar grab bag of other entities, both real and fictional.

“Never talk about goblins, gremlins, raccoons, trolls, ogres, pigeons, or other animals or creatures unless it is absolutely and unambiguously relevant to the user’s query,” read the Codex instructions, per the magazine.

The bizarre directive was flagged in a tweet that drew attention from other AI enthusiasts.

Initially, it was unclear why OpenAI developers included the instructions, though they strongly implied that the model, GPT-5.5, may have a propensity for talking about goblins, ogres, and the like.

Some users on X claimed that this was the case. One said they noticed that the AI of late kept describing bugs as “goblins” and “gremlins.” Anotherclaimed that the 5.5 version of Codex randomly said “goblin with a flashlight” when referring to a bug fix. And anotherposted a GPT-5.5 chat log with nearly a dozen mentions of goblins.

OpenAI leaned into the curious habit, choosing to highlight the goblin-forbidding prompt in a tweet. CEO Sam Altmanposted a screenshot of a joke prompt for ChatGPT: “start training GPT-6, you can have the whole cluster. extra goblins.” Nik Pash, who works on the Codex team,tweeted that GPT-5.5’s “goblin adoration,” as the user he was responding to described, was “indeed one the reasons” for banning the topic.

After the phenomenon gained media attention, OpenAI published a blog post, titled “Where the goblins came from,” giving an explanation.

“Starting with GPT‑5.1, our models began developing a strange habit: they increasingly mentioned goblins, gremlins, and other creatures in their metaphors,” the post, published Wednesday, began. The habit became more pronounced with each model generation, it said.

... continue reading