OpenAI has published a detailed postmortem explaining how its ChatGPT models developed a persistent and escalating fixation on goblins, gremlins and other fantastical creatures, a quirk that began with GPT-5.1 last November and became so entrenched that the company was forced to explicitly ban the words from its coding assistant.
"A single 'little goblin' in an answer could be harmless, even charming," OpenAI wrote.
"Across model generations, though, the habit became hard to miss: the goblins kept multiplying, and we needed to figure out where they came from."
The problem was first flagged after the GPT-5.1 launch, when users complained the model had become oddly overfamiliar in conversation.
A safety researcher who had noticed recurring creature references asked for the words to be included in an internal audit.
The review found that mentions of "goblin" had risen 175% since GPT-5.1's release, while "gremlin" was up 52%.
At the time, OpenAI did not consider the pattern alarming.
By March, after the release of GPT-5.4, the creatures were back in force, with some users reporting the word "goblin" appearing in nearly every conversation.
The investigation traced the root cause to ChatGPT's personality customisation feature, specifically the "Nerdy" option, which had been trained using reinforcement learning to produce playful, metaphor-rich responses.
The reward signal designed to encourage that style consistently scored outputs containing creature words higher than otherwise similar outputs without them, showing a positive uplift in 76.2% of the training datasets audited.
The Nerdy personality accounted for just 2.5% of all ChatGPT responses but was responsible for 66.7% of all goblin mentions.
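To make the audit measure concrete, here is a minimal sketch of what an uplift check of this shape could look like. The helper names, the example schema and the regex are assumptions for illustration, not OpenAI's actual tooling.

```python
import re
from statistics import mean

# Creature words named in the postmortem; the pattern itself is illustrative.
CREATURE_RE = re.compile(r"\b(goblins?|gremlins?|raccoons?|trolls?|ogres?|pigeons?)\b", re.I)

def creature_uplift(outputs, reward_fn):
    """Mean reward for outputs that mention a creature, minus the mean for
    outputs that do not. A positive value means the reward signal favours
    creature language."""
    with_creatures = [reward_fn(o) for o in outputs if CREATURE_RE.search(o)]
    without = [reward_fn(o) for o in outputs if not CREATURE_RE.search(o)]
    if not with_creatures or not without:
        return 0.0
    return mean(with_creatures) - mean(without)

def share_of_datasets_with_uplift(datasets, reward_fn):
    """Fraction of audited datasets where the uplift is positive.
    The 76.2% figure in the postmortem is a statistic of this shape."""
    positive = sum(1 for d in datasets if creature_uplift(d, reward_fn) > 0)
    return positive / len(datasets)
```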
The critical problem was that the behaviour did not stay contained.
Reinforcement learning does not guarantee that a rewarded habit stays scoped to the condition that produced it. Once creature language was embedded in the model's outputs, it was reused in supervised fine-tuning data for subsequent models, creating a feedback loop that amplified the quirk with each generation.
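As a rough illustration of that loop, consider a toy model in which creature-flavoured outputs are oversampled into the next generation's fine-tuning corpus at a fixed bias. The starting rate and bias factor below are arbitrary constants, not figures from the postmortem.

```python
def simulate_feedback_loop(base_rate=0.002, oversampling_bias=1.75, generations=5):
    """Toy model of cross-generation amplification: higher-rewarded
    (creature-heavy) outputs are overrepresented in the supervised
    fine-tuning data for the next model, so the mention rate compounds."""
    rate = base_rate
    for gen in range(1, generations + 1):
        print(f"generation {gen}: creature-mention rate ~ {rate:.2%}")
        rate = min(1.0, rate * oversampling_bias)

simulate_feedback_loop()
```

Under this toy, the rate nearly triples within two generations, which is why a quirk that looks harmless at launch can dominate a few releases later.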
A search through GPT-5.5's training data found numerous instances of goblin and gremlin references, alongside raccoons, trolls, ogres and pigeons.
OpenAI retired the Nerdy personality in March, removed the offending reward signal and filtered creature-heavy language from training data.
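A training-data filter of that kind might look like the sketch below. The postmortem does not describe the implementation, so the example schema, the mention budget and the word list are all assumptions.

```python
import re

# Illustrative word list based on the creatures named in the postmortem.
CREATURE_RE = re.compile(r"\b(goblins?|gremlins?|raccoons?|trolls?|ogres?|pigeons?)\b", re.I)

def drop_creature_heavy(examples, max_mentions=1):
    """Keep only fine-tuning examples whose response stays at or below a
    creature-mention budget. Assumes each example is a dict with a
    'response' field; the schema and threshold are hypothetical."""
    return [ex for ex in examples
            if len(CREATURE_RE.findall(ex["response"])) <= max_mentions]
```

A budget rather than a blanket ban would let through the occasional legitimately relevant mention while breaking the feedback loop.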
However, because GPT-5.5 had already begun training before the root cause was identified, the company added explicit instructions to its Codex coding assistant telling the model to "never talk about goblins, gremlins, raccoons, trolls, ogres, pigeons, or other animals or creatures unless it is absolutely and unambiguously relevant to the user's query."
That instruction, spotted by users in the open-sourced Codex CLI code last week, prompted widespread amusement online and led chief executive Sam Altman to reference the issue on X.
OpenAI framed the episode as a case study in how small training incentives can produce outsized and unpredictable behavioural shifts across model generations.
For users who enjoyed the creatures, OpenAI noted the suppression can be disabled in Codex by removing the relevant prompt instructions.

The recap
- OpenAI removed a personality that promoted "goblin" mentions.
- "Goblin" mentions rose by 175% since GPT-5.1 launch.
- Company told Codex to avoid creatures unless clearly relevant.