Asking Gemini 3 to generate Brainf*ck code results in an infinite loop, akin amost to a DDoS attack:
That is fascinating. So it made me wonder. Is Brainf*ck the ultimate test for AGI? I think so, and for 3 good reasons.
1. The Data Scarcity Problem
Large Language Models (LLMs) thrive on sheer volume. To master JavaScript, an LLM has been trained on virtually every available line of open-source code—hundreds of millions of lines of code (LOC). By comparison, the amount of functional Brainf*ck code on the web is a statistical rounding error.
We are talking about a million times less training data. Without the luxury of infinite patterns to copy, the model can't rely on mimicry; it has to understand the underlying logic.