Google DeepMind has revealed Genie 3, its latest foundation world model that the AI lab says presents a crucial stepping stone on the path to artificial general intelligence, or human-like intelligence.
“Genie 3 is the first real-time interactive general purpose world model,” Shlomi Fruchter, a research director at DeepMind, said during a press briefing. “It goes beyond narrow world models that existed before. It’s not specific to any particular environment. It can generate both photo-realistic and imaginary worlds, and everything in between.”
Genie 3, which is still in research preview and not publicly available, builds on both its predecessor Genie 2 – which can generate new environments for agents – and DeepMind’s latest video generation model Veo 3 – which exhibits a deep understanding of physics.
Image Credits:Google DeepMind
With a simple text prompt, Genie 3 can generate multiple minutes – up from 10 to 20 seconds in Genie 2 – of diverse, interactive, 3D environments at 24 frames per second with a resolution of 720p. The model also features “promptable world events,” or the ability to use a prompt to change the generated world.
Perhaps most importantly, Genie 3’s simulations stay physically consistent over time because the model is able to remember what it had previously generated – an emergent capability that DeepMind researchers didn’t explicitly program into the model.
Fruchter said that while Genie 3 clearly has implications for educational experiences and new generative media like gaming or prototyping creative concepts, its real unlock will manifest in training agents for general purpose tasks, which he said is essential to reaching AGI.
“We think world models are key on the path to AGI, specifically for embodied agents, where simulating real world scenarios is particularly challenging,”Jack Parker-Holder, a research scientist on DeepMind’s open-endedness team, said during a briefing.
Techcrunch event Tech and VC heavyweights join the Disrupt 2025 agenda Netflix, ElevenLabs, Wayve, Sequoia Capital — just a few of the heavy hitters joining the Disrupt 2025 agenda. They’re here to deliver the insights that fuel startup growth and sharpen your edge. Don’t miss the 20th anniversary of TechCrunch Disrupt, and a chance to learn from the top voices in tech — grab your ticket now and save up to $675 before prices rise on August 7. Tech and VC heavyweights join the Disrupt 2025 agenda Netflix, ElevenLabs, Wayve, Sequoia Capital — just a few of the heavy hitters joining the Disrupt 2025 agenda. They’re here to deliver the insights that fuel startup growth and sharpen your edge. Don’t miss the 20th anniversary of TechCrunch Disrupt, and a chance to learn from the top voices in tech — grab your ticket now and save up to $675 before prices rise. San Francisco | REGISTER NOW
Image Credits:Google DeepMind
... continue reading