Google DeepMind releases its Genie 3 model, which can generate 3D worlds from a prompt and has enough visual memory for a few minutes of continuous interaction
AI world models have many limitations, but Genie 3 offers improvements like real-time interaction and better memory.
The Verge Jay Peters
Related Coverage
- Genie 3: A new frontier for world models Google DeepMind
- DeepMind thinks its new Genie 3 world model presents a stepping stone toward AGI TechCrunch · Rebecca Bellan
- Google outlines latest step towards creating artificial general intelligence The Guardian · Dan Milmo
- DeepMind reveals Genie 3 “world model” that creates real-time interactive simulations Ars Technica · Ryan Whitwam
- Google DeepMind's Genie 3 can dynamically alter the state of its simulated worlds Engadget · Igor Bonifacic
- Google DeepMind's Genie 3 Gets One Step Closer Toward Interactive Worlds That Feel Alive TechEBlog · Jackson Chung
- Google DeepMind Unveils Genie 3, an AI That Generates Playable 3D Worlds in Real Time WinBuzzer · Markus Kasanmascheff
- Google DeepMind Unveils Genie 3 as Step Toward AGI With Real-Time 3D World Generation Analytics India Magazine · Siddharth Jindal
- Genie 3: A new frontier for world models Hacker News
Discussion
-
@timkellogg.me
Tim Kellogg
on bluesky
Genie 3: A general world model — Google announced Genie 3, a world model that can generate 3D scenes in real-time, meaning that it can be used to create 3D experiences that you can immediately use. Like a dynamic video game limited by your own imagination — deepmind.google/d…
-
@philpax.me
@philpax.me
on bluesky
well, then; I knew that world models would begin to approach the holodeck experience, but I wasn't expecting for it to happen so soon deepmind.google/discover/blo...
-
@officiallogank
Logan Kilpatrick
on x
Introducing Genie 3, the most advanced world simulator ever created, enabled by numerous research breakthroughs. 🤯 Featuring high fidelity visuals, 20-24 fps, prompting on the go, world memory, and more. [video]
-
@googledeepmind
@googledeepmind
on x
What if you could not only watch a generated video, but explore it too? 🌐 Genie 3 is our groundbreaking world model that creates interactive, playable environments from a single text prompt. From photorealistic landscapes to fantasy realms, the possibilities are endless. 🧵 [video…
-
@googledeepmind
@googledeepmind
on x
🔘 Real-time capabilities Genie 3 is our first world model to allow live interaction, while also improving consistency and realism compared to Genie 2. It can generate dynamic worlds at 720p and 24 FPS, with each frame created in response to user actions. [video]
-
@alex_peys
Alex Peysakhovich
on x
its no secret that i love world models, and this seems *stunningly good*, gdm is really dropping crazy things these days. can we have it in the gemini app?
-
@shanelegg
Shane Legg
on x
I have to admit, when I saw them generating flakey two second clips of 2D platform games... I did not expect this level of performance just two years later! Wow! Well done Genie Team.
-
@googledeepmind
@googledeepmind
on x
🔘 Long-horizon consistency Environments created remain largely consistent over several minutes, with visual memory extending as far as 1️⃣ minute in the past. This ability is critical to enable AI agents to learn about the world, and provides humans with an immersive [video]
-
@sethbannon
Seth Bannon
on x
Would have thought we were 1+ years away from this. The real-time rendering + the memory is super impressive. Wild times!
-
@pfau
David Pfau
on x
This is really impressive work, and congrats to the team, but as an aside...does anyone else find it weird that “world model” evolved from meaning “the minimal model needed to plan in an environment” to “action-conditional video model”?
-
@sundarpichai
Sundar Pichai
on x
Genie 3 is 🔥
-
@parkerortolani
Parker Ortolani
on x
one thing that's worth noting about models like Genie is that they could become a huge advantage for Google in the headset space when up against Apple, who as we all know, hasn't gotten itself together on text and static image models yet
-
@drjimfan
@drjimfan
on x
This is game engine 2.0. Some day, all the complexity of UE5 will be absorbed by a data-driven blob of attention weights. Those weights take as input game controller commands and directly animate a spacetime chunk of pixels. Agrim and I were close friends and coauthors back at