Demis Hassabis
Podcast Appearances
Yeah, it was trained off of video and some synthetic data from game engines.
And it's just reverse engineered it.
And for me, it's very close to my heart, this project, but it's also quite mind-blowing because in the 90s, in my early career, I used to write video games and AI for video games and graphics engines.
And I remember how hard it was to do this by hand, program all the polygons and the physics engines.
And it's amazing to just see this do it effortlessly.
All of the reflections on the water and the way materials flow and objects behave.
And it's just doing that all out of the box.
So the reason we're building these kinds of models is, we feel, and we've always felt... we're obviously progressing on the normal language models, like with our Gemini model, but from the beginning with Gemini we wanted it to be multimodal. So we wanted it to take any kind of input (images, audio, video), and it can output anything.
And so we've been very interested in this because for an AI to be truly general, to build AGI, we feel that the AGI system needs to understand the world around us and the physical world around us, not just the abstract world of languages or mathematics.
And of course, that's what's critical for robotics to work.
It's probably what's missing from it today.
And also things like smart glasses, a smart glasses system that helps you in your everyday life.
It's got to understand the physical context that you're in and how the intuitive physics of the world works.
So we think that building these types of models, these Genie models and also Veo, the best text-to-video models, those are expressions of us building world models that understand the dynamics of the world, the physics of the world.
If you can generate it, then that's an expression of your system understanding those dynamics.
Yeah, that's right.
So if you look at our Gemini Live version of Gemini, where you can hold up your phone to the world around you, I'd recommend any of you try it.
It's kind of magical what it already understands about the physical world.