Demis Hassabis
Podcast Appearances
Now, it's not clear to me there is a limit now with just sort of passive perception.
Now, the interesting thing is that I think this has huge consequences for robots as an embodied intelligence, as an application, because the types of models we've built, Gemini and also now Veo, and we'll be combining those things together at some point in the future, is we've always built Gemini, our foundation model, to be multimodal from the beginning.
And the reason we did that, and we still lead on all the multimodal benchmarks, is twofold. One is we have a vision for this idea of a universal digital assistant, an assistant that goes around with you on the digital devices, but also in the real world, maybe on your phone or a glasses device, and actually helps you
in the real world, like recommend things to you, help you navigate around, help with physical things in the world, like cooking, stuff like that. And for that to work, you obviously need to understand the context that you're in. It's not just the language I'm typing into a chatbot. You actually have to understand the 3D world I'm living in, right?
I think to be a really good assistant, you need to do that. But the second thing is, of course, that this is exactly what you need for robotics as well. And we released our first big sort of Gemini Robotics work, which has caused a bit of a stir.
And that's the beginning of showcasing what we can do with these multimodal models that do understand physics of the world with a little bit of robotics fine-tuning on top to do with the actions, the motor actions and the planning a robot needs to do. And it looks like it's going to work.
So actually now I think these general models are actually going to transfer to the embodied robotic setting without too much extra sort of special casing or extra data or extra effort, which is probably not what most people, even the top roboticists would have predicted five years ago.