Menu
Sign In Search Podcasts Libraries Charts People & Topics Add Podcast API Blog Pricing

Demis Hassabis

πŸ‘€ Speaker
See mentions of this person in podcasts
3240 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

But as for going forwards, I think that there's still a lot of interesting ideas

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

things to be resolved around planning and how does the brain construct the right world models?

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

I studied, for example, how the brain does imagination, or you can think of it as mental simulation.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

So how do we create very rich visual spatial simulations of the world in order for us to plan better?

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

I think that's a super promising direction in my opinion.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

So, you know, we've got to carry on improving the large models and we've got to carry on basically making them more and more accurate predictors of the world.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

So in effect, making them more and more reliable world models, that's clearly a necessary, but I would say probably not sufficient component of an AGI system.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

And then on top of that, I would, you know, we're working on things like

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

alpha zero-like planning mechanisms on top that make use of that model in order to make concrete plans to achieve certain goals in the world and perhaps sort of chain thought together or lines of reasoning together and maybe use search to kind of explore massive spaces of possibility.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

I think that's kind of missing from our current large models.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

Well, I mean, one thing is Moore's law tends to help if every year, of course, more computation comes in.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

But we focus a lot on sample efficient methods and reusing existing data, things like experience replay, and also just looking at more efficient ways.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

I mean, the better your world model is, the more efficient your search can be.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

So one example I always give with AlphaZero, our system to play Go and chess and any game, is that it's stronger than world champion level, human world champion level at all these games.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

And it uses a lot less search than a brute force method like Deep Blue, say to play chess.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

Deep Blue, one of these traditional Stockfish or Deep Blue systems would maybe look at millions of possible moves for every decision it's going to make.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

AlphaZero and AlphaGo looked at around tens of thousands of possible positions in order to make a decision about what to move next.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

But a human grandmaster, a human world champion probably only looks at a few hundreds of moves, even the top ones, in order to make their very good decision about what to play next.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

So that suggests that...

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

obviously the brute force systems don't have any real model other than the heuristics about the game.