Demis Hassabis

Dwarkesh Podcast

Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

But as for going forwards, I think that there's still a lot of interesting ideas

things to be resolved around planning and how does the brain construct the right world models?

I studied, for example, how the brain does imagination, or you can think of it as mental simulation.

So how do we create very rich visuospatial simulations of the world in order for us to plan better?

I think that's a super promising direction in my opinion.

So, you know, we've got to carry on improving the large models and we've got to carry on basically making them more and more accurate predictors of the world.

So in effect, making them more and more reliable world models, that's clearly a necessary, but I would say probably not sufficient component of an AGI system.

And then on top of that, I would, you know, we're working on things like AlphaZero-like planning mechanisms on top that make use of that model in order to make concrete plans to achieve certain goals in the world and perhaps sort of chain thought together or lines of reasoning together and maybe use search to kind of explore massive spaces of possibility.

I think that's kind of missing from our current large models.

Well, I mean, one thing is Moore's law tends to help if every year, of course, more computation comes in.

But we focus a lot on sample efficient methods and reusing existing data, things like experience replay, and also just looking at more efficient ways.
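
Experience replay, mentioned here as one way of reusing existing data, can be sketched in a few lines of Python. This is a minimal illustration of the general idea, not the system discussed in the episode: the class name `ReplayBuffer`, its methods, the transition tuple layout, and the fixed seed are all assumptions made for the example.

```python
import random
from collections import deque


class ReplayBuffer:
    """Hypothetical fixed-size store of past transitions for reuse in training."""

    def __init__(self, capacity, seed=0):
        # A deque with maxlen drops the oldest transition once capacity is reached.
        self.buffer = deque(maxlen=capacity)
        self.rng = random.Random(seed)

    def add(self, transition):
        # A transition is assumed to be (state, action, reward, next_state).
        self.buffer.append(transition)

    def sample(self, batch_size):
        # Draw a random batch without replacement, so each stored transition
        # can be reused across many updates instead of being seen once.
        k = min(batch_size, len(self.buffer))
        return self.rng.sample(list(self.buffer), k)

    def __len__(self):
        return len(self.buffer)


if __name__ == "__main__":
    buf = ReplayBuffer(capacity=3)
    for step in range(5):
        buf.add((step, 0, 0.0, step + 1))
    print(len(buf), buf.sample(2))
```

Sampling old transitions at random, rather than training only on the most recent one, is what lets the same data contribute to many updates.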

I mean, the better your world model is, the more efficient your search can be.

So one example I always give with AlphaZero, our system to play Go and chess and any game, is that it's stronger than world champion level, human world champion level at all these games.

And it uses a lot less search than a brute force method like Deep Blue, say, to play chess.

Deep Blue, one of these traditional Stockfish or Deep Blue systems would maybe look at millions of possible moves for every decision it's going to make.

AlphaZero and AlphaGo looked at around tens of thousands of possible positions in order to make a decision about what to move next.

But a human grandmaster, a human world champion probably only looks at a few hundreds of moves, even the top ones, in order to make their very good decision about what to play next.

So that suggests that obviously the brute force systems don't have any real model other than the heuristics about the game.
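
The point that a better model lets a search look at far fewer positions can be illustrated with a toy example. The Python sketch below is not AlphaZero's algorithm (which uses Monte Carlo tree search guided by a neural network); it is a plain depth-limited search on an invented number game, where a hypothetical `prior` function stands in for the model and only the `top_k` moves it ranks highest are expanded. The game, `TARGET`, and all function names are assumptions made for illustration.

```python
TARGET = 10  # toy goal: reach this total in exactly four moves


def moves(state):
    # Every position offers the same five moves: add 0, 1, 2, 3 or 4.
    return [0, 1, 2, 3, 4]


def value(state):
    # Final score: closer to TARGET is better, 0 is a perfect result.
    return -abs(TARGET - state)


def prior(state, move):
    # Stand-in for a learned model: prefer moves that land nearer TARGET.
    return -abs(TARGET - (state + move))


def search(state, depth, prior_fn=None, top_k=None):
    """Depth-limited maximizing search; returns (best_value, nodes_expanded)."""
    if depth == 0:
        return value(state), 1
    candidates = moves(state)
    if prior_fn is not None and top_k is not None:
        # Guided search: keep only the moves the model ranks highest.
        ranked = sorted(candidates, key=lambda m: prior_fn(state, m), reverse=True)
        candidates = ranked[:top_k]
    best, nodes = float("-inf"), 1
    for m in candidates:
        v, n = search(state + m, depth - 1, prior_fn, top_k)
        best = max(best, v)
        nodes += n
    return best, nodes


if __name__ == "__main__":
    print("brute force:", search(0, 4))
    print("guided     :", search(0, 4, prior_fn=prior, top_k=2))
```

In this toy both searches find a perfect result, but the guided one expands a small fraction of the nodes. That only holds because the prior here happens to be a good guide; with a poor prior, pruning to `top_k` moves could miss the best line entirely.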
