Demis Hassabis

Dwarkesh Podcast

Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

Look, this is something Shane and I and many others here, we've had that at the forefront of our minds since before we started DeepMind.

Because we planned for success, crazy.

In 2010, no one was thinking about AI, let alone AGI.

But we already knew that if we could make progress with these systems and these ideas, the technology that would be created would be unbelievably transformative.

So we already were thinking 20 years ago about, well, what would the consequences of that be, both positive and negative?

Of course, the positive direction is amazing science, things like AlphaFold, incredible breakthroughs in health and science and maths and discovery, scientific discovery.

But then also we've got to make sure these systems are sort of understandable and controllable.

And I think there are several, this would be a whole discussion in itself, but there are many, many ideas that people have, from much more stringent eval systems.

I think we don't have good enough evaluations and benchmarks for things like, can the system deceive you?

Can it exfiltrate its own code?

Sort of undesirable behaviors.

And then there are ideas of actually using AI, maybe narrow AIs, so not general learning ones, but systems that are specialized for a domain, to help us, as the human scientists, analyze and summarize what the more general system is doing.

Narrow AI tools.

I think that there's a lot of promise in creating hardened sandboxes or simulations that are hardened with cybersecurity arrangements around the simulation, both to keep the AI in, but also as cybersecurity to keep hackers out.

Then you could experiment a lot more freely within that sandbox domain.

And I think a lot of these ideas are, and there's many, many others, including the analysis stuff we talked about earlier, where can we analyze and understand what the concepts are that this system is building, what the representations are, so maybe they're not so alien to us and we can actually keep track of the kind of knowledge that it's building.
