Menu
Sign In Search Podcasts Libraries Charts People & Topics Add Podcast API Blog Pricing

Demis Hassabis

πŸ‘€ Speaker
See mentions of this person in podcasts
3240 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

Look, this is something Shane and I and many others here, we've had that forefront of our minds for

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

since before we started DeepMind.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

Because we planned for success, crazy.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

In 2010, no one was thinking about AI, let alone AGI.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

But we already knew that if we could make progress with these systems and these ideas, the technology that would be created would be unbelievably transformative.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

So we already were thinking 20 years ago about, well, what would the consequences of that be, both positive and negative?

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

Of course, the positive direction is amazing science, things like AlphaFold, incredible breakthroughs in health and science and maths and discovery, scientific discovery.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

But then also we got to make sure these systems are sort of understandable and controllable.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

And I think there's sort of several, this would be a whole sort of discussion in itself, but

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

There are many, many ideas that people have from much more stringent eval systems.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

I think we don't have good enough evaluations and benchmarks for things like, can the system deceive you?

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

Can it exfiltrate its own code?

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

Sort of undesirable behaviors.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

And then there's ideas of actually using AI, maybe narrow AIs, so not general learning ones, but systems that are specialized for a domain to help us as the

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

the human scientists, analyze and summarize what the more general system is doing.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

Narrow AI tools.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

I think that there's a lot of promise in creating hardened sandboxes or simulations that are hardened with cybersecurity arrangements around the simulation, both to keep the AI in, but also as cybersecurity to keep hackers out.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

Then you could experiment a lot more

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

freely within that sandbox domain.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

And I think a lot of these ideas are, and there's many, many others, including the analysis stuff we talked about earlier, where can we analyze and understand what the concepts are that this system is building, what the representations are, so maybe they're not so alien to us and we can actually keep track of the kind of knowledge that it's building.