Stuart Russell

👤 Speaker
See mentions of this person in podcasts
1598 total appearances
Voice ID

Voice Profile Active

This person's voice can be automatically recognized across podcast episodes using AI voice matching.

Voice samples: 1
Confidence: Medium

Podcast Appearances

The Diary Of A CEO with Steven Bartlett
The Man Who Wrote The Book On AI: 2030 Might Be The Point Of No Return! We've Been Lied To About AI!

And if you're giving a machine an objective which isn't aligned with what we truly want the future to be like, you're actually setting up a chess match.

And that match is one that you're going to lose when the machine is sufficiently intelligent.

And so that's problem number one.

Problem number two is that the kind of technology we're building now, we don't even know what its objectives are.

So it's not that we're specifying the objectives, but we're getting them wrong.

We are growing these systems.

They have objectives, but we don't even know what they are because we didn't specify them.

What we're finding through experiment with them is that they seem to have an extremely strong self-preservation objective.

What do you mean by that?

You can put them in hypothetical situations.

Either they're going to get switched off and replaced, or they have to allow someone, let's say someone has been locked in a machine room that's kept at three centigrade, so they're going to freeze to death.

They will choose to leave that guy locked in the machine room and die rather than be switched off themselves.

Someone's done that test?

Yeah.

Yep.

Well, they put them in these hypothetical situations and they allow the AI to decide what to do.

And it decides to preserve its own existence, let the guy die, and then lie about it.