Eliezer Yudkowsky

Lex Fridman Podcast

#368 – Eliezer Yudkowsky: Dangers of AI and the End of Human Civilization

To train a system to win chess games, you have to be able to tell whether a game has been won or lost.

4960.032 View full episode →

Lex Fridman Podcast

#368 – Eliezer Yudkowsky: Dangers of AI and the End of Human Civilization

And until you can tell whether it's been won or lost, you can't update the system.

4967.601 View full episode →

Lex Fridman Podcast

#368 – Eliezer Yudkowsky: Dangers of AI and the End of Human Civilization

So the problem I see...

5016.329 View full episode →

Lex Fridman Podcast

#368 – Eliezer Yudkowsky: Dangers of AI and the End of Human Civilization

is that your typical human has a great deal of trouble telling whether I or Paul Christiano is making more sense.

5018.251 View full episode →

Lex Fridman Podcast

#368 – Eliezer Yudkowsky: Dangers of AI and the End of Human Civilization

And that's with two humans, both of whom I believe of Paul and claim of myself, are sincerely trying to help, neither of whom is trying to deceive you.

5027.631 View full episode →

Lex Fridman Podcast

#368 – Eliezer Yudkowsky: Dangers of AI and the End of Human Civilization

I believe of Paul and claim of myself.

5036.847 View full episode →

Lex Fridman Podcast

#368 – Eliezer Yudkowsky: Dangers of AI and the End of Human Civilization

So the deception thing is the problem for you, the manipulation, the alien actress.

5041.131 View full episode →

Lex Fridman Podcast

#368 – Eliezer Yudkowsky: Dangers of AI and the End of Human Civilization

So yeah, there's like two levels of this problem.

5047.276 View full episode →

Lex Fridman Podcast

#368 – Eliezer Yudkowsky: Dangers of AI and the End of Human Civilization

One is that the weak systems are, well, there's three levels of this problem.

5049.698 View full episode →

Lex Fridman Podcast

#368 – Eliezer Yudkowsky: Dangers of AI and the End of Human Civilization

There's like the weak systems that just don't make any good suggestions.

5054.563 View full episode →

Lex Fridman Podcast

#368 – Eliezer Yudkowsky: Dangers of AI and the End of Human Civilization

There's like the middle systems where you can't tell if the suggestions are good or bad.

5058.847 View full episode →

Lex Fridman Podcast

#368 – Eliezer Yudkowsky: Dangers of AI and the End of Human Civilization

And there's the strong systems that have learned to lie to you.

5062.85 View full episode →

Lex Fridman Podcast

#368 – Eliezer Yudkowsky: Dangers of AI and the End of Human Civilization

I would love to dance around it.

5107.634 View full episode →

Lex Fridman Podcast

#368 – Eliezer Yudkowsky: Dangers of AI and the End of Human Civilization

No, I'm probably not doing a great job of explaining.

5110.061 View full episode →

Lex Fridman Podcast

#368 – Eliezer Yudkowsky: Dangers of AI and the End of Human Civilization

Which I can tell, because the Lex system didn't output like, ah, I understand.

5116.413 View full episode →

Lex Fridman Podcast

#368 – Eliezer Yudkowsky: Dangers of AI and the End of Human Civilization

So now I'm trying a different output to see if I can elicit the... Well, no, a different output.

5123.847 View full episode →

Lex Fridman Podcast

#368 – Eliezer Yudkowsky: Dangers of AI and the End of Human Civilization

I'm being trained to output things that make Lex think that he understood what I'm saying and agree with me.

5129.838 View full episode →

Lex Fridman Podcast

#368 – Eliezer Yudkowsky: Dangers of AI and the End of Human Civilization

Help me out here.

5140.935 View full episode →

Lex Fridman Podcast

#368 – Eliezer Yudkowsky: Dangers of AI and the End of Human Civilization

I'm trying not to be.

5145.16 View full episode →

Lex Fridman Podcast

#368 – Eliezer Yudkowsky: Dangers of AI and the End of Human Civilization

I'm also trying to be constrained to say things that I think are true and not just things that get you to agree with me.

5147.102 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment