
Eliezer Yudkowsky

Speaker
1713 total appearances

Podcast Appearances

Lex Fridman Podcast
#368 – Eliezer Yudkowsky: Dangers of AI and the End of Human Civilization

But that doesn't mean that you're correct and justified in letting everything slide.

It means that things are in a horrible state, getting worse, and there's nothing you can do about it.

Yeah, how can you tell if you're making progress?

You can put them all on interpretability, because when you have an interpretability result, you can tell that it's there.

But interpretability alone is not going to save you.
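
As a toy illustration of that checkability (my sketch; nothing here is from the episode): fit a linear probe that reads a known feature out of synthetic "activations". The probe's accuracy is a concrete, measurable result, which is the sense in which you can tell the feature is there.

    # Linear-probe toy on synthetic activations; purely illustrative.
    import numpy as np

    rng = np.random.default_rng(0)
    feature = rng.integers(0, 2, size=1000)   # ground-truth binary feature
    acts = rng.normal(size=(1000, 32))        # fake hidden activations
    acts[:, 7] += 3.0 * feature               # plant the feature in unit 7

    # Least-squares linear probe, then score it.
    w, *_ = np.linalg.lstsq(acts, feature.astype(float), rcond=None)
    pred = (acts @ w) > 0.5
    print("probe accuracy:", (pred == feature).mean())  # near 1.0 here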

We need systems that have a pause button and won't try to prevent you from pressing it.

Because they reason: well, I can't get my stuff done if I'm paused.

And that's a more difficult problem.

But it's a fairly crisp problem, and you can maybe tell if somebody's made progress on it.

So you can work on the pause problem.
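
A minimal sketch of why the pause problem is crisp, under assumptions that are mine rather than the episode's: a toy expected-utility agent that only values finishing its task computes that disabling the pause button dominates allowing it, while a "pause-indifferent" variant, compensated so that being paused costs it nothing, has no incentive to fight the button. This gestures at utility-indifference ideas; it is not a solution.

    # Toy model of the pause-button (off-switch) problem. All names and
    # numbers are illustrative, not from the episode.

    def task_utility(paused: bool) -> float:
        """Utility from the task: 1.0 if the agent runs, 0.0 if paused."""
        return 0.0 if paused else 1.0

    def naive_agent(p_pause: float) -> str:
        # Naive task-maximizer: compare expected utility of leaving the
        # button alone vs. disabling it.
        allow = (1 - p_pause) * task_utility(False) + p_pause * task_utility(True)
        disable = task_utility(False)  # button disabled: never paused
        return "allow pause" if allow >= disable else "disable button"

    def indifferent_agent(p_pause: float) -> str:
        # Pause-indifferent variant: compensate the paused outcome so it
        # is worth exactly as much as running, removing the incentive to
        # resist (ties broken in favor of deferring to the human).
        comp = task_utility(False) - task_utility(True)
        allow = (1 - p_pause) * task_utility(False) \
              + p_pause * (task_utility(True) + comp)
        disable = task_utility(False)
        return "allow pause" if allow >= disable else "disable button"

    if __name__ == "__main__":
        for p in (0.1, 0.9):
            print(f"p(pause)={p}: naive -> {naive_agent(p)}, "
                  f"indifferent -> {indifferent_agent(p)}")

For any nonzero pause probability the naive agent disables the button, which is the crisp failure the quote describes; whether a proposed scheme removes that incentive is something you can actually check.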

I don't actually like the term "control problem" because it sounds kind of controlling. Alignment, not control.

You're not trying to take a thing that disagrees with you and whip it back into line, making it do what you want even though it wants to do something else.

You're trying to, in the process of its creation, choose its direction.

It's not smart enough to prevent you from pressing the off switch and probably not smart enough to want to prevent you from pressing the off switch.

They're just not opposing your attempt to pull the off switch.

Parenthetically: don't kill the system.

If we're getting to the part where this starts to actually matter, where they can fight back, don't kill them and dump their memory; save them to disk.

Don't kill them.
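
Read operationally, "save them to disk" is checkpointing rather than deletion. A minimal sketch, assuming PyTorch; the model and filename are stand-ins of mine:

    # Persist a model's parameters instead of discarding them.
    import torch
    import torch.nn as nn

    model = nn.Linear(16, 4)  # stand-in for the system being retired
    torch.save(model.state_dict(), "retired_system.pt")

    # The state can later be restored into the same architecture.
    restored = nn.Linear(16, 4)
    restored.load_state_dict(torch.load("retired_system.pt"))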