Dwarkesh

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

So for example, people used to think that AI would surely be truly intelligent if it solved chess, and then it solved chess, and you're like, no, that's just algorithms.

5276.137 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

And then they said, well, maybe it would be truly intelligent if it could do philosophy, and then it could write philosophical discourses.

5284.108 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

We were like, no, we just understand those are algorithms.

5290.778 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

I think there's going to be, I think there already is something similar with like, is the AI misaligned?

5293.702 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

Is the AI evil?

5298.388 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

Where there's this kind of distant idea of some evil AI, but then whenever something goes wrong, people are just like, oh, that's the algorithm.

5299.99 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

So for example, I think like 10 years ago,

5310.945 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

You had asked, like, when will we know that misalignment is really an important thing to worry about?

5313.749 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

People would say, oh, if the AI ever lies to you.

5318.816 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

Of course, AIs lie to people all the time now, and everybody just kind of dismisses it because we understand why it happens.

5321.419 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

It's a thing that would obviously happen based on our current AI architecture.

5328.068 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

Or like five years ago, they might have said, well, if an AI threatens to kill someone.

5331.953 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

I think Bing, like, threatened to kill a New York Times reporter during an interview.

5336.138 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

And everyone's just like, oh, yeah, AIs are like that.

5340.424 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

What does your shirt say?

5343.027 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

And I mean, I don't disagree with this.

5346.23 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

I'm also in this position.

5347.792 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

I see the AI is lying and it's obviously just like an artifact of the training process.

5348.993 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

It's not anything sinister.

5352.938 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

But I think this is just going to keep happening where no matter what evidence we get, people are going to think, oh yeah, that's not the AI turns evil thing that people have worried about.

5354.76 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment