Dwarkesh
That's not the Terminator scenario.
That's just one of these natural consequences of how we train it.
And I think that once a thousand of these natural consequences of training add up, the AI is evil, in the same way that once the AI can do chess and philosophy and all these other things, eventually you've got to admit it's intelligent.
Yeah.
So I think that each individual failure, maybe it will make the national news.
Maybe people say, oh, it's so strange that GPT-7 did this particular thing and then they'll train it away and then it won't do that thing.
And there will be some point in the process of becoming superintelligent at which it makes, I don't want to say the last mistake, because you'll probably have a gradually decreasing number of mistakes approaching some asymptote, but the last mistake that anyone worries about.
And after that, it will be able to do its own thing.
Yeah, I think the alignment community did not really expect LLMs.
I mean, if you look in Bostrom's Superintelligence, there's a discussion of Oracle AIs, which are sort of like LLMs.
I think that came as a surprise.
I think one of the reasons I'm more hopeful than I used to be is that LLMs are great compared to the kind of reinforcement learning self-play agents that they expected.
I do think that now we are kind of starting to move away from the LLMs to those reinforcement learning agents.
We're going to face all of these problems again.
So...
I am the writer and the celebrity spokesperson for this scenario.
I am the only person on the team who is not a genius forecaster.
And maybe related to that, my p(doom) is the lowest of anyone on the team.