Scott Alexander

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

Maybe if they make it even bigger still, they'll notice more of these connections.

1188.661 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

And then third thing, and here's I think the special one, have you tried training the model to do the thing?

1192.09 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

Just because the pre-training model

1197.343 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

the pre-training process doesn't strongly incentivize this type of connection-making, right?

1201.514 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

In general, I think it's a helpful heuristic that I use to ask the question of, like, remind oneself, what was the AI trained to do?

1206.703 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

What was its training environment like?

1214.275 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

And if you're wondering why hasn't the AI done this, ask yourself, like, did the training environment train it to do this?

1215.797 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

And often the answer is no, and often I think that's a good explanation for why the AI is not good at it, is that, like, it wasn't trained to do it.

1221.146 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

Like, wouldn't it be really gnarly to try to set up an RL environment to train to make new scientific discoveries?

1233.487 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

Well, so, I mean, in our scenario, they don't just, like, leap from where we are now to solving this problem.

1243.403 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

They don't.

1248.672 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

Instead, they just iteratively improve the coding agents until they've basically got coding solved.

1249.212 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

But even still, their coding agents are not able to do some of this stuff.

1255.43 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

Like that's what early 2020s, like the first half of 2027 in our story is basically they've got these awesome automated coders, but they still lack research taste and they still lack maybe like organizational skills and stuff.

1259.957 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

And so they need to like overcome those remaining bottlenecks and gaps in order to completely automate the research cycle.

1271.054 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

But they're able to overcome those gaps faster than they normally would because the coding agents are doing all the grunt work really fast for them.

1276.783 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

I'm glad you mentioned that.

1363.295 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

Our current scenario does not really take that into account very much.

1364.236 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

So that's an example in which our scenario is possibly underestimating the rate of progress.

1366.841 View full episode →

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

We're trying to be sort of like our median guess.

1392.58 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment