Dwarkesh

Dwarkesh Podcast

2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

I respect its work.

It's not perfect yet.

I think it's actually better at the style on a word-to-word, sentence-to-sentence level than it is at planning out a blog post.

I think

So I think there are possibly two reasons for it.

One, we don't know how the base model would have done at this task.

We know that all the models we see are to some degree reinforcement-learned into a kind of corporate speak mode.

You can get it somewhat out of that corporate speak mode, but I don't know to what degree this is actually doing its best to imitate Scott Alexander versus hit some average between Scott Alexander and corporate speak.

That's right.

And I don't think anyone knows except the internal employees who have access to the base model.

And the second thing I think of, maybe just because it's trendy, as an agency or horizon failure.

Like, Deep Research is an okay researcher.

It's not a great researcher.

If you actually want to understand an issue in depth, you can't use Deep Research.

You've got to do it on your own.

So if you think, like, I spend maybe five to ten hours researching a really research-heavy blog post.

The METR thing, I know we're not supposed to use it for any task except coding, but, like, it says, on average, the AI's horizon is one hour.

So I'm guessing it just cannot plan and execute a good blog post.

It does something very superficial rather than actually going through the steps.
