Dwarkesh Patel

An audio version of my blog post, Thoughts on AI progress (Dec 2025)

I'm confused why some people have super short timelines, yet at the same time are bullish on scaling up reinforcement learning atop LLMs.

0.385 View full episode →

Dwarkesh Podcast

An audio version of my blog post, Thoughts on AI progress (Dec 2025)

If we're actually close to a human-like learner, then this whole approach of training on verifiable outcomes is doomed.

7.449 View full episode →

Dwarkesh Podcast

An audio version of my blog post, Thoughts on AI progress (Dec 2025)

Now, currently the labs are trying to bake in a bunch of skills into these models through mid-training.

15.488 View full episode →

Dwarkesh Podcast

An audio version of my blog post, Thoughts on AI progress (Dec 2025)

There's an entire supply chain of companies that are building RL environments, which teach the model how to navigate a web browser or use Excel to build financial models.

21.316 View full episode →

Dwarkesh Podcast

An audio version of my blog post, Thoughts on AI progress (Dec 2025)

Now, either of these models will soon learn on the job in a self-directed way, which will make all this freebaking pointless, or they won't, which means that AGI is not imminent.

30.949 View full episode →

Dwarkesh Podcast

An audio version of my blog post, Thoughts on AI progress (Dec 2025)

Humans don't have to go through the special training phase where they need to rehearse every single piece of software that they might ever need to use on the job.

39.62 View full episode →

Dwarkesh Podcast

An audio version of my blog post, Thoughts on AI progress (Dec 2025)

Barron Milledge made an interesting point about this in a recent blog post he wrote.

45.667 View full episode →

Dwarkesh Podcast

An audio version of my blog post, Thoughts on AI progress (Dec 2025)

He writes, quote, When we see frontier models improving at various benchmarks, we should think not just about the increased scale and the clever ML research ideas, but the billions of dollars that are paid to PhDs, MDs, and other experts to write questions and provide example answers and reasoning targeting these precise capabilities.

48.731 View full episode →

Dwarkesh Podcast

An audio version of my blog post, Thoughts on AI progress (Dec 2025)

You can see this tension most vividly in robotics.

67.413 View full episode →

Dwarkesh Podcast

An audio version of my blog post, Thoughts on AI progress (Dec 2025)

In some fundamental sense, robotics is an algorithms problem, not a hardware or data problem.

70.416 View full episode →

Dwarkesh Podcast

An audio version of my blog post, Thoughts on AI progress (Dec 2025)

With very little training, a human can learn how to teleoperate current hardware to do useful work.

75.282 View full episode →

Dwarkesh Podcast

An audio version of my blog post, Thoughts on AI progress (Dec 2025)

So if we actually had a human-like learner, robotics would be, in large part, a solved problem.

80.728 View full episode →

Dwarkesh Podcast

An audio version of my blog post, Thoughts on AI progress (Dec 2025)

But the fact that we don't have such a learner makes it necessary to go out into a thousand different homes and practice a million times on how to pick up dishes or fold laundry.

84.873 View full episode →

Dwarkesh Podcast

An audio version of my blog post, Thoughts on AI progress (Dec 2025)

Now, one common argument I've heard from the people who think we're going to have a takeoff within the next five years is that we have to do all this kludgy RL in service of building a superhuman AI researcher.

92.843 View full episode →

Dwarkesh Podcast

An audio version of my blog post, Thoughts on AI progress (Dec 2025)

And then the million copies of this automated ILLIA can go figure out how to solve robust and efficient learning from experience.

103.799 View full episode →

Dwarkesh Podcast

An audio version of my blog post, Thoughts on AI progress (Dec 2025)

This just gives me the vibes of that old joke, we're losing money on every sale, but we'll make it up in volume.

110.309 View full episode →

Dwarkesh Podcast

An audio version of my blog post, Thoughts on AI progress (Dec 2025)

Somehow this automated researcher is going to figure out the algorithm for AGI, which is a problem that humans have been banging their head against for the better half of a century, while not having the basic learning capabilities that children have.

115.795 View full episode →

Dwarkesh Podcast

An audio version of my blog post, Thoughts on AI progress (Dec 2025)

I find this super implausible.

128.209 View full episode →

Dwarkesh Podcast

An audio version of my blog post, Thoughts on AI progress (Dec 2025)

Besides, even if that's what you believe, it doesn't describe how the labs are approaching reinforcement learning from verifiable reward.

129.671 View full episode →

Dwarkesh Podcast

An audio version of my blog post, Thoughts on AI progress (Dec 2025)

You don't need to prebake in a consultant skill at crafting PowerPoint slides in order to automate ILIA.

136.398 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment