
Dwarkesh Patel

Podcast Appearances

Dwarkesh Podcast
An audio version of my blog post, Thoughts on AI progress (Dec 2025)

We need X, Y and Z capabilities in these models.

Models keep getting more impressive at the rate that the short timelines people predict, but more useful at the rate that the long timelines people predict.

It's worth asking, what are we scaling?

With pre-training, we had an extremely clean and general trend of improvement in loss across multiple orders of magnitude of compute.

Albeit, this was a power law, which is as weak as exponential growth is strong.
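
To make that concrete, here is a toy sketch of a power-law loss curve. The coefficients a and b are invented for illustration and are not fitted to any real model.

```python
# Toy power-law loss curve: L(C) = a * C**(-b).
# The coefficients below are invented for illustration only.
a, b = 10.0, 0.05

def loss(compute):
    return a * compute ** (-b)

# Each 10x step in compute multiplies loss by the same factor,
# 10**(-b) ~= 0.89 here, no matter where on the curve you are:
for c in [1e20, 1e22, 1e24, 1e26]:
    print(f"compute={c:.0e}  loss={loss(c):.3f}")
```

With these invented coefficients, halving the loss requires roughly a million-fold increase in compute (2 raised to 1/0.05), which is the sense in which a power law is as weak as exponential growth is strong.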

But people are trying to launder the prestige that pre-training scaling has, which is almost as predictable as a physical law of the universe, to justify bullish predictions about reinforcement learning from verifiable rewards, for which we have no well-fit publicly known trend.

And when intrepid researchers do try to piece together the implications from scarce public data points, they get pretty bearish results.

For example, Toby Ord has a great post where he cleverly connects the dots between the different o-series benchmarks.

And this suggested to him that, quote, we need something like a million-x scale-up in total RL compute to give a boost similar to a single GPT level, end quote.
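
As a back-of-the-envelope reading of that estimate, here is the arithmetic under one assumption of mine (not from the post): that one "GPT level" has historically corresponded to roughly a 100x increase in pre-training compute.

```python
import math

# Back-of-the-envelope arithmetic; both numbers below are assumptions
# used for illustration. Assume one "GPT level" has historically taken
# roughly a 100x increase in pre-training compute.
pretrain_factor_per_level = 100

# The estimate quoted above: ~1,000,000x more total RL compute for a
# similar one-level boost.
rl_factor_per_level = 1_000_000

# In orders of magnitude, RL would need 6 where pre-training needed 2:
ratio = math.log10(rl_factor_per_level) / math.log10(pretrain_factor_per_level)
print(f"RL needs {ratio:.0f}x the orders of magnitude per GPT level")
```

On this reading, capability bought per order of magnitude of compute would be several times lower for RL than it was for pre-training.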

So people have spent a lot of time talking about the possibility of a software-only singularity, where AI models will write the code that generates a smarter successor system, or a software plus hardware singularity, where AIs also improve their successor's computing hardware.

However, all these scenarios neglect what I think will be the main driver of further improvements atop AGI, continual learning.

Again, think about how humans become more capable at anything.

It's mostly from experience in the relevant domain.

In conversation, Baron Millage made this interesting suggestion: the future might look like continual-learning agents who all go out, do different jobs, and generate value.

And then they bring all their learnings back to the hive-mind model, which does some kind of batch distillation on all of these agents.

The agents themselves could be quite specialized, containing what Karpathy called the cognitive core, plus knowledge and skills relevant to the job they're being deployed to do.
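
One way to picture the loop being described, purely as a speculative sketch of my own with invented placeholder functions:

```python
# A speculative sketch (my own framing, with invented placeholder
# functions) of the loop described above: specialized agents collect
# experience on their jobs, and a central "hive mind" model
# periodically distills it in batch.

def run_agent(job):
    # Placeholder: a real agent would act in the world and record
    # trajectories; here it just returns a trivial "lesson".
    return f"lesson learned on {job}"

def batch_distill(central_knowledge, lessons):
    # Placeholder for batch distillation: merge the agents' lessons
    # into the shared state.
    return central_knowledge | set(lessons)

knowledge = set()
for round_number in range(3):
    lessons = [run_agent(job) for job in ["law", "medicine", "coding"]]
    knowledge = batch_distill(knowledge, lessons)

print(len(knowledge))  # 3 distinct lessons accumulated
```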

Solving continual learning won't be a singular one-and-done achievement.

Instead, it will feel like solving in-context learning.

Now, GPT-3 already demonstrated in 2020 that in-context learning could be very powerful.

Its in-context learning capabilities were so remarkable that the title of the GPT-3 paper was "Language Models Are Few-Shot Learners."
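
For readers unfamiliar with the term, a minimal illustration of few-shot in-context learning: the "training" consists only of examples placed in the prompt, with no weight updates. The English-to-French pairs below follow the style of a figure in the GPT-3 paper.

```python
# Few-shot prompting: the model's "learning" happens entirely in the
# prompt; no parameters are updated.
examples = [
    ("sea otter", "loutre de mer"),
    ("cheese", "fromage"),
]

def few_shot_prompt(query):
    lines = ["Translate English to French:"]
    for english, french in examples:
        lines.append(f"{english} => {french}")
    lines.append(f"{query} =>")
    return "\n".join(lines)

print(few_shot_prompt("mint"))
```

A model shown this prompt is expected to continue the pattern and produce the French translation of the final word.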