Dwarkesh
And it thought a bunch.
And then it realized that the correct way to model the information you gain from a single yes-or-no outcome in RL is as the entropy of a binary random variable.
It made a graph which showed how the bits you gain per sample in RL versus supervised learning scale as the pass rate increases.
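One way to make this concrete (this is my own minimal sketch, not the graph Gemini produced): a single pass/fail outcome with pass rate p is a Bernoulli(p) variable, so the information it carries is the binary entropy H(p), which peaks at 1 bit when p = 0.5 and vanishes as p approaches 0 or 1.

```python
import math

def rl_bits_per_sample(pass_rate: float) -> float:
    """Information (in bits) carried by one yes/no RL outcome:
    the entropy of a Bernoulli(pass_rate) variable."""
    p = pass_rate
    if p <= 0.0 or p >= 1.0:
        return 0.0  # a certain outcome carries no information
    return -p * math.log2(p) - (1 - p) * math.log2(1 - p)

# Bits per sample at a few pass rates: near-zero at the extremes,
# maximal (1 bit) at a 50% pass rate.
for p in (0.01, 0.1, 0.5, 0.9, 0.99):
    print(f"pass rate {p:.2f}: {rl_bits_per_sample(p):.3f} bits")
```

Supervised learning, by contrast, can extract information from every token of the target, which is the intuition behind the comparison in the graph.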
And as soon as I saw the graph that Gemini 3 made, immediately a ton of things started making sense to me.
Then I wanted to see if there was any empirical basis to this theory.
So I asked Gemini to code an experiment to show whether the improvement in loss scales in this way with pass rate.
I just took the code that Gemini output, copy-pasted it into a Google Colab notebook, and was able to run this toy ML experiment and visualize its results without a single bug.
It's interesting because the results look similar, but not identical, to what we would have expected.
And so I downloaded this chart and I put it into Gemini and asked it, what is going on here?
And it came up with a hypothesis that I think is actually correct, which is that we're capping how much supervised learning can improve in the beginning by having a fixed learning rate.
And in fact, we should decrease the learning rate over time.
It actually gives us an intuitive understanding for why in practice we have learning rate schedulers that decrease the learning rate over time.
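To illustrate what such a schedule looks like (a generic sketch, not the specific fix Gemini proposed): a cosine schedule is a common choice that starts at a high learning rate and smoothly decays it toward a small floor over training.

```python
import math

def cosine_lr(step: int, total_steps: int,
              lr_max: float = 1e-3, lr_min: float = 1e-5) -> float:
    """Cosine learning-rate decay from lr_max at step 0
    down to lr_min at the final step."""
    t = step / total_steps
    return lr_min + 0.5 * (lr_max - lr_min) * (1 + math.cos(math.pi * t))

# Large early steps let the loss drop quickly; smaller late steps
# avoid capping how finely the model can converge.
for step in (0, 25, 50, 75, 100):
    print(f"step {step:3d}: lr = {cosine_lr(step, 100):.6f}")
```

The hypothetical `lr_max`/`lr_min` values here are just illustrative defaults; frameworks like PyTorch ship this as a built-in scheduler (`CosineAnnealingLR`).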
I did this entire flow from coming up with this vague initial question to building a theoretical understanding to running some toy ML experiments, all with Gemini 3.
This feels like the first model where it can actually come up with new connections that I wouldn't have anticipated.
It's actually now become the default place I go to when I want to brainstorm new ways to think about a problem.
If you want to read more about RL scaling, you can check out the blog post that I wrote with a little help from Gemini 3.
And if you want to check out Gemini 3 yourself, go to gemini.google.
I am curious: you say we are back in an era of research, and you were there from 2012 to 2020. What is the vibe going to be now, if we go back to the era of research?
For example...