Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing

Dwarkesh

👤 Person
1735 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

It seems like humans have some solution, but I'm curious about like, well, how are they doing it?

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

And like, why is it so hard to like, well, how do we need to reconceptualize the way we're training models to make something like this possible?

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

You know, that is a great question to ask.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

Nobody listens to this podcast, Ilya.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

Yeah.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

So I have to say that prepping for Ilya was pretty tough because neither I nor anybody else had any idea what he's working on and what SSI is trying to do.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

I had no basis to come up with my questions.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

And the only thing I could go off, honestly, was trying to think from first principles about what are the bottlenecks to HEI?

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

Because clearly Ilya is working on them in some way.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

Part of this question involved thinking about RL scaling because everybody's asking how well RL will generalize and how we can make it generalize better.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

As part of this, I was reading this paper that came out recently on RL scaling and it showed that actually the learning curve on RL looks like a sigmoid.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

I found this very curious.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

Why should it be a sigmoid where it learns very little for a long time and then it quickly learns a lot and then it asymptotes?

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

This is very different from the power law you see in pre-training where the model learns a bunch at the very beginning and then less and less over time.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

And it actually reminded me of a note that I had written down after I had a conversation with a researcher friend where he pointed out that the number of samples that you need to take in order to find a correct answer scales exponentially with how different your current probability distribution is from the target probability distribution.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

And I was thinking about how these two ideas are related.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

I had this vague idea that they should be connected, but I really didn't know how.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

I don't have a math background, so I couldn't really formalize it.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

But I wondered if Gemini 3 could help me out here.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

And so I took a picture of my notebook and I took the paper and I put them both in the context of Gemini 3 and I asked it to find the connection.