Dwarkesh

So I have to say that prepping for Ilya was pretty tough because neither I nor anybody else had any idea what he's working on and what SSI is trying to do.

1952.231 View full episode →

Dwarkesh Podcast

Ilya Sutskever – We're moving from the age of scaling to the age of research

I had no basis to come up with my questions.

1961.559 View full episode →

Dwarkesh Podcast

Ilya Sutskever – We're moving from the age of scaling to the age of research

And the only thing I could go off, honestly, was trying to think from first principles about what are the bottlenecks to HEI?

1964.502 View full episode →

Dwarkesh Podcast

Ilya Sutskever – We're moving from the age of scaling to the age of research

Because clearly Ilya is working on them in some way.

1971.088 View full episode →

Dwarkesh Podcast

Ilya Sutskever – We're moving from the age of scaling to the age of research

Part of this question involved thinking about RL scaling because everybody's asking how well RL will generalize and how we can make it generalize better.

1974.271 View full episode →

Dwarkesh Podcast

Ilya Sutskever – We're moving from the age of scaling to the age of research

As part of this, I was reading this paper that came out recently on RL scaling and it showed that actually the learning curve on RL looks like a sigmoid.

1981.477 View full episode →

Dwarkesh Podcast

Ilya Sutskever – We're moving from the age of scaling to the age of research

I found this very curious.

1989.466 View full episode →

Dwarkesh Podcast

Ilya Sutskever – We're moving from the age of scaling to the age of research

Why should it be a sigmoid where it learns very little for a long time and then it quickly learns a lot and then it asymptotes?

1990.667 View full episode →

Dwarkesh Podcast

Ilya Sutskever – We're moving from the age of scaling to the age of research

This is very different from the power law you see in pre-training where the model learns a bunch at the very beginning and then less and less over time.

1997.274 View full episode →

Dwarkesh Podcast

Ilya Sutskever – We're moving from the age of scaling to the age of research

And it actually reminded me of a note that I had written down after I had a conversation with a researcher friend where he pointed out that the number of samples that you need to take in order to find a correct answer scales exponentially with how different your current probability distribution is from the target probability distribution.

2004.562 View full episode →

Dwarkesh Podcast

Ilya Sutskever – We're moving from the age of scaling to the age of research

And I was thinking about how these two ideas are related.

2021.183 View full episode →

Dwarkesh Podcast

Ilya Sutskever – We're moving from the age of scaling to the age of research

I had this vague idea that they should be connected, but I really didn't know how.

2023.508 View full episode →

Dwarkesh Podcast

Ilya Sutskever – We're moving from the age of scaling to the age of research

I don't have a math background, so I couldn't really formalize it.