Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing

Ilya Sutskever

👤 Person
766 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

But it does suggest that something strange is going on.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

I have two possible explanations.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

So here, this is the more kind of...

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

whimsical explanation is that maybe RL training makes the models a little bit too single-minded and narrowly focused, a little bit too, I don't know, unaware, even though it also makes them aware in some other ways.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

And because of this, they can't do basic things.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

But there is another explanation, which is

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

Back when people were doing pre-training, the question of what data to train on was answered.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

Because that answer was everything.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

When you do pre-training, you need all the data.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

So you don't have to think, is it going to be this data or that data?

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

But when people do RL training, they do need to think.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

They say, okay, we want to have this kind of RL training for this thing and that kind of RL training for that thing.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

And from what I hear, all the companies have teams that just produce new RL environments and just add it to the training mix.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

And the question is, well, what are those?

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

There are so many degrees of freedom.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

There is such a huge variety of RL environments you could produce.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

And one thing you could do, and I think that's something that is done inadvertently, is that people take inspiration from the evals.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

You say, hey, I would love our model to do really well when we release it.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

I want the evals to look great.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

what would be RL training that could help on this task, right?