Ilya Sutskever
But it does suggest that something strange is going on.
I have two possible explanations.
The more whimsical explanation is that maybe RL training makes the models a little too single-minded and narrowly focused, a little too, I don't know, unaware, even though it also makes them more aware in other ways.
And because of this, they can't do basic things.
But there is another explanation. Back when people were doing pre-training, the question of what data to train on was already answered, because the answer was everything.
When you do pre-training, you need all the data.
So you don't have to think, is it going to be this data or that data?
But when people do RL training, they do need to think.
They say, okay, we want to have this kind of RL training for this thing and that kind of RL training for that thing.
And from what I hear, all the companies have teams that just produce new RL environments and add them to the training mix.
And the question is, well, what are those?
There are so many degrees of freedom.
There is such a huge variety of RL environments you could produce.
And one thing you could do, and I think it's something that is done inadvertently, is take inspiration from the evals.
You say, hey, I would love our model to do really well when we release it. I want the evals to look great. So what would be the RL training that could help on this task?