Ilya Sutskever

Now, based on what people say on Twitter, they spend more compute on RL than on pre-training at this point because RL can actually consume quite a bit of compute.

1373.171 View full episode →

Dwarkesh Podcast

Ilya Sutskever – We're moving from the age of scaling to the age of research

You know, you do very, very long rollouts.

1383.548 View full episode →

Dwarkesh Podcast

Ilya Sutskever – We're moving from the age of scaling to the age of research

Yes.

1386.272 View full episode →

Dwarkesh Podcast

Ilya Sutskever – We're moving from the age of scaling to the age of research

So it takes a lot of compute to produce those rollouts.

1387.173 View full episode →

Dwarkesh Podcast

Ilya Sutskever – We're moving from the age of scaling to the age of research

And then you get relatively small amount of learning for the rollout.

1390.002 View full episode →

Dwarkesh Podcast

Ilya Sutskever – We're moving from the age of scaling to the age of research

So you really can spend a lot of compute.

1393.133 View full episode →

Dwarkesh Podcast

Ilya Sutskever – We're moving from the age of scaling to the age of research

And I could imagine, like, I wouldn't, at this, it's more like, I wouldn't even call it a scaling.

1395.661 View full episode →

Dwarkesh Podcast

Ilya Sutskever – We're moving from the age of scaling to the age of research

I would say, hey, like, what are you doing?

1404.214 View full episode →

Dwarkesh Podcast

Ilya Sutskever – We're moving from the age of scaling to the age of research

And is the thing you are doing the most productive thing you could be doing?

1407.178 View full episode →

Dwarkesh Podcast

Ilya Sutskever – We're moving from the age of scaling to the age of research

Can you find a more productive way of using your compute?

1411.805 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment