Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing

Ilya Sutskever

👤 Person
766 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

And that's very...

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

But what does it say about the role of our built-in emotions in making us like a viable agent, essentially?

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

And I guess to connect to your question about pre-training, it's like, maybe if you're good enough at getting everything out of pre-training, you could get that as well.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

But that's the kind of thing which seems...

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

Well, it may or may not be possible to get that from pre-training.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

It should be some kind of a value function thing.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

Yeah.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

But I don't think there is a great ML analogy because right now value functions don't play a very prominent role in the things people do.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

I mean, certainly.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

I'll be very happy to do that.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

Right?

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

So...

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

So when people do reinforcement learning, the very reinforcement learning is done right now.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

How do people train those agents?

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

So you have a neural net, and you give it a problem.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

And then you tell the model, go solve it.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

And the model takes maybe thousands, hundreds of thousands of actions, or thoughts, or something, and then it produces a solution, the solution is created.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

And then the score is used to provide a training signal for every single action

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

in your trajectory.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

So that means that if you're doing something that goes for a long time, if you're training a task that takes a long time to solve, you will do no learning at all until you solve until you come up with a proposed solution.