Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing

Ilya Sutskever

👤 Person
766 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

That's how reinforcement learning is done naively.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

That's how 01, R1 ostensibly are done.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

The value function says something like, okay, look, maybe I could sometimes, not always, could tell you if you are doing well or badly.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

The notion of a value function is more useful in some domains than others.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

For example, when you play chess and you lose a piece, you know, I messed up.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

You don't need to play the whole game to know that what I just did was bad and therefore whatever preceded it was also bad.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

So the value function lets you short circuit the weight until the very end.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

Like let's suppose that you started to pursue some kind of, okay, let's suppose that you are doing some kind of a math thing or a programming thing, and you're trying to explore a particular solution direction.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

And after, let's say after a thousand steps of thinking, you concluded that this direction is unpromising.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

As soon as you conclude this, you could already get a reward signal a thousand time steps previously when you decided to pursue down this path.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

You say, oh, next time I shouldn't pursue this path in a similar situation long before you actually came up with the proposed solution.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

This sounds like such lack of faith in deep learning.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

Like, I mean, sure, it might be difficult, but nothing deep learning can't do.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

So my expectation is that value functions should be useful and I fully expect that they will be used in the future if not already.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

What was I alluding to with the person whose emotional center got damaged is more that

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

Maybe what it suggests is that the value function of humans is modulated by emotions in some important way that's hard-coded by evolution.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

And maybe that is important for people to be effective in the world.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

I do agree that compared to the kind of things that we learn and the things that we are talking about, the kind of ways we are talking about emotions are relatively simple.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

might even be so simple that maybe you could map them out in a human understandable way.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

I think it would be cool to do.