Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing

Dwarkesh

👤 Person
1735 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

I'm just saying it's not 100% obvious.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

But what is that?

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

How do you think about emotions?

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

What is the ML analogy for emotions?

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

It might be worth defining for the audience what a value function is if you want to do that.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

this was in the DeepSeq R1 paper, is that the space of trajectories is so wide that maybe it's hard to learn a mapping from an intermediate trajectory and value.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

And also given that, you know, in coding, for example, you will have the wrong idea, then you'll go back, then you'll change something.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

That's the thing I was actually planning on asking you.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

There's something really interesting about emotions of the value function, which is that...

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

It's impressive that they have this much utility while still being rather simple to understand.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

So I have two responses.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

Yeah.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

People have been talking about scaling data, scaling parameter, scaling compute.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

Is there a more general way to think about scaling?

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

What are the other scaling axes?

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

That's a very interesting way to put it.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

But let me ask you the question you just posed then.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

What are we scaling and what would it mean to have a recipe?

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

Because I guess I'm not aware of a very clean relationship that almost looks like a law of physics, which existed in pre-training.

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

There was a power law between data or computer parameters and loss.