Ilya Sutskever

👤 Person
766 total appearances

Podcast Appearances

Dwarkesh Podcast
Ilya Sutskever – We're moving from the age of scaling to the age of research

And so now you've got this great competitive programmer.

And with this analogy, I think it's more intuitive that, yeah, okay, if it's so well-trained, all the different algorithms and all the different proof techniques are right at its fingertips.

And it's more intuitive that with this level of preparation, it will not necessarily generalize to other things.

I think it's the it factor.

Yeah.

Right?

And I know, when I was an undergrad, I remember there was a student like this that studied with me.

So I know it exists.

The main strength of pre-training is that there is, A, so much of it.

Yeah.

And B, you don't have to think hard about what data to put into pre-training.

And it's a very kind of natural data, and it does include in it a lot of what people do: people's thoughts and a lot of the features of, you know, it's like the whole world as projected by people onto text.

And pre-training tries to capture that using a huge amount of data.

Pre-training is very difficult to reason about because it's so hard to understand the manner in which the model relies on pre-training data.

And whenever the model makes a mistake, could it be because something by chance is not as supported by the pre-training data?

You know, and support by pre-training is maybe a loose term.

I don't know if I can add anything more useful on this, but I don't think there is a human analog to pre-training.

I think there are some similarities between both of these and pre-training, and pre-training tries to play the role of both of these.