Ilya Sutskever
And so now you've got this great competitive programmer.
And with this analogy, I think it's more intuitive: okay, so if it's so well-trained, it's like all the different algorithms and all the different proof techniques are right at its fingertips.
And it's more intuitive that, with this level of preparation, it will not necessarily generalize to other things.
I think it's the it factor.
Yeah.
Right?
When I was an undergrad, I remember there was a student like this who studied with me.
So I know it exists.
Like the main strength of pre-training is that there is A, so much of it.
Yeah.
And B, you don't have to think hard about what data to put into pre-training.
And it's very natural data, and it does include in it a lot of what people do, people's thoughts, a lot of the features of, you know, it's like the whole world as projected by people onto text.
And pre-training tries to capture that using a huge amount of data.
Pre-training is very difficult to reason about, because it's so hard to understand the manner in which the model relies on pre-training data.
And whenever the model makes a mistake, could it be because something, by chance, is not as well supported by the pre-training data?
You know, and "supported by pre-training" is maybe a loose term.
I don't know if I can add anything more useful on this, but I don't think there is a human analog to pre-training.
I think there are some similarities between pre-training and both of those, and pre-training tries to play the role of both of them.