Andrej Karpathy
It's going to start using words that are extremely rare.
So it's going to drift too much from the distribution.
So I think controlling the distribution is just tricky. Someone has to actually work it out; it's probably not trivial in that sense.
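To make that concrete, here is a minimal sketch of one way to quantify the kind of drift being described: compare a model's word-usage distribution against a reference corpus with KL divergence, which grows as the model leans on words that are rare in the reference. The corpora and smoothing scheme below are illustrative assumptions, not anything from the conversation.

```python
# Sketch: measure word-distribution drift between generated text and a
# reference corpus via KL divergence. Toy data; real pipelines would use
# tokenizers and large corpora.
import math
from collections import Counter

def unigram_dist(tokens, vocab, alpha=1.0):
    """Laplace-smoothed unigram distribution over a fixed vocabulary."""
    counts = Counter(tokens)
    total = len(tokens) + alpha * len(vocab)
    return {w: (counts[w] + alpha) / total for w in vocab}

def kl_divergence(p, q):
    """KL(p || q); grows as p puts mass on words that are rare under q."""
    return sum(p[w] * math.log(p[w] / q[w]) for w in p)

reference = "the cat sat on the mat and the dog sat down too".split()
generated = "the perspicacious feline reposed upon the aforementioned mat".split()

vocab = set(reference) | set(generated)
p = unigram_dist(generated, vocab)
q = unigram_dist(reference, vocab)
print(f"drift (KL): {kl_divergence(p, q):.3f}")  # larger = more rare-word drift
```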
So it's really interesting in the history of the field, because at one point everything was very scaling-pilled: oh, we're going to make much bigger models, trillions of parameters.
And actually, what the models have done in size is they've gone up, and now they've actually come back down.
State-of-the-art models are smaller.
And even then, I actually think they memorized way too much.
So I had a prediction a while back that we can get cognitive cores that are very good at even, like, a billion parameters.
Like, if you talk to a billion-parameter model in 20 years, I think you can actually have a very productive conversation. It thinks, and it's a lot more like a human.
But if you ask it some factual question, it might have to look it up. It knows that it doesn't know, so it will look it up and just do all the reasonable things.
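As a toy sketch of that "knows it doesn't know" behavior: a small core that answers from its own memorized knowledge when it can, and falls back to an external lookup otherwise. The lookup function and fact tables here are hypothetical stand-ins, not a real system.

```python
# Sketch: answer from memorized knowledge when possible, otherwise
# defer to an external lookup (web search, database, etc.).
KNOWN_FACTS = {"capital of France": "Paris"}  # the core's memorized knowledge

def lookup(query):
    """Stand-in for an external tool call."""
    external = {"boiling point of tungsten": "5555 C"}
    return external.get(query, "no result found")

def answer(query):
    if query in KNOWN_FACTS:            # confident: answer directly
        return KNOWN_FACTS[query]
    # not memorized: admit it and look it up
    return f"I don't know offhand; looking it up: {lookup(query)}"

print(answer("capital of France"))
print(answer("boiling point of tungsten"))
```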
No, because here's the issue: the training data is the internet, which is really terrible. So there are huge gains to be made, precisely because the internet is terrible.
When you and I think of the internet, we're thinking of something like a Wall Street Journal article, but that's not what this is. When you're actually looking at a pre-training data set at a frontier lab and you pull up a random internet document, it's total garbage.
Like I don't even know how this works at all.
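For a sense of what inspecting that data mechanically looks like, here is a minimal sketch of heuristic document filtering in the spirit of published pipelines (e.g., the Gopher rules); the specific checks and thresholds are illustrative assumptions, not any lab's actual filter.

```python
# Sketch: crude heuristics that flag garbage-looking web documents
# before pre-training. Thresholds are illustrative, not from any lab.
def looks_like_garbage(doc: str) -> bool:
    words = doc.split()
    if len(words) < 50:                              # too short to carry content
        return True
    alpha = sum(w.strip(".,;:!?").isalpha() for w in words)
    if alpha / len(words) < 0.8:                     # mostly symbols/markup debris
        return True
    if len(set(words)) / len(words) < 0.3:           # heavy repetition (boilerplate)
        return True
    return False

spam = "buy now!!! $$$ click here http://spam-link 404 404 404 " * 8
prose = ("The committee reviewed the proposal in detail and recommended "
         "several changes to the draft, citing concerns about scope and "
         "budget. Members asked the authors to clarify their timeline, "
         "identify the main risks, and describe how the results would be "
         "evaluated. A revised version is expected before the next meeting, "
         "along with supporting data and a short summary for the board.")

print(looks_like_garbage(spam))   # True
print(looks_like_garbage(prose))  # False
```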