Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Andrej Karpathy

๐Ÿ‘ค Speaker
3419 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

It's going to start using words that are extremely rare.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

So it's going to drift too much from the distribution.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

So I think controlling the distribution is just like a tricky... It's just like someone just has to...

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

It's probably not trivial in that sense.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

So it's really interesting in the history of the field because at one point everything was very scaling-pilled in terms of like, oh, we're going to make much bigger models, trillions of parameter models.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

And actually what the models have done in size is they've gone up and now they've actually kind of like

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

actually even come down.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

State-of-the-art models are smaller.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

And even then, I actually think they memorized way too much.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

So I think I had a prediction a while back that I almost feel like we can get cognitive cores that are very good at even like a billion, billion parameters.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

It should be all very like, like if you talk to a billion parameter model, I think in 20 years, you can actually have a very productive conversation, it thinks.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

And it's a lot more like a human.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

But if you ask it some factual question, it might have to look it up.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

But it knows that it doesn't know and it might have to look it up and it will just do all the reasonable things.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

No, because I basically think that the training data is, so here's the issue.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

The training data is the internet, which is really terrible.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

So there's a huge amount of gains to be made because the internet is terrible.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

Like if you actually, and even the internet, when you and I think of the internet, you're thinking of like a Wall Street Journal or that's not what this is.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

When you're actually looking at a pre-training data set in the front of your lab and you look at a random internet document, it's total garbage.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

Like I don't even know how this works at all.