Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Andrej Karpathy

๐Ÿ‘ค Speaker
3433 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

actually even come down.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

State-of-the-art models are smaller.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

And even then, I actually think they memorized way too much.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

So I think I had a prediction a while back that I almost feel like we can get cognitive cores that are very good at even like a billion, billion parameters.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

It should be all very like, like if you talk to a billion parameter model, I think in 20 years, you can actually have a very productive conversation, it thinks.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

And it's a lot more like a human.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

But if you ask it some factual question, it might have to look it up.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

But it knows that it doesn't know and it might have to look it up and it will just do all the reasonable things.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

No, because I basically think that the training data is, so here's the issue.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

The training data is the internet, which is really terrible.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

So there's a huge amount of gains to be made because the internet is terrible.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

Like if you actually, and even the internet, when you and I think of the internet, you're thinking of like a Wall Street Journal or that's not what this is.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

When you're actually looking at a pre-training data set in the front of your lab and you look at a random internet document, it's total garbage.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

Like I don't even know how this works at all.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

It's some like stock ticker symbols.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

It's a huge amount of slop and garbage from like all the corners of the internet.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

It's not like your Wall Street Journal article that's extremely rare.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

So I almost feel like because the internet is so terrible, we actually have to sort of like build really big models to compress all that.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

Most of that compression is memory work instead of like cognitive work.

Dwarkesh Podcast
Andrej Karpathy โ€” AGI is still a decade away

But what we really want is the cognitive part to actually delete the memory.