Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing

Dwarkesh Patel

👤 Person
12212 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

But why is the distilled version still a billion?

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

Is I guess the thing I'm curious about.

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

Why would you train on... Right, no, no, but why is the distillation in 10 years not getting below 1 billion?

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

Oh, you think it should be smaller than a billion?

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

Yeah, I mean, just like if you look at the trend over the last few years, just finding low-hanging fruit and going from like trillion-plus models that are like literally two orders of magnitude smaller in a matter of two years and having better performance.

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

It makes me think the sort of core of intelligence might be even way, way smaller.

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

Like plenty of room at the bottom, to paraphrase Feynman.

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

Yeah.

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

So we're discussing what, like, plausibly could be the cognitive core.

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

There's a separate question, which is, what will actually be the size of furniture models over time?

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

And I'm curious to have a prediction.

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

So we had increasing scale up to maybe 4.5, and now we're seeing decreasing slash plateauing scale.

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

There's many reasons that could be going on.

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

But do you have a prediction about going forward?

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

Will the biggest models be bigger?

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

Will they be smaller?

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

Will they be the same?

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

Do you think they're looking for it to be similar in kind to the kinds of things that have been happening over the last two to five years?

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

Like just in terms of like if I look at Nano Chat versus Nano GPT and then the architectural tweaks you made, is that basically like the flavor of things you continue to keep happening?

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

Or is there – you're not expecting any giant paradigm shift?