Andrej Karpathy

Speaker
3433 total appearances

Podcast Appearances

Dwarkesh Podcast
Andrej Karpathy - AGI is still a decade away

And then, so I guess what I'm saying is, like, we need intelligent models to help us refine even the pre-training set to just narrow it down to the cognitive components.

And then I think you get away with a much smaller model because it's a much better data set and you could train it on it.

But probably it's not trained directly on it.

It's probably distilled from a much better model still.

I just feel like distillation works extremely well.

So almost every small model, if you have a small model, it's almost certainly distilled.

I mean, come on, right?

I don't know.

At some point, it should take at least a billion knobs to do something interesting.

You're thinking it should be even smaller?

I mean, I almost feel like I'm already contrarian by talking about a billion-parameter cognitive core, and you're outdoing me.

I think, yeah, maybe we could get a little bit smaller.

I mean, I still think that there should be enough.

Yeah, maybe it can be smaller.

I do think that, practically speaking, you want the model to have some knowledge.

You don't want it to be looking up everything.

Because then you can't think in your head.

You're looking up way too much stuff all the time.

So I do think some basic curriculum needs to be there for knowledge.