Andrej Karpathy
actually even come down.
State-of-the-art models are smaller.
And even then, I actually think they memorize way too much.
So I had a prediction a while back: I almost feel like we can get cognitive cores that are very good at even a billion parameters.
If you talk to a billion-parameter model in 20 years, I think you can actually have a very productive conversation. It thinks, and it's a lot more like a human.
But if you ask it some factual question, it might have to look it up. It knows that it doesn't know, it knows when it has to look something up, and it will just do all the reasonable things.
No, because I basically think the issue is the training data. The training data is the internet, which is really terrible. So there's a huge amount of gains to be made precisely because the internet is so terrible.
And even the internet itself: when you and I think of the internet, you're thinking of something like a Wall Street Journal article, but that's not what this is. When you actually look at a pre-training dataset at a frontier lab and pull up a random internet document, it's total garbage.
Like I don't even know how this works at all.
It's some stock ticker symbols, a huge amount of slop and garbage from all the corners of the internet. It's not your Wall Street Journal article; that's extremely rare.
So I almost feel like, because the internet is so terrible, we actually have to build really big models just to compress all of that. And most of that compression is memory work, not cognitive work. But what we really want is the cognitive part, and we actually want to delete the memory.