
Andrej Karpathy

👤 Speaker
3419 total appearances

Podcast Appearances

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

They will recite passages from all these training sources.

You can give them completely nonsensical data, like you can hash some amount of text or something like that.

You get a completely random sequence.

If you train on it, even just, I think, a single iteration or two, it can suddenly regurgitate the entire thing.
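The regurgitation effect described above can be illustrated with a toy sketch (my own construction, not from the episode): give a maximally overparameterized model, one free logit vector per position, a uniformly random token sequence, and a single gradient step of cross-entropy is already enough for greedy decoding to recite the whole thing. All names and sizes here are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
VOCAB, LENGTH = 50, 200

# A "nonsensical" training document: a uniformly random token sequence,
# a toy stand-in for hashed text with no generalizable structure.
sequence = rng.integers(0, VOCAB, size=LENGTH)

# Overparameterized "model": one independent logit vector per position.
logits = np.zeros((LENGTH, VOCAB))

def softmax(x):
    z = np.exp(x - x.max(axis=-1, keepdims=True))
    return z / z.sum(axis=-1, keepdims=True)

def train_step(logits, targets, lr=1.0):
    # Gradient of mean cross-entropy w.r.t. logits is (probs - one_hot).
    grad = softmax(logits)
    grad[np.arange(len(targets)), targets] -= 1.0
    return logits - lr * grad

# A single iteration: the target logit rises, all others fall, so greedy
# (argmax) decoding now reproduces the random sequence exactly.
logits = train_step(logits, sequence)
recited = logits.argmax(axis=-1)
accuracy = (recited == sequence).mean()
print(accuracy)  # 1.0
```

A real transformer shares parameters across positions, so it typically needs more than one pass, but the quote's point survives in miniature: with enough capacity, pure noise is memorized rather than resisted.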

It will memorize it.

There's no way a person can read a single sequence of random numbers and recite it to you.

And that's a feature, not a bug almost, because it forces you to like only learn the generalizable components.

Whereas LLMs are distracted by all the memory that they have of the pre-trained documents.

And it's probably very distracting to them in a certain sense.

So that's why when I talk about the cognitive core, I actually want to remove the memory, which is what we talked about.

I'd love to have less memory so that they have to look things up.

And they only maintain the algorithms for like thought and the idea of an experiment and all this cognitive glue of acting.

I'm not sure.

I think it's almost like a separate axis.

It's almost like the models are way too good at memorization and somehow we should remove that.

And I think people are much worse, but it's a good thing.

Yeah, I think that's a great question.

I mean, you can imagine having a regularization for entropy and things like that.

I guess they just don't work as well empirically because right now, like, the models are collapsed.
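The "regularization for entropy" mentioned above is often written as an entropy bonus subtracted from the training loss, so that collapsed (overly peaked) output distributions are penalized. This is a generic sketch of that pattern under my own assumptions; the quote does not specify any particular formulation, and `beta` is a hypothetical knob.

```python
import numpy as np

def softmax(x):
    z = np.exp(x - np.max(x, axis=-1, keepdims=True))
    return z / z.sum(axis=-1, keepdims=True)

def mean_entropy(probs):
    # Average Shannon entropy (nats) of each row's output distribution.
    return float(-(probs * np.log(probs + 1e-12)).sum(axis=-1).mean())

def entropy_regularized_loss(logits, targets, beta=0.1):
    # Cross-entropy minus beta * entropy: the entropy bonus rewards
    # less-peaked predictions, pushing back against collapse.
    probs = softmax(logits)
    ce = float(-np.log(probs[np.arange(len(targets)), targets] + 1e-12).mean())
    return ce - beta * mean_entropy(probs)

targets = np.array([0, 1])
# A collapsed model (huge logit gaps) has near-zero entropy, so it gets
# almost no bonus; a softer model with the same argmax is rewarded.
collapsed = np.array([[12.0, 0.0, 0.0], [0.0, 12.0, 0.0]])
softer = np.array([[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]])
assert mean_entropy(softmax(collapsed)) < mean_entropy(softmax(softer))
```

As the quote notes, such penalties exist but have not empirically fixed collapse; the sketch only shows what the regularizer measures, not that it works.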

But I will say...