
Andrej Karpathy

Speaker
3419 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

I don't know that I fully resonate with that because I feel like these models, when you boot them up and they have zero tokens in the window, they're always like restarting from scratch where they were.

So I don't actually know in that worldview what it looks like because, again, maybe making some analogies to humans just because I think it's roughly concrete and kind of interesting to think through.

I feel like when I'm awake, I'm building up a context window of stuff that's happening during the day.

But I feel like when I go to sleep, something magical happens where I don't actually think that that context window stays around.

I think there's some process of distillation into weights of my brain.

And this happens during sleep and all this kind of stuff.

We don't have an equivalent of that in large language models.

And that's, to me, more adjacent to when you talk about continual learning and so on as absent.

These models don't really have this distillation phase of taking what happened, analyzing it, obsessively thinking through it, basically doing some kind of a synthetic data generation process and distilling it back into the weights, and maybe having a specific neural net per person.
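The distillation idea described above — let a frozen "teacher" (standing in for what sits in the context window) label synthetic data, then train a small "student" so the knowledge ends up in weights — can be sketched as a toy in NumPy. Everything here (linear models, dimensions, learning rate) is an illustrative assumption, not Karpathy's proposal:

```python
import numpy as np

rng = np.random.default_rng(0)
d = 8

teacher_w = rng.standard_normal(d)   # fixed: knowledge held "in context"
student_w = np.zeros(d)              # the weights we distill into

# Synthetic data generation: sample inputs, let the teacher label them.
X = rng.standard_normal((256, d))
y = X @ teacher_w

# Distillation: regress the student onto the teacher's outputs with SGD on MSE.
lr = 0.01
for _ in range(1000):
    pred = X @ student_w
    grad = 2 * X.T @ (pred - y) / len(X)
    student_w -= lr * grad

# After distillation, the student reproduces the teacher without the "context".
print(np.max(np.abs(student_w - teacher_w)))
```

The point of the toy is the data flow, not the model class: the teacher is never deployed, and only the student's weights persist, which mirrors "context in, weights out."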

Maybe it's a LoRA, it's not a full...

Yeah, it's not a full-weight neural network.

It's just that a small, sparse subset of the weights is changed.
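A minimal sketch of the LoRA idea mentioned here: keep the pretrained weight matrix W frozen and add a trainable low-rank update B @ A, so a per-person adapter needs only r * (d_in + d_out) parameters instead of d_in * d_out. Dimensions and initialization are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
d_in, d_out, r = 64, 64, 4

W = rng.standard_normal((d_out, d_in))     # frozen pretrained weights
A = rng.standard_normal((r, d_in)) * 0.01  # trainable low-rank factor
B = np.zeros((d_out, r))                   # zero-init: adapter starts as a no-op

def forward(x):
    # Base model output plus the low-rank per-user adaptation.
    return W @ x + B @ (A @ x)

x = rng.standard_normal(d_in)
# With B = 0, outputs match the frozen base model exactly.
assert np.allclose(forward(x), W @ x)

# The adapter is a small fraction of the full weight count.
full_params = d_in * d_out
lora_params = r * (d_in + d_out)
print(lora_params / full_params)  # 0.125
```

Zero-initializing B is the standard trick: the adapted model starts identical to the base model, and training only gradually moves it away.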

But basically, we do want to create ways of creating these individuals that have very long contexts.

It's not only remaining in the context window, because the context windows grow very, very long.

Maybe we have some very elaborate sparse attention over it.

But I still think that humans obviously have some process for distilling some of that knowledge into the weights.

We're missing it.

And I do also think that humans have some kind of a very elaborate sparse attention scheme, which I think we're starting to see some early hints of.

So DeepSeek v3.2 just came out, and I saw that they have like a sparse attention as an example.

And this is one way to have very, very long context windows.
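One simple instance of sparse attention for long contexts is top-k selection: each query attends only to its k highest-scoring keys rather than the whole sequence. This is a generic sketch of the idea, not DeepSeek's actual mechanism, and all dimensions are illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)
seq_len, d, k = 128, 16, 8

q = rng.standard_normal(d)            # one query
K = rng.standard_normal((seq_len, d)) # keys over a long context
V = rng.standard_normal((seq_len, d)) # values over a long context

scores = K @ q / np.sqrt(d)              # scaled dot-product scores
topk = np.argpartition(scores, -k)[-k:]  # indices of the k best keys

# Softmax only over the selected keys; all other positions get zero weight.
s = scores[topk]
w = np.exp(s - s.max())
w /= w.sum()
out = w @ V[topk]  # attend to k tokens instead of seq_len
```

In this toy the scores are still computed densely; practical long-context schemes also make the selection step itself cheap, which is where the engineering effort goes.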