Andrej Karpathy

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And they think through things.

2585.485 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

There's nothing in current LLMs that does this.

2586.828 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

There's no equivalent of it.

2588.631 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I do see papers popping out that are trying to do this because it's obvious to everyone in the field.

2590.734 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I kind of see it as like, the first imitation learning actually, by the way, was extremely surprising and miraculous and amazing that we can fine-tune by imitation in humans.

2595.743 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And that was incredible.

2603.631 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Because in the beginning, all we had was base models.

2605.073 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Base models are autocomplete.

2606.875 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it wasn't obvious to me at the time, and I had to learn this, and the paper that blew my mind was InstructGPT.

2609.017 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Because it pointed out that, hey, you can take the pre-trained model, which is autocomplete,

2615.824 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And if you just fine-tune it on text that looks like conversations, the model will very rapidly adapt to become very conversational.

2619.588 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it keeps all the knowledge from pre-training.

2626.175 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And this blew my mind because I didn't understand that this just like stylistically can adjust so quickly and become an assistant to a user through just a few loops of fine-tuning on that kind of data.

2628.318 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It was very miraculous to me that that worked.

2638.189 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So incredible.

2641.012 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And that was like two years, three years of work.