Andrej Karpathy
And then maybe you get an answer.
And now you check the back of the book and you see, okay, the correct answer is this.
And then you can see that, okay, this one, this one, and that one got the correct answer, but these other 97 of them didn't.
So literally what reinforcement learning does is it goes to the ones that worked really well, and every single thing you did along the way, every single token, gets up-weighted, as in: do more of this.
The problem with that is, I mean, people will say that your estimator has high variance, but, I mean, it's just noisy.
It's noisy.
So basically, it kind of almost assumes that every single little piece of the solution that arrived at the right answer was the correct thing to do, which is not true.
Like, you may have gone down the wrong alleys until you arrive at the right solution.
Every single one of those incorrect things you did, as long as you got to the correct solution, will be up-weighted as "do more of this."
It's terrible.
It's noise.
You've done all this work, and at the end you get a single number: oh, you were correct.
And based on that, you weight that entire trajectory up or down.
And so the way I like to put it is you're sucking supervision through a straw. You've done all this work, which could be a minute of rollout, and you're sucking the bits of supervision of the final reward signal through a straw, then broadcasting that across the entire trajectory and using it to up-weight or down-weight the whole thing.
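What's being described is the credit-assignment step of a REINFORCE-style policy gradient: one scalar reward from the final answer check is broadcast uniformly to every token in the rollout. Here's a minimal sketch of just that broadcasting step; the function name, the toy rollouts, and the token labels are all illustrative, not anything from an actual training stack.

```python
# Minimal sketch of trajectory-level credit assignment: one final reward,
# smeared uniformly across every token of the rollout (names are made up).

def trajectory_weights(rollouts, correct_answer):
    """For each rollout (token list plus final answer), assign the SAME
    scalar weight to every token: 1.0 if the final answer matched the
    back of the book, 0.0 otherwise. No per-step credit at all."""
    weighted = []
    for tokens, answer in rollouts:
        reward = 1.0 if answer == correct_answer else 0.0
        # A single bit of supervision, broadcast across the whole trajectory:
        weighted.append([(tok, reward) for tok in tokens])
    return weighted

# Two toy rollouts: the first wanders down a wrong alley but still lands
# on the right answer, so even "wrong_alley" gets up-weighted.
rollouts = [
    (["step_a", "wrong_alley", "backtrack", "step_b"], 42),  # correct answer
    (["step_a", "step_c"], 17),                              # wrong answer
]

for traj in trajectory_weights(rollouts, correct_answer=42):
    print(traj)
```

Note that `"wrong_alley"` receives the same weight as the genuinely useful steps, which is exactly the noise being complained about here: the estimator is unbiased, but a correct final answer up-weights every mistake made along the way.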
It's crazy.
A human would never do this.
Number one, a human would never do hundreds of rollouts.
Number two, when a person finds a solution, they have a pretty complicated process of review: okay, I think I did these parts well, these parts not so well.
I should probably do this or that.