Andrej Karpathy
You're given a math problem, and you're trying to find a solution.
Now, in reinforcement learning, you will try lots of things in parallel first.
So you're given a problem, you try hundreds of things,
different attempts.
And these attempts can be complex, right?
They can be like, oh, let me try this, let me try that, this didn't work, that didn't work, et cetera.
And then maybe you get an answer.
And now you check the back of the book and you see, okay, the correct answer is this.
And then you can see that, okay, this one, this one, and that one got the correct answer, but these other 97 of them didn't.
So literally what reinforcement learning does is it goes to the ones that worked really well, and every single thing you did along the way, every single token gets up-weighted of, like, do more of this.
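The procedure described above can be sketched in a few lines of toy Python. This is a minimal illustration, not Karpathy's actual setup: the attempt generator, the token names, and the answer-checking are all made up here. The point it shows is only the credit assignment: every token of every attempt that reached the correct answer gets the same "do more of this" increment.

```python
import random

random.seed(0)

CORRECT_ANSWER = 7

def sample_attempt():
    # Stand-in for the model producing a chain of reasoning "tokens"
    # and a final answer (hypothetical toy, not a real model).
    tokens = [random.choice(["step_a", "step_b", "dead_end"]) for _ in range(5)]
    answer = random.choice([3, 7])
    return tokens, answer

token_weight = {}  # accumulated "do more of this" signal per token

for _ in range(100):  # hundreds of parallel attempts
    tokens, answer = sample_attempt()
    reward = 1.0 if answer == CORRECT_ANSWER else 0.0
    # Every token in a winning trajectory gets the same +1 credit,
    # including any "dead_end" steps taken along the way.
    for t in tokens:
        token_weight[t] = token_weight.get(t, 0.0) + reward

print(token_weight)
```

Note that `dead_end` steps accumulate positive weight too, as long as they happened to sit inside a trajectory that ended correctly, which is exactly the problem discussed next.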
The problem with that is, I mean, people will say that your estimator has high variance, but it's just noisy.
It's noisy.
So basically, it kind of almost assumes that every single little piece of the solution that arrived at the right answer was the correct thing to do, which is not true.
Like, you may have gone down the wrong alleys
until you arrived at the right solution.
Every single one of those incorrect things you did, as long as you got to the correct solution, will be up-weighted as do more of this.
It's terrible.
It's noise.
You've done all this work, and at the end you get a single number of, like, oh, you did correct.
And based on that, you weigh that entire trajectory as like up-weight or down-weight.
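That last point, one scalar weighing an entire trajectory, can be made concrete with toy values (the returns and lengths below are invented for illustration): in plain REINFORCE-style up-weighting, the per-token learning signal is the same trajectory-level return broadcast to every position.

```python
# Hypothetical toy values: one scalar return per sampled attempt,
# and the length of each attempt in tokens.
returns = [1.0, 0.0, 0.0, 1.0]
trajectory_lengths = [5, 7, 6, 8]

# The per-token signal is just the trajectory's return repeated at
# every position: no token-level distinction between the good steps
# and the wrong alleys within the same attempt.
per_token_signal = [[r] * n for r, n in zip(returns, trajectory_lengths)]

print(per_token_signal[0])  # every token of attempt 0 gets 1.0
print(per_token_signal[1])  # every token of attempt 1 gets 0.0
```

One bit of information at the end of each rollout is being spread across every token that produced it, which is why the resulting gradient estimate is so noisy.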