Andrej Karpathy
I'm going to tell you at every single step of the way how well you're doing.
And this is basically the reason we don't have that.
It's tricky how you do that properly because you have partial solutions and you don't know how to assign credit.
So when you get the right answer, it's just an equality match to the answer.
Very simple to implement.
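A minimal sketch of that kind of outcome reward, in Python (the function name and the whitespace normalization are just illustrative assumptions):

    def outcome_reward(model_answer: str, reference_answer: str) -> float:
        # Outcome supervision: full reward only when the final answer matches the
        # reference exactly (after trivial normalization); no partial credit.
        return 1.0 if model_answer.strip() == reference_answer.strip() else 0.0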
If you're doing basically process supervision, how do you assign, in an automatable way, partial credit assignment?
It's not obvious how you do it.
Lots of labs, I think, are trying to do it with these LLM judges.
So basically, you get LLMs to try to do it.
So you prompt an LLM, hey, look at a partial solution of a student.
How well do you think they're doing if the answer is this?
And they try to tune the prompt.
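A rough sketch of what such an LLM-judge reward could look like (the prompt wording and the call_llm helper are assumptions for illustration, not any lab's actual setup):

    JUDGE_PROMPT = (
        "You are grading a student's partial solution.\n"
        "Problem: {problem}\n"
        "Reference answer: {answer}\n"
        "Partial solution so far: {partial}\n"
        "On a scale from 0 to 1, how well is the student doing? Reply with a single number."
    )

    def judge_reward(problem: str, answer: str, partial: str, call_llm) -> float:
        # call_llm is an assumed callable that sends a prompt to the judge LLM and
        # returns its text reply; swap in whatever client you actually use.
        reply = call_llm(JUDGE_PROMPT.format(problem=problem, answer=answer, partial=partial))
        try:
            return min(max(float(reply.strip()), 0.0), 1.0)  # clamp score to [0, 1]
        except ValueError:
            return 0.0  # unparseable judge output earns no credit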
The reason that I think this is kind of tricky is quite subtle.
And it's the fact that anytime you use an LLM to assign a reward, those LLMs are giant things with billions of parameters and they're gameable.
And if you're reinforcement learning with respect to them, you will find adversarial examples for your LLM judges almost guaranteed.
You can't do this for too long.
You do maybe 10 or 20 steps and it might work, but you can't do 100 or 1,000. It's not obvious how, but basically the model will find little cracks; it will find all these spurious things in the nooks and crannies of the giant model and find a way to cheat it.
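To make that failure mode concrete, here is a toy sketch: a trivially simple scorer stands in for the LLM judge (a deliberate oversimplification), and blind random search stands in for the RL optimizer; the search quickly maxes out the reward with meaningless text.

    import random

    def toy_judge(solution: str) -> float:
        # Stand-in for an LLM judge: a gameable scorer that rewards text
        # containing tokens it associates with "good reasoning".
        score = 0.0
        if "therefore" in solution:
            score += 0.5
        if "=" in solution:
            score += 0.5
        return score

    # Blind search plays the role of the optimizer: within a few hundred tries
    # it produces gibberish that earns the maximum score.
    vocab = ["foo", "bar", "therefore", "=", "x", "qed", "lorem"]
    best, best_score = "", -1.0
    for _ in range(2000):
        candidate = " ".join(random.choices(vocab, k=12))
        score = toy_judge(candidate)
        if score > best_score:
            best, best_score = candidate, score
    print(best_score, repr(best))  # full reward for a meaningless string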
So one example that's prominent in my mind is, I think this was probably public, but basically, if you're using an LLM judge for a reward, you just give it a solution from a student and ask it whether the student got it right or not,