Dwarkesh Patel

Andrej Karpathy — AGI is still a decade away

You're basically training the LLM to be a prompt injection model.

Andrej Karpathy — AGI is still a decade away

So to the extent you think this is the bottleneck to making RL more functional, then that will require making LLMs better judges if you want to do this in an automated way.

2886.803 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And then so is it just going to be like some sort of GAN-like approach where you had to train models to be more robust?

2896.063 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Interesting.

2936.062 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Do you have some shape of what the other idea could be?

2936.743 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

2977.287 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I guess I see a very, not easy, but like I can conceptualize how you would be able to train on synthetic examples or synthetic problems that you have made for yourself.

2977.888 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But there seems to be another thing humans do, maybe sleep is this, maybe daydreaming is this, which is not necessarily come up with fake problems, but just like reflect.

2988.439 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

2998.15 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I'm not sure what the ML analogy for, you know, daydreaming or sleeping, but just like just reflecting.

2998.27 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I haven't come up with a new problem.

3003.156 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, yeah.

3004.037 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, obviously, the very basic analogy would just be fine-tuning on reflection bits, but I feel like in practice that probably wouldn't work that well.

3004.237 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I don't know if you have some take on what the analogy of this thing is.

3011.114 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Just to make sure I understood, the reason that the collapse is relevant to synthetic data generation is because you want to be able to come up with synthetic problems or reflections which are not already in your data distribution?

3138.815 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I guess what I'm saying is...

3151.818 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You can't just keep scaling, quote-unquote, reflection on the same amount of prompt information and then get returns from that.

3163.577 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Have you seen this super interesting paper that dreaming is a way of preventing this kind of overfitting and collapse?

3218.667 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

That the reason dreaming is an evolutionary adaptive is to put you in weird situations that are very unlike your day-to-day reality to prevent this kind of overfitting?

3225.337 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

This is a very ill-formed thought, so I'll just put it out and let you react to it.

3265.833 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment