Andrej Karpathy

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And then there's all kinds of little insights peppered in on how to do it properly.

Yeah, the data engine is what I call the almost biological-feeling process by which you perfect the training sets for these neural networks.

Lex Fridman Podcast

So because most of the programming now is at the level of these data sets, and making sure they're large, diverse, and clean, basically you have a data set that you think is good.

You train your neural net, you deploy it, and then you observe how well it's performing.

And you're trying to always increase the quality of your data set.

So you're trying to catch scenarios that are basically rare.

And it is in these scenarios that neural nets will typically struggle, because they weren't told what to do in those rare cases in the data set.

But now you can close the loop: if you can collect all of those at scale, you can feed them back into the reconstruction process I described, reconstruct the truth in those cases, and add it to the dataset.

And so the whole thing ends up being like a staircase of improvement, of perfecting your training set.

And you have to go through deployments so that you can mine the parts that are not yet represented well in the dataset.
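
The loop described here (train, deploy, observe, mine the rare cases, reconstruct the truth, add it back) can be sketched in a few lines of Python. This is only a toy illustration under loud assumptions, not Tesla's actual pipeline: the "model" just memorizes labeled scenarios, and `reconstruct_truth` is a hypothetical stand-in for the reconstruction process mentioned above. The scenario names, labels, and all function names are made up for the example.

```python
from typing import Callable, Dict, List, Optional, Tuple

Scenario = str
Label = str


def train(dataset: Dict[Scenario, Label]) -> Callable[[Scenario], Optional[Label]]:
    """Toy 'training': the model only knows scenarios present in the dataset."""
    learned = dict(dataset)
    return lambda scenario: learned.get(scenario)


def mine_failures(model: Callable[[Scenario], Optional[Label]],
                  observed: List[Scenario]) -> List[Scenario]:
    """Deployment step: collect observed scenarios the model cannot handle."""
    failures: List[Scenario] = []
    for scenario in observed:
        if model(scenario) is None and scenario not in failures:
            failures.append(scenario)
    return failures


def data_engine(seed: Dict[Scenario, Label],
                deployments: List[List[Scenario]],
                reconstruct_truth: Callable[[Scenario], Label],
                ) -> Tuple[Dict[Scenario, Label], List[int]]:
    """Run one train/deploy/mine/label cycle per deployment.

    Returns the final dataset and its size after each cycle.
    """
    dataset = dict(seed)
    sizes = [len(dataset)]
    for observed in deployments:
        model = train(dataset)                          # train on the current dataset
        for scenario in mine_failures(model, observed):
            dataset[scenario] = reconstruct_truth(scenario)  # close the loop
        sizes.append(len(dataset))
    return dataset, sizes


# Hypothetical ground truth, standing in for the reconstruction process.
truth = {
    "clear highway": "keep lane",
    "stopped truck on shoulder": "move over",
    "debris in lane": "slow down",
}
seed = {"clear highway": "keep lane"}
deployments = [
    ["clear highway", "stopped truck on shoulder"],
    ["clear highway", "stopped truck on shoulder", "debris in lane"],
]

final_dataset, sizes = data_engine(seed, deployments, truth.__getitem__)
print(sizes)  # dataset size after each cycle
```

Each deployment surfaces only the scenarios the current model has never seen, so the dataset grows one step at a time, which is the staircase shape described above.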

So your data set is basically imperfect.