Andrej Karpathy

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

So people are scaling up the data sets, making them much, much bigger.

2779.622 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

They're working on the evaluation, making the evaluation much, much bigger.

2782.025 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And they're basically keeping the architecture unchanged.

2785.188 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And that's the last five years of progress in AI, kind of.

2789.472 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

Basically, the way GPT is trained is you just download a massive amount of text data from the internet, and you try to predict the next word in the sequence, roughly speaking.

2819.607 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

You're predicting little word chunks, but roughly speaking, that's it.

2828.44 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And what's been really interesting to watch is, basically, it's a language model.

2832.747 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

Language models have actually existed for a very long time.

2836.893 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

There's papers on language modeling from 2003, even earlier.

2840.212 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

Yeah, so language model, just basically the rough idea is just predicting the next word in a sequence, roughly speaking.

2846.462 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

So there's a paper from, for example, Benjio and the team from 2003, where for the first time they were using a neural network to take, say, like three or five words and predict the next word.

2853.513 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And they're doing this on much smaller data sets.

2865.372 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And the neural net is not a transformer.

2867.436 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

It's a multi-layer perceptron.