Menu
Sign In Search Podcasts Libraries Charts People & Topics Add Podcast API Blog Pricing

Andrej Karpathy

πŸ‘€ Speaker
3433 total appearances

Appearances Over Time

Podcast Appearances

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

So people are scaling up the data sets, making them much, much bigger.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

They're working on the evaluation, making the evaluation much, much bigger.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And they're basically keeping the architecture unchanged.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And that's the last five years of progress in AI, kind of.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

Basically, the way GPT is trained is you just download a massive amount of text data from the internet, and you try to predict the next word in the sequence, roughly speaking.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

You're predicting little word chunks, but roughly speaking, that's it.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And what's been really interesting to watch is, basically, it's a language model.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

Language models have actually existed for a very long time.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

There's papers on language modeling from 2003, even earlier.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

Yeah, so language model, just basically the rough idea is just predicting the next word in a sequence, roughly speaking.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

So there's a paper from, for example, Benjio and the team from 2003, where for the first time they were using a neural network to take, say, like three or five words and predict the next word.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And they're doing this on much smaller data sets.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And the neural net is not a transformer.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

It's a multi-layer perceptron.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

But it's the first time that a neural network has been applied in that setting.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

But even before neural networks, there were language models, except they were using n-gram models.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

So n-gram models are just count-based models.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

So if you start to take two words and predict a third one, you just count up how many times you've seen any two-word combinations and what came next.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And what you predict as coming next is just what you've seen the most of in the training set.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And so language modeling has been around for a long time.