Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Andrej Karpathy

๐Ÿ‘ค Speaker
3419 total appearances

Appearances Over Time

Podcast Appearances

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

But it's the first time that a neural network has been applied in that setting.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

But even before neural networks, there were language models, except they were using n-gram models.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

So n-gram models are just count-based models.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

So if you start to take two words and predict a third one, you just count up how many times you've seen any two-word combinations and what came next.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And what you predict as coming next is just what you've seen the most of in the training set.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And so language modeling has been around for a long time.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

Neural networks have done language modeling for a long time.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

So really what's new or interesting or exciting is just realizing that when you scale it up,

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

with a powerful enough neural net, a transformer, you have all these emergent properties where basically what happens is if you have a large enough data set of text,

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

You are in the task of predicting the next word.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

You are multitasking a huge amount of different kinds of problems.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

You are multitasking understanding of, you know, chemistry, physics, human nature.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

Lots of things are sort of clustered in that objective.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

It's a very simple objective, but actually you have to understand a lot about the world to make that prediction.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

Yeah, so basically it gets a thousand words and it's trying to predict the thousandth and first.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And in order to do that very, very well over the entire data set available on the internet, you actually have to basically kind of understand the context of what's going on in there.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And it's a sufficiently hard problem that if you have a powerful enough computer, like a transformer, you end up with interesting solutions.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And you can ask it to do all kinds of things and

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

It shows a lot of emergent properties, like in-context learning.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

That was the big deal with GPT and the original paper when they published it, is that you can just sort of prompt it in various ways and ask it to do various things, and it will just kind of complete the sentence.