Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Andrej Karpathy

๐Ÿ‘ค Speaker
3419 total appearances

Appearances Over Time

Podcast Appearances

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

Currently, it definitely looks like the transformer is taking over AI, and you can feed basically arbitrary problems into it.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And it's a general, differentiable computer, and it's extremely powerful.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And this convergence in AI has been really interesting to watch for me personally.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

Definitely the zeitgeist today is just pushing.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

Basically, right now, the zeitgeist is do not touch the transformer.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

Touch everything else.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

So people are scaling up the data sets, making them much, much bigger.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

They're working on the evaluation, making the evaluation much, much bigger.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And they're basically keeping the architecture unchanged.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And that's the last five years of progress in AI, kind of.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

Basically, the way GPT is trained is you just download a massive amount of text data from the internet, and you try to predict the next word in the sequence, roughly speaking.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

You're predicting little word chunks, but roughly speaking, that's it.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And what's been really interesting to watch is, basically, it's a language model.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

Language models have actually existed for a very long time.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

There's papers on language modeling from 2003, even earlier.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

Yeah, so language model, just basically the rough idea is just predicting the next word in a sequence, roughly speaking.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

So there's a paper from, for example, Benjio and the team from 2003, where for the first time they were using a neural network to take, say, like three or five words and predict the next word.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And they're doing this on much smaller data sets.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And the neural net is not a transformer.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

It's a multi-layer perceptron.