Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Andrej Karpathy

๐Ÿ‘ค Speaker
3419 total appearances

Appearances Over Time

Podcast Appearances

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

If it was too grand, it would overpromise and then underdeliver potentially.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

So you want to just meme your way to greatness.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

You want to have a general purpose computer that you can train on arbitrary problems, like say the task of next word prediction or detecting if there's a cat in an image or something like that.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And you want to train this computer, so you want to set its weights.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And I think there's a number of design criteria that sort of overlap in the transformer simultaneously that made it very successful.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And I think the authors were kind of deliberately trying to make this a really powerful architecture.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And so basically, it's very powerful in the forward pass because it's able to express...

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

very general computation as sort of something that looks like message passing.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

You have nodes and they all store vectors.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And these nodes get to basically look at each other and each other's vectors.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And they get to communicate.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And basically nodes get to broadcast, hey, I'm looking for certain things.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And then other nodes get to broadcast, hey, these are the things I have.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

Those are the keys and the values.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

So it's not just attention.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

Yeah, exactly.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

Transformer is much more than just the attention component.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

It's got many pieces architectural that went into it.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

The residual connection, the way it's arranged, there's a multi-layer perceptron in there, the way it's stacked, and so on.

Lex Fridman Podcast
#333 โ€“ Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

But basically, there's a message passing scheme where nodes get to look at each other, decide what's interesting, and then update each other.