Menu
Sign In Search Podcasts Libraries Charts People & Topics Add Podcast API Blog Pricing

Andrej Karpathy

πŸ‘€ Speaker
3433 total appearances

Appearances Over Time

Podcast Appearances

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And so basically, it's very powerful in the forward pass because it's able to express...

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

very general computation as sort of something that looks like message passing.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

You have nodes and they all store vectors.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And these nodes get to basically look at each other and each other's vectors.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And they get to communicate.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And basically nodes get to broadcast, hey, I'm looking for certain things.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And then other nodes get to broadcast, hey, these are the things I have.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

Those are the keys and the values.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

So it's not just attention.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

Yeah, exactly.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

Transformer is much more than just the attention component.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

It's got many pieces architectural that went into it.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

The residual connection, the way it's arranged, there's a multi-layer perceptron in there, the way it's stacked, and so on.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

But basically, there's a message passing scheme where nodes get to look at each other, decide what's interesting, and then update each other.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And so I think when you get to the details of it, I think it's a very expressive function.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

So it can express lots of different types of algorithms in forward pass.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

Not only that, but the way it's designed with the residual connections, layer normalizations, the softmax attention and everything, it's also optimizable.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

This is a really big deal because there's lots of computers that are powerful that you can't optimize or that are not easy to optimize using the techniques that we have, which is backpropication and gradient descent.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

These are first-order methods, very simple optimizers, really.

Lex Fridman Podcast
#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And so...