Andrej Karpathy

👤 Speaker

3433 total appearances

Appearances Over Time

Podcast Appearances

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

You also need it to be optimisable.

2564.963 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And then lastly, you want it to run efficiently in our hardware.

2566.905 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

Our hardware is a massive throughput machine like GPUs.

2570.349 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

They prefer lots of parallelism.

2574.433 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

So you don't want to do lots of sequential operations.

2576.896 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

You want to do a lot of operations serially.

2578.678 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And the Transformer is designed with that in mind as well.

2580.66 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And so it's designed for our hardware and it's designed to both be very expressive in a forward pass, but also very optimisable in the backward pass.

2583.023 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

Right.

2603.361 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

Think of it as, so basically a transformer is a series of blocks, right?

2604.082 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And these blocks have attention and a little multi-layer perceptron.

2609.627 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And so you go off into a block and you come back to this residual pathway.

2612.369 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And then you go off and you come back.

2616.092 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And then you have a number of layers arranged sequentially.

2617.233 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And so the way to look at it, I think, is because of the residual pathway in the backward pass, the gradients sort of flow along it uninterrupted because addition distributes the gradient equally to all of its branches.

2619.655 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

So the gradient from the supervision at the top just floats directly to the first layer.

2632.108 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And all the residual connections are arranged so that in the beginning during initialization, they contribute nothing to the residual pathway.

2637.733 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

Mm-hmm.

2644.24 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

So what it kind of looks like is, imagine the transformer is kind of like a Python function, like a dev.

2644.36 View full episode →

Lex Fridman Podcast

#333 – Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI

And you get to do various kinds of lines of code.

2651.791 View full episode →

← Previous Page 92 of 172 Next →

Report any issue