Andrej Karpathy
We don't have an equivalent of that in large language models.
And that's, to me, more adjacent to when you talk about continual learning and so on as absent.
These models don't really have this distillation phase of taking what happened, analyzing it, obsessively thinking through it, basically doing some kind of a synthetic data generation process and distilling it back into the weights, and maybe having a specific neural net per person.
Maybe it's a LoRA, it's not a full...
Yeah, it's not a full-weight neural network.
It's just that a small, sparse subset of the weights is changed.
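The idea of changing only a small subset of weights per person can be sketched in a few lines. This is a minimal, illustrative LoRA-style setup (an assumption on my part, not code from the conversation): the base weight matrix `W` stays frozen, and only two small low-rank factors `A` and `B` would be trained, so the per-person "adapter" is tiny compared to the full model.

```python
import numpy as np

d, r = 8, 2  # hidden dim and low rank (toy sizes for illustration)
rng = np.random.default_rng(0)

W = rng.standard_normal((d, d))         # frozen base weight
A = rng.standard_normal((d, r)) * 0.01  # trainable low-rank factor
B = np.zeros((r, d))                    # zero-init so the delta starts at 0

def forward(x):
    # Effective weight is W + A @ B; only A and B would receive gradients.
    return x @ (W + A @ B)

x = rng.standard_normal((1, d))
# With B = 0, the adapted model matches the frozen base model exactly.
assert np.allclose(forward(x), x @ W)
# The adapter holds far fewer parameters than the base matrix.
assert A.size + B.size < W.size
```

The design point is that the adapter's parameter count scales with `d * r` rather than `d * d`, which is what makes a per-individual set of weights plausible.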
But basically, we do want to create ways of creating these individuals that have very long contexts.
It's not only about keeping everything in the context window, because the context windows grow very, very long.
Maybe we have some very elaborate sparse attention over it.
But I still think that humans obviously have some process for distilling some of that knowledge into the weights.
We're missing it.
And I do also think that humans have some kind of a very elaborate sparse attention scheme, which I think we're starting to see some early hints of.
So DeepSeek v3.2 just came out, and I saw that they have like a sparse attention as an example.
And this is one way to have very, very long context windows.
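One simple way to see how sparse attention enables very long context windows is a top-k scheme: each query attends only to the k highest-scoring keys instead of all of them. This is a toy illustration of the general idea, not DeepSeek V3.2's actual mechanism (their design and the function below are my assumptions for demonstration):

```python
import numpy as np

def sparse_topk_attention(q, K, V, k=8):
    """Attend only to the top-k keys by score.

    A toy stand-in for sparse attention over long contexts: of T
    cached key/value positions, only k are touched per query.
    """
    scores = K @ q / np.sqrt(q.shape[0])    # (T,) scaled dot-product scores
    idx = np.argpartition(scores, -k)[-k:]  # indices of the top-k keys
    w = np.exp(scores[idx] - scores[idx].max())
    w /= w.sum()                            # softmax over just k entries
    return w @ V[idx]                       # weighted sum of k values

rng = np.random.default_rng(1)
T, d = 1024, 16                     # long context, small head dim
K = rng.standard_normal((T, d))
V = rng.standard_normal((T, d))
q = rng.standard_normal(d)

out = sparse_topk_attention(q, K, V, k=8)  # reads 8 of 1024 positions
```

Dense attention costs O(T) per query here; the sparse variant's softmax and value mixing cost O(k), which is what lets the context window T grow very large.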
So I almost feel like we are redoing a lot of the cognitive tricks that evolution came up with through a very different process.
But we're, I think, going to converge on a similar architecture cognitively.
Well, the way I like to think about it is, okay, let's apply translation invariance in time, right?
So 10 years ago, where were we?
2015, we had convolutional neural networks primarily.