Andrej Karpathy
Residual networks just came out.
So remarkably similar, I guess, but quite a bit different still.
I mean, Transformer was not around.
You know, all these sort of like more modern tweaks on the Transformer were not around.
So maybe some of the things that we can bet on, I think, by a sort of translational equivariance over 10 years, is that we're still training giant neural networks with a forward pass, a backward pass, and an update through gradient descent.
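The "forward, backward, update" loop he's betting on can be sketched in a few lines. This is a minimal illustration on a toy linear-regression problem, not anything from the conversation; the data, learning rate, and step count are all assumed example values.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy data: learn y = 2x + 1 with a single linear neuron.
X = rng.uniform(-1, 1, size=(64, 1))
y = 2.0 * X + 1.0

w = np.zeros((1, 1))
b = np.zeros(1)
lr = 0.5  # learning rate (assumed example value)

for step in range(200):
    # Forward pass: predictions and mean-squared-error loss.
    pred = X @ w + b
    loss = np.mean((pred - y) ** 2)

    # Backward pass: gradients of the loss w.r.t. the parameters.
    grad_pred = 2.0 * (pred - y) / len(X)
    grad_w = X.T @ grad_pred
    grad_b = grad_pred.sum(axis=0)

    # Update: plain gradient descent.
    w -= lr * grad_w
    b -= lr * grad_b
```

The bet is that future systems still reduce to this loop, just at vastly larger scale and with many more refinements stacked on top.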
But maybe it looks a little bit different.
And it's just everything is much bigger.
Actually, recently, I also went back all the way to 1989, which was kind of a fun exercise for me a few years ago, because I was reproducing Yann LeCun's 1989 convolutional network, which was the first neural network I'm aware of trained via gradient descent, like a modern neural network trained with gradient descent, on digit recognition.
And I was just interested in, okay, how can I modernize this?
How much of this is algorithms?
How much of this is data?
How much of this progress is compute and systems?
And I was able to very quickly halve the error, just by time traveling 33 years of algorithmic progress.
So if I time travel just the algorithms back 33 years, I could adjust what Yann LeCun did in 1989, and I could basically halve the error.
But to get further gains, I had to add a lot more data.
I had to 10x the training set.
And then I had to actually add more computational optimizations.
I had to basically train for much longer with dropout and other regularization techniques.
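Dropout, one of the regularization techniques mentioned for training much longer without overfitting, is simple to sketch. This is an illustrative implementation of inverted dropout, not Karpathy's actual code; the keep probability is an assumed example value.

```python
import numpy as np

rng = np.random.default_rng(1)

def dropout(x, p_drop=0.5, training=True):
    """Inverted dropout: randomly zero activations during training,
    scaling the survivors so the expected activation is unchanged."""
    if not training or p_drop == 0.0:
        return x  # at test time, dropout is a no-op
    mask = rng.random(x.shape) >= p_drop
    return x * mask / (1.0 - p_drop)

acts = np.ones((4, 8))        # stand-in layer activations
dropped = dropout(acts, 0.5)  # roughly half zeroed, the rest scaled to 2.0
```

Because the surviving activations are rescaled during training, no change is needed at inference time, which is why it combines cleanly with longer training runs.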
And so it's almost like all these things have to improve simultaneously.
So we're probably going to have a lot more data.