Andrej Karpathy

👤 Speaker
3433 total appearances

Podcast Appearances

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

For example, we're not writing the assembly code because we have compilers, right?

Like, compilers will take my high-level language in C and write the assembly code.

Yeah.

So we're abstracting ourselves very, very slowly.

And there's this thing I call the autonomy slider, where more and more stuff is automated, of the stuff that can be automated at any point in time.

And we're doing a bit less and less and raising ourselves in the layer of abstraction over the automation.

Yeah, maybe the way I would put it is that humans don't use reinforcement learning, as I've said.

I think they do something different, which is, yeah, you experience.

So reinforcement learning is a lot worse than I think the average person thinks.

Reinforcement learning is terrible.

It just so happens that everything that we had before is much worse.

Because previously, we were just imitating people, so it has all these issues.

So in reinforcement learning, say you're solving a math problem.

This is very simple.

You're given a math problem, and you're trying to find a solution.

Now, in reinforcement learning, you will try lots of things in parallel first.

So you're given a problem, you try hundreds of things, different attempts.

And these attempts can be complex, right?

They can be like, oh, let me try this, let me try that, this didn't work, that didn't work, et cetera.
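
The "try hundreds of things in parallel" step described above can be sketched roughly as follows. This is only an illustrative toy, not anything stated in the episode: the equation, the random guessing strategy, and the names `attempt` and `run_attempts` are all hypothetical, and a real system would sample attempts from a model rather than guess integers. Recording whether an attempt reached a correct answer is likewise an assumption added for the sketch.

```python
import random
from concurrent.futures import ThreadPoolExecutor

# Hypothetical toy problem: find an integer x with 3*x + 4 == 19 (so x = 5).
A, B, TARGET = 3, 4, 19


def attempt(seed, max_tries=4):
    """One attempt: try a few guesses, logging what did and did not work."""
    rng = random.Random(seed)  # per-attempt seed keeps the run reproducible
    trace = []
    for _ in range(max_tries):
        x = rng.randint(0, 9)
        if A * x + B == TARGET:
            trace.append(f"try x={x}: worked")
            return {"trace": trace, "answer": x}
        trace.append(f"try x={x}: didn't work")
    return {"trace": trace, "answer": None}


def run_attempts(n=200):
    """Run n independent attempts concurrently and collect them in order."""
    with ThreadPoolExecutor(max_workers=8) as pool:
        return list(pool.map(attempt, range(n)))


attempts = run_attempts()
solved = [a for a in attempts if a["answer"] is not None]
print(f"{len(solved)} of {len(attempts)} attempts found a solution")
```

Each entry in `attempts` keeps its own trace of "this didn't work, that didn't work", which mirrors the point that a single attempt can itself be a multi-step exploration.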