Andrej Karpathy
๐ค SpeakerAppearances Over Time
Podcast Appearances
Like, I don't know, like ranking is kind of AI, right?
At some point, Google was like, even early on, they were thinking of themselves as an AI company doing Google search engine, which I think is totally fair.
And so I kind of see it as a lot more of a continuum than I think other people do, and I don't, it's hard for me to draw the line.
And I kind of feel like, okay, we're now getting a much better autocomplete.
And now we're also getting some agents which are kind of like these loopy things, but they kind of go off rails sometimes.
And what's going on is that the human is progressively doing a bit less and less of the low-level stuff.
For example, we're not writing the assembly code because we have compilers, right?
Like compilers will take my highlight language in C and write the assembly code.
Yeah.
So we're abstracting ourselves very, very slowly.
And there's this what I call autonomy slider of like more and more stuff is automated of the stuff that can be automated at any point in time.
And we're doing a bit less and less and raising ourselves in the layer of abstraction over the automation.
Yeah, maybe the way I would put it is humans don't use reinforcement learning is maybe what I, as I've said it all.
I think they do something different, which is, yeah, you experience.
So reinforcement learning is a lot worse than I think the average person thinks.
Reinforcement learning is terrible.
It just so happens that everything that we had before is much worse.
Because previously, we were just imitating people, so it has all these issues.
So in reinforcement learning, say you're working with, you're solving a math problem.
This is very simple.