Andrej Karpathy
Podcast Appearances
And I think a lot of the reinforcement learning is actually more like motor tasks.
It's not intelligence tasks.
So I actually kind of think humans don't actually really use RL, roughly speaking is what I would say.
A lot of the reinforcement learning, in my perspective, would be things that are a lot more motor-like, simple kinds of tasks, like throwing a hoop, something like that.
But I don't think that humans use reinforcement learning for a lot of intelligence tasks like problem solving and so on.
Interesting.
That doesn't mean we shouldn't do that for research, but I just feel like that's what animals do or don't do.
I think so.
I would agree with you that there's some miraculous compression going on, because obviously the weights of the neural net are not stored in ATCGs.
There's some kind of a dramatic compression, and there's some kind of learning algorithms encoded that take over and do some of the learning online.
So I definitely agree with you on that.
Basically, I would say I'm a lot more kind of like practically minded.
I don't come at it from the perspective of like, let's build animals.
I come from the perspective of like, let's build useful things.
So I have a hard hat on.
And I'm just observing that, look, we're not going to do evolution, because I don't know how to do that.
But it does turn out we can build these ghost spirit-like entities by imitating internet documents.
This works.
And it's actually a way to bring you up to something that has a lot of built-in knowledge and intelligence in some way, similar to maybe what evolution has done.
So that's why I kind of call pre-training this kind of crappy evolution.