Richard Sutton
👤 PersonAppearances Over Time
Podcast Appearances
That, yeah, they get, their lunch gets eaten by the methods that are truly scalable.
Yeah, give me a sense of what the scalable method is.
The scalable method is you learn from experience.
You try things, you see what works.
No one has to tell you.
First of all, you have a goal.
So without a goal, there's no sense of right or wrong or better or worse.
So large language models are trying to get by without having a goal or a sense of better or worse.
That's just, you know, it's exactly starting in the wrong place.
How old are these kids?
It's surprising.
You can have such a different point of view.
When I see kids, I see kids just trying things and waving their hands around and moving their eyes around.
And no one tells them... There's no imitation for how they move their eyes around or even the sounds they make.
They may want to create the same sounds, but the actions, the thing that the...
The large language model is learning from training data.
It's not learning from experience.
It's learning from something that will never be available during its normal life.
There's never any training data that says you should do this action in normal life.
Okay, I shouldn't have said never.