Richard Sutton
👤 PersonAppearances Over Time
Podcast Appearances
Well, yes, I think it's really quite a different point of view.
And it can easily get separated and lose the ability to talk to each other.
And yeah, large language models have become such a big thing.
Generative AI in general, a big thing.
And our field is subject to bandwagons and fashions.
So we lose track of the basic, basic things.
Because I consider reinforcement learning to be basic AI.
And what is intelligence?
The problem is to understand your world.
And reinforcement learning is about understanding your world.
Whereas large language models are about mimicking people, doing what people say you should do.
They're not about figuring out what to do.
I would disagree with most of the things you just said.
Just to mimic what people say is not really to build a model of the world at all, I don't think.
You know, you're mimicking things that have a model of the world, the people.
But I don't want to approach the question in an adversarial way.
But I would question the idea that they have a world model.
So a world model would enable you to predict what would happen.
They have the ability to predict what a person would say.
They don't have the ability to predict what will happen.