Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing

Richard Sutton

👤 Person
505 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

And you agree that large language models don't have goals.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

I think they have a goal.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

What's the goal?

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Next second prediction.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

That's not a goal.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

It doesn't change the world.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

You know, tokens come at you, and if you predict them, you don't influence them.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Yeah, it's not a goal.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

It's not a substantive goal.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

You can't look at a system and say, oh, it has a goal if it's just sitting there predicting and being happy with itself that it's predicting accurately.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Well, the math problems are different.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Making a model of the physical world and carrying out the consequences of mathematical assumptions or operations, those are very different things.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

The empirical world has to be learned.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

You have to learn the consequences.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Whereas the math is more just computational.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

It's more like standard planning.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

So there they can have a goal to find the proof.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

And they are in some way given that goal to find the proof.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

It's an interesting question whether large language models are a case of the bitter lesson.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Because they are clearly a way of using massive computation, things that will scale with computation up to the limits of the internet.