Richard Sutton

505 total appearances

Podcast Appearances

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

But they're also a way of putting in lots of knowledge.

And so this is an interesting question.

It's a sociological or industry question.

Will they reach the limits of the data and be superseded by things that can get more data just from experience rather than from people?

In some ways, it's a classic case of the bitter lesson.

The more human knowledge we put into the large language models, the better they can do.

And so it feels good.

And yet, one, well, I in particular expect there to be systems that can learn from experience, which could well perform much, much better and be much more scalable, in which case it will be another instance of the bitter lesson that the things that used human knowledge were eventually superseded by things that just trained from experience and computation.

Well, in every case of the bitter lesson, you know, you could start with human knowledge.

Right.

And then do the scalable things.

Yeah.

That's always the case.

And there's never any reason why that has to be bad.

Right.

But in fact, and in practice, it has always turned out to be bad.

Because people get locked into the human knowledge approach and they psychologically, or, you know, now I'm speculating why it is, but this is what has always happened.

Yeah.