Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing

Richard Sutton

👤 Person
505 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

What we want, I think, to quote Alan Turing, what we want is a machine that can learn from experience.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Right.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Where experience is the things that actually happen in your life.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

You do things, you see what happens, and that's what you learn from.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

The large language models learn from something else.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

They learn from here's a situation and here's what a person did.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

And implicitly, the suggestion is you should do what the person did.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

No, I agree that it's the large language model perspective.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

I don't think it's a good perspective.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Yeah, curious why.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

So, to be a prior for something, there has to be a real thing.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

I mean, a prior bit of knowledge should be the basis for actual knowledge.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

What is actual knowledge?

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

There's no definition of actual knowledge in that large language framework.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

What makes an action a good action to take?

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

You recognize the value, the need for continual learning.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Right.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

So if you need to learn continually, continually means learning during normal interaction with the world.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Yeah.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

And so then there must be some way during the normal interaction to tell what's right.