
Richard Sutton

👤 Person
505 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

We're not seeing general... Critical to good performance is that you can generalize well from one state to another state. We don't have any methods that are good at that. What we have is people trying different things and settling on a representation that transfers well, or that generalizes well. We have very few automated techniques to promote transfer, and none of them are used in modern deep learning. The researchers did it, because there's no other explanation. Gradient descent will not make you generalize well. It will make you solve the problem; it will not make you generalize to new data in a good way. Generalization means training on one thing affects what you do on other things. We know deep learning is really bad at this. For example, we know that if you train on some new thing, it will often catastrophically interfere with all the old things that you knew. That is exactly bad generalization. Generalization, as I said, is some kind of influence of training on one state on other states. And generalization is not necessarily good or bad.
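The catastrophic interference Sutton describes is easy to reproduce in a toy experiment: train a small network on one region of inputs, then continue training it only on a second region, and its error on the first region climbs. A minimal sketch, not from the episode; the network size, tasks, and learning rate here are arbitrary illustrative choices:

```python
import numpy as np

rng = np.random.default_rng(0)

# Tiny one-hidden-layer network: y = w2 @ tanh(w1 @ x + b1) + b2
n_hidden = 16
w1 = rng.normal(0, 1.0, (n_hidden, 1))
b1 = np.zeros((n_hidden, 1))
w2 = rng.normal(0, 0.1, (1, n_hidden))
b2 = np.zeros((1, 1))

def forward(x):
    h = np.tanh(w1 @ x + b1)            # hidden activations, shape (n_hidden, n)
    return w2 @ h + b2, h               # prediction, shape (1, n)

def sgd_step(x, y, lr=0.05):
    """One full-batch gradient-descent step on mean squared error."""
    global w1, b1, w2, b2
    pred, h = forward(x)
    err = pred - y                      # (1, n)
    n = x.shape[1]
    gw2 = err @ h.T / n
    gb2 = err.mean(axis=1, keepdims=True)
    dh = (w2.T @ err) * (1 - h ** 2)    # backprop through tanh
    gw1 = dh @ x.T / n
    gb1 = dh.mean(axis=1, keepdims=True)
    w2 -= lr * gw2; b2 -= lr * gb2
    w1 -= lr * gw1; b1 -= lr * gb1

def mse(x, y):
    pred, _ = forward(x)
    return float(((pred - y) ** 2).mean())

# Task A: map x in [-2, -1] to +1.  Task B: map x in [1, 2] to -1.
xa = np.linspace(-2, -1, 20).reshape(1, -1); ya = np.ones_like(xa)
xb = np.linspace(1, 2, 20).reshape(1, -1);   yb = -np.ones_like(xb)

for _ in range(3000):                   # first, learn task A
    sgd_step(xa, ya)
mse_a_before = mse(xa, ya)

for _ in range(3000):                   # then train ONLY on task B
    sgd_step(xb, yb)
mse_a_after = mse(xa, ya)               # task-A error after task-B training

print(f"task-A error before B: {mse_a_before:.4f}, after B: {mse_a_after:.4f}")
```

Because the two tasks share every weight and gradient descent on task B pays no attention to task A, the updates that fit B drag the predictions on A away from their targets: training on one region of states influences other regions, and here that influence is harmful.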