Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing

Richard Sutton

👤 Person
505 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Just the fact that you generalize is not necessarily good or bad.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

You can generalize poorly, you can generalize well.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

So generalization always will happen, but we need algorithms that will cause the generalization to be good rather than bad.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Well, large language models, so complex.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

We don't really know what information they had prior.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

We have to guess because they've been fed so much.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

This is one reason why they're not a good way to do science.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

It's just so uncontrolled, so unknown.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

But if you come up with an entirely new... They're getting a bunch of things right, perhaps.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

And so the question is why?

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Well, it may be that they don't need to generalize to get them right because the only way to get some of them right is to form something which gets all of them right.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Right.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

So if there's only one answer and you find it, that's not called generalization.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

It's the only way to solve it, and so they find the only way to solve it.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Generalization is when it could be this way, it could be that way, and they do it the good way.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Well, there's nothing in them which will cause it to generalize well.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Creating dissent will cause them to find a solution to the problems they've seen.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

And if there's only one way to solve them, they'll do that.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

But there are many ways to solve it, some which generalize well, some which generalize poorly.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

There's nothing in them, in the algorithms, that will cause them to generalize well.