Richard Sutton

Well, it may be that they don't need to generalize to get them right because the only way to get some of them right is to form something which gets all of them right.

2337.528 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

Right.

2346.45 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

So if there's only one answer and you find it, that's not called generalization.

2346.55 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

It's the only way to solve it, and so they find the only way to solve it.

2353.299 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

Generalization is when it could be this way, it could be that way, and they do it the good way.

2357.064 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

Well, there's nothing in them which will cause it to generalize well.

2393.868 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

Creating dissent will cause them to find a solution to the problems they've seen.

2399.076 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

And if there's only one way to solve them, they'll do that.

2404.704 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

But there are many ways to solve it, some which generalize well, some which generalize poorly.

2407.688 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

There's nothing in them, in the algorithms, that will cause them to generalize well.

2411.594 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment