Richard Sutton

👤 Speaker

See mentions of this person in podcasts

505 total appearances

Podcast Appearances

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

The content of the knowledge is statements about the stream.

1446.325

And so because it's a statement about the stream, you can test it by comparing it to the stream and you can learn it continually.

1451.23
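
The idea in the two statements above (knowledge as a claim about the stream, checked against the stream and learned continually) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and not something described in the episode: the function name `run_stream`, the synthetic noisy signal whose mean shifts halfway through, and the simple error-driven update.

```python
import random


def run_stream(steps=2000, step_size=0.05, seed=0):
    """Hypothetical example: keep one prediction about a stream and
    update it continually from the mismatch with what actually arrives."""
    rng = random.Random(seed)
    prediction = 0.0  # the current "statement about the stream"
    errors = []
    for t in range(steps):
        # Assumed synthetic stream: a noisy signal whose mean changes midway,
        # so the learner has to keep adapting rather than learn once.
        mean = 1.0 if t < steps // 2 else -1.0
        observation = mean + rng.gauss(0.0, 0.1)

        # Test the statement by comparing it to the stream.
        error = observation - prediction
        # Learn continually: move the prediction toward the observation.
        prediction += step_size * error
        errors.append(abs(error))
    return prediction, errors


if __name__ == "__main__":
    final_prediction, errors = run_stream()
    print(f"final prediction: {final_prediction:.2f}")
    print(f"mean error over last 100 steps: {sum(errors[-100:]) / 100:.3f}")
```

Because the prediction is only ever a claim about upcoming data, no labels are needed: the stream itself supplies the check, and the update keeps tracking the signal after it changes.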

So when you're imagining this future continual learning agent.

1459.158

They're not future.

1463.002

Of course, they exist all the time.

1463.803

This is what the reinforcement learning paradigm is: learning from experience.

1465.805

The reward function is arbitrary.

1486.668

And so if you're playing chess, it's to win the game of chess.

1489.593

If you're a squirrel, maybe the reward has to do with getting nuts.

1494.922

Right.

1501.433

In general, for an animal, you would say the reward is to avoid pain and to acquire pleasure.

1504.338

Right.

1512.252

And I think there should also be a component having to do with your increasing understanding of your environment.

1513.634

That would be sort of an intrinsic motivation.

1525.117
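
The remarks above describe the reward as arbitrary (winning at chess, getting nuts) with a possible extra component for improved understanding of the environment. A rough sketch of that combination follows. The function names, the `beta` weight, and the choice to measure "increasing understanding" as a drop in prediction error are all assumptions made for illustration; the episode does not specify any formula.

```python
def chess_reward(outcome):
    """Hypothetical task reward: 1 for winning the game, 0 otherwise."""
    return 1.0 if outcome == "win" else 0.0


def squirrel_reward(nuts_found):
    """Hypothetical task reward: one unit per nut collected."""
    return float(nuts_found)


def total_reward(extrinsic, error_before, error_after, beta=0.1):
    """Add an assumed intrinsic bonus to whatever the task reward is.

    The bonus is the reduction in the agent's prediction error, used here
    as a stand-in for "increasing understanding of your environment".
    It is clipped at zero so that getting worse is not penalized.
    """
    learning_progress = max(0.0, error_before - error_after)
    return extrinsic + beta * learning_progress


if __name__ == "__main__":
    # Same intrinsic term, two different (arbitrary) task rewards.
    print(total_reward(chess_reward("win"), error_before=0.5, error_after=0.25))
    print(total_reward(squirrel_reward(3), error_before=0.5, error_after=0.25))
```

The task reward can be swapped freely while the intrinsic term stays the same, which is one way to read the claim that the reward function itself is arbitrary.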

I don't like the word model when used the way you just did.

1571.887

I think a better word would be the network.

1575.412

So I think you mean the network.

1578.436

Maybe there are many networks.

1581.46

So anyway, things would be learned and then you'd have copies and many instances.

1583.382

And sure, you'd want to share knowledge across all.

1588.99
