Richard Sutton

👤 Person
505 total appearances

Podcast Appearances

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Yeah, their lunch gets eaten by the methods that are truly scalable.

Yeah, give me a sense of what the scalable method is.

The scalable method is you learn from experience.

You try things, you see what works.

No one has to tell you.

First of all, you have a goal.

So without a goal, there's no sense of right or wrong or better or worse.

So large language models are trying to get by without having a goal or a sense of better or worse.

That's exactly starting in the wrong place.

How old are these kids?

It's surprising.

You can have such a different point of view.

When I see kids, I see kids just trying things and waving their hands around and moving their eyes around.

And no one tells them... There's no imitation for how they move their eyes around or even the sounds they make.

They may want to create the same sounds, but the actions, the thing that the...

The large language model is learning from training data.

It's not learning from experience.

It's learning from something that will never be available during its normal life.

There's never any training data that says you should do this action in normal life.

Okay, I shouldn't have said never.