Richard Sutton

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

That, yeah, they get, their lunch gets eaten by the methods that are truly scalable.

748.326 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

Yeah, give me a sense of what the scalable method is.

754.856 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

The scalable method is you learn from experience.

757.415 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

You try things, you see what works.

761.339 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

No one has to tell you.

765.723 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

First of all, you have a goal.

767.364 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

So without a goal, there's no sense of right or wrong or better or worse.

769.466 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

So large language models are trying to get by without having a goal or a sense of better or worse.

774.031 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

That's just, you know, it's exactly starting in the wrong place.

780.557 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

How old are these kids?

812.482 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

It's surprising.

844.152 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

You can have such a different point of view.

845.353 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

When I see kids, I see kids just trying things and waving their hands around and moving their eyes around.

848.036 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

And no one tells them... There's no imitation for how they move their eyes around or even the sounds they make.

855.924 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

They may want to create the same sounds, but the actions, the thing that the...

864.473 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

The large language model is learning from training data.

898.422 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

It's not learning from experience.

901.488 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

It's learning from something that will never be available during its normal life.

904.633 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

There's never any training data that says you should do this action in normal life.

909.121 View full episode →

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

Okay, I shouldn't have said never.

925.065 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment