Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Richard Sutton

๐Ÿ‘ค Speaker
See mentions of this person in podcasts
505 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Richard Sutton โ€“ Father of RL thinks LLMs are a dead-end

It just sacrifices material for sort of positional advantages.

Dwarkesh Podcast
Richard Sutton โ€“ Father of RL thinks LLMs are a dead-end

And it's just content and patient to sacrifice that material for a long period of time.

Dwarkesh Podcast
Richard Sutton โ€“ Father of RL thinks LLMs are a dead-end

And so that was surprising that it worked so well, but also gratifying and fitting into my worldview.

Dwarkesh Podcast
Richard Sutton โ€“ Father of RL thinks LLMs are a dead-end

Yeah.

Dwarkesh Podcast
Richard Sutton โ€“ Father of RL thinks LLMs are a dead-end

So this has led me where I am.

Dwarkesh Podcast
Richard Sutton โ€“ Father of RL thinks LLMs are a dead-end

Where I am is I'm in some sense a contrarian or thinking differently from the field is.

Dwarkesh Podcast
Richard Sutton โ€“ Father of RL thinks LLMs are a dead-end

And I am personally just kind of content being out of sync with my field for a long period of time, perhaps decades, because occasionally I have improved right in the past.

Dwarkesh Podcast
Richard Sutton โ€“ Father of RL thinks LLMs are a dead-end

And the other thing I do to help me not feel I'm out of sync and thinking in a strange way is to look not at my local environment or my local field, but to look back in time and into history and to see what people have thought classically about the mind in many different fields.

Dwarkesh Podcast
Richard Sutton โ€“ Father of RL thinks LLMs are a dead-end

And I don't feel I'm out of sync with the larger traditions.

Dwarkesh Podcast
Richard Sutton โ€“ Father of RL thinks LLMs are a dead-end

I really view myself as a classicist rather than as a contrarian.

Dwarkesh Podcast
Richard Sutton โ€“ Father of RL thinks LLMs are a dead-end

I go to what the larger community of thinkers about the mind have always thought.

Dwarkesh Podcast
Richard Sutton โ€“ Father of RL thinks LLMs are a dead-end

You want to presume that it's been done.

Dwarkesh Podcast
Richard Sutton โ€“ Father of RL thinks LLMs are a dead-end

Well, but you're using it to get AGI again.

Dwarkesh Podcast
Richard Sutton โ€“ Father of RL thinks LLMs are a dead-end

So these AGIs, if they're not superhuman already, then the knowledge that they might impart would be not superhuman.

Dwarkesh Podcast
Richard Sutton โ€“ Father of RL thinks LLMs are a dead-end

I'm not sure your idea makes sense because it seems to presume the existence of AGI.

Dwarkesh Podcast
Richard Sutton โ€“ Father of RL thinks LLMs are a dead-end

And then we've already worked that out.

Dwarkesh Podcast
Richard Sutton โ€“ Father of RL thinks LLMs are a dead-end

And the way AlphaZero was an improvement was it did not use the human knowledge, but just went from experience.

Dwarkesh Podcast
Richard Sutton โ€“ Father of RL thinks LLMs are a dead-end

Right.

Dwarkesh Podcast
Richard Sutton โ€“ Father of RL thinks LLMs are a dead-end

So why do you say bring in other agents' expertise to teach it when it's worked so well from experience and not by help from another agent?

Dwarkesh Podcast
Richard Sutton โ€“ Father of RL thinks LLMs are a dead-end

Right.