Dwarkesh Patel

👤 Speaker

15787 total appearances

Podcast Appearances

Dwarkesh Podcast

Richard Sutton – Father of RL thinks LLMs are a dead-end

It explained the last 20 years of gradual innovation and explained how each step made the RL learning process more stable or more sample efficient or more scalable.

2452.637

I asked Deep Research to put all of this together like an Andrej Karpathy-style tutorial.

2463.054

And it did that.

2467.422

What was cool is that it combined this whole lesson together into one coherent, cohesive document in the style that I wanted.

2468.303

It was also great that it assembled all of the best links in the same place so that if I wanted to understand any specific algorithm better, I could just access the right explainer right there.

2475.175

Go to gemini.google.com to try it out yourself.

2484.072

All right, back to Richard.

2488.56

I want to zoom out and ask about being in the field of AI for longer than almost anybody who is commentating on it or working in it now.

2489.802

I'm just curious about what the biggest surprises have been, how much new stuff you feel like is coming out, or does it feel like people are just playing with old ideas?

2501.097

Zooming out, you got into this even before deep learning was popular, so...

2511.971

How do you see this trajectory of this field over time and how new ideas have come about and everything?

2516.617

And what's been surprising?

2522.66

Have there felt like whenever the public conception has been changed because some new technique was... Sorry, some new application was developed.

2636.907

For example, when AlphaZero became this viral sensation, to you as somebody who has...

2646.96

Literally came up with many of the techniques that were used.

2653.388

Did it feel to you like new breakthroughs were made or does it feel like, oh, we've had these techniques since the 90s and people are simply combining them and applying them now?

2655.791

Okay.

2800.86

Some sort of left-field questions for you, if you'll tolerate them.

2801.301

So the way I read the Bitter Lesson is that it's not saying necessarily that human artisanal researcher tuning doesn't work, but that...