Dwarkesh Patel

Speaker
15787 total appearances
Voice ID

Voice Profile Active

This person's voice can be automatically recognized across podcast episodes using AI voice matching.

Voice samples: 4
Confidence: High

Podcast Appearances

Dwarkesh Podcast
Some thoughts on the Sutton interview

It's the abysmal sample efficiency of these models.

It's their dependence on exhaustible human data.

If the LLMs do get to AGI first, which is what I expect to happen, the successor systems that they build will almost certainly be based on Richard's vision.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Today, I'm chatting with Richard Sutton, who is one of the founding fathers of reinforcement learning and inventor of many of the main techniques used there, like TD learning and policy gradient methods.

And for that, he received this year's Turing Award, which, if you don't know, is basically the Nobel Prize for Computer Science.

Richard, congratulations.

Thank you, Dwarkesh.

And thanks for coming on the podcast.

It's my pleasure.

Okay, so the first question is: my audience and I are familiar with the LLM way of thinking about AI.

Conceptually, what are we missing in terms of thinking about AI from the RL perspective?

Huh.

I guess you would think that to emulate the trillions of tokens in the corpus of internet text, you would have to build a world model.

In fact, these models do seem to have very robust world models, and they're the best world models we've made to date in AI, right?

So what do you think that's missing?

Great.

Yeah.

Right.

I guess maybe the crux, and I'm curious if you disagree with this, is some people will say, okay, so...