Menu
Sign In Search Podcasts Libraries Charts People & Topics Add Podcast API Blog Pricing

Dwarkesh Patel

πŸ‘€ Speaker
15787 total appearances
Voice ID

Voice Profile Active

This person's voice can be automatically recognized across podcast episodes using AI voice matching.

Voice samples: 4
Confidence: High

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

No, of course not.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Really?

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Yeah.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

I think kids just like watch people.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

They like kind of try to like say the same words.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

I think the level- What about the first six months?

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

I think they're kind of imitating things.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

They're trying to like make their mouth sound the way they see their mother's mouth sound.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

And then they'll say the same words without understanding what they mean.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

And as they get older, the complexity of the imitation they do increases.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

So you're imitating maybe the skills that your people in your band are using to hunt down the deer or something.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

And then you go into the learning from experience RR regime.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

But I think there's a lot of imitation learning happening with humans.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

uh infant actually does there's no targets for that there are no examples for that I agree that doesn't explain everything infants do but I think it guides the learning process I mean even uh llm when it's trying to predict the next token early in training it will like make a guess it'll be different from what like it actually sees and in some sense it's like very short horizon RL where it's like making this guess of like I think this token will be this it's actually the other thing similar to how a kid will try to say a word it comes out wrong

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

I think this is maybe more of a semantic distinction.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Like, what do you call school?

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Is that not training data?

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

You're not going to school because it's like... School is much later.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

You shouldn't base your theories on that.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

But the idea of having phases of learning where... I think you're just sort of programming your biology that early on you're not that useful.