Menu
Sign In Search Podcasts Libraries Charts People & Topics Add Podcast API Blog Pricing

Dwarkesh Patel

πŸ‘€ Speaker
15787 total appearances
Voice ID

Voice Profile Active

This person's voice can be automatically recognized across podcast episodes using AI voice matching.

Voice samples: 4
Confidence: High

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

is in his view that you just have to imitate your elders in order to learn that skill because you can't think your way through how to hunt and kill and process a seal.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

You have to just watch other people maybe make tweaks and adjustments.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

And that's how cultural knowledge accumulates.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

But the initial step of the cultural gain has to be imitation.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

But maybe you think about it a different way.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

I do think you make a very interesting point that continual learning is a capability that most mammals have.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

I guess all mammals have.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

So it's quite interesting that we have something that all mammals have, but our AI systems don't have, right?

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Whereas maybe like the ability to understand math and solve difficult math problems depends on how you define math.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

But like this is a capability our AIs have, but that almost no animal has.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

And so it's quite interesting what ends up being difficult and what ends up being easy.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Perilics.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

That's right.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

For the era of experience to commence, we're going to need to train AIs in complex real-world environments.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

But building effective RL environments is hard.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

You can't just hire a software engineer and have them write a bunch of cookie-cutter validation tests.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Real-world domains are messy.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

You need deep subject matter experts to get the data, the workflows, and all the subtle rules right.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

When one of Labelbox's customers wanted to train an agent to shop online, Labelbox assembled a team with a ton of experienced engineering internet storefronts.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

For example, the team built a product catalog that could be updated during the episode because most shopping sites have constantly changing state.