Menu
Sign In Search Podcasts Libraries Charts People & Topics Add Podcast API Blog Pricing

Dwarkesh Patel

πŸ‘€ Speaker
15787 total appearances
Voice ID

Voice Profile Active

This person's voice can be automatically recognized across podcast episodes using AI voice matching.

Voice samples: 4
Confidence: High

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

They also added a Redis cache to simulate stale data since that's how real e-commerce sites actually work.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

These are the kinds of things that you might not have naively thought to do, but that Labelbox can anticipate.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

These details really matter.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Small tweaks are often the difference between cool demos and agents that can actually operate in the real world.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

So whether it's correcting traces that you already produced or building an entirely new suite of environments, Labelbox can help you turn your RL projects into working systems.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Reach out at labelbox.com slash dvarkash.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

All right, back to Richard.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

This alternative paradigm that you're imagining.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

The experiential paradigm.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Yes.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Yeah.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

I guess maybe what I meant to say is human level general continual learning agent.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Yeah.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

What is the reward function?

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Is it just predicting the world?

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

Is it then having a specific effect on it?

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

What would the general reward function be?

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

I see.

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

I guess this AI would be deployed...

Dwarkesh Podcast
Richard Sutton – Father of RL thinks LLMs are a dead-end

to like lots of people would want it to be doing lots of different kinds of things.