Dwarkesh Patel

Speaker
15656 total appearances

Podcast Appearances

Dwarkesh Podcast
Some thoughts on the Sutton interview

Now, these are ground truth examinations.

Can you solve this unseen Math Olympiad question?

Can you build this application to match the specific feature request?

But you couldn't have RL'd a model to accomplish these tasks from scratch, or at least we don't know how to do that yet.

You needed a reasonable prior over human data in order to kickstart this RL process.

Whether you want to call this prior a proper world model or just a model of humans, I don't think is that important.

It honestly seems like a semantic debate.

Because what you really care about is whether this model of humans helps you start learning from ground truth, aka become a true world model.

It's a bit like saying to somebody pasteurizing milk, "Hey, you should stop boiling that milk because eventually you want to serve it cold."

Of course, but this is an intermediate step to facilitate the final output.

By the way, LLMs are clearly developing a deep representation of the world because their training process is incentivizing them to develop one.

I use LLMs to teach me about everything from biology to AI to history, and they are able to do so with remarkable flexibility and coherence.

Now, are LLMs specifically trained to model how their actions will affect the world?

No, they are not.

But if we're not allowed to call their representations a world model, then we're defining the term world model by the process that we think is necessary to build one, rather than the obvious capabilities that this concept implies.

Okay, continual learning.

I'm sorry to bring up my hobby horse again.

I'm like a comedian who has only come up with one good bit, but I'm going to milk it for all it's worth.