Dwarkesh Patel

👤 Speaker

15656 total appearances

Voice ID

Voice Profile Active

This person's voice can be automatically recognized across podcast episodes using AI voice matching.

Voice samples: 1

Confidence: Medium

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast

Some thoughts on the Sutton interview

The agent is in no substantial way learning from organic and self-directed engagement with the world.

83.401 View full episode →

Dwarkesh Podcast

Some thoughts on the Sutton interview

Having to learn only from human data, which is an inelastic and hard to scale resource, is not a scalable way to use compute.

89.452 View full episode →

Dwarkesh Podcast

Some thoughts on the Sutton interview

Furthermore, what these LLMs learn from training is not a true world model, which would tell you how the environment changes in response to different actions that you take.

98.61 View full episode →

Dwarkesh Podcast

Some thoughts on the Sutton interview

Rather, they're building a model of what a human would say next.

108.709 View full episode →

Dwarkesh Podcast

Some thoughts on the Sutton interview

And this leads them to rely on human-derived concepts.

111.895 View full episode →

Dwarkesh Podcast

Some thoughts on the Sutton interview

A way to think about this would be, suppose you trained an LLM on all the data up to the year 1900.

115.703 View full episode →

Dwarkesh Podcast

Some thoughts on the Sutton interview

That LLM probably wouldn't be able to come up with relativity from scratch.

120.774 View full episode →

Dwarkesh Podcast

Some thoughts on the Sutton interview

And maybe here's a more fundamental reason to think this whole paradigm will eventually be superseded.

125.525 View full episode →

Dwarkesh Podcast

Some thoughts on the Sutton interview

LLMs aren't capable of learning on the job, so we'll need some new architecture to enable this kind of continual learning.

131.538 View full episode →

Dwarkesh Podcast

Some thoughts on the Sutton interview

And once we do have this architecture, we won't need a special training phase.

138.607 View full episode →

Dwarkesh Podcast

Some thoughts on the Sutton interview

The agents will just be able to learn on the fly, like all humans, and in fact, like all animals are able to do.

143.273 View full episode →

Dwarkesh Podcast

Some thoughts on the Sutton interview

And this new paradigm will render our current approach with LLMs and their special training phase that's super sample and efficient totally obsolete.

149.62 View full episode →

Dwarkesh Podcast

Some thoughts on the Sutton interview

So that's my understanding of Rich's position.

159.095 View full episode →

Dwarkesh Podcast

Some thoughts on the Sutton interview

My main difference with Rich is just that I don't think the concepts he's using to distinguish LLMs from true intelligence or animal intelligence are actually that mutually exclusive or dichotomous.

162.079 View full episode →

Dwarkesh Podcast

Some thoughts on the Sutton interview

For example, I think imitation learning is continuous with and complementary to RL.

173.172 View full episode →

Dwarkesh Podcast

Some thoughts on the Sutton interview

And relatedly, models of humans can give you a prior which facilitates learning quote-unquote true world models.

179.159 View full episode →

Dwarkesh Podcast

Some thoughts on the Sutton interview

I also wouldn't be surprised if some future version of test-time fine-tuning could replicate continual learning, given that we've already managed to accomplish this somewhat with in-context learning.

186.989 View full episode →

Dwarkesh Podcast

Some thoughts on the Sutton interview

So let's start with my claim that imitation learning is continuous with and complementary to RL.

197.579 View full episode →

Dwarkesh Podcast

Some thoughts on the Sutton interview

So I tried to ask Richard a couple of times whether free-trained LLMs can serve as a good prior on which we can accumulate the experiential learning, aka do the RL, which would lead to AGI.

202.584 View full episode →

Dwarkesh Podcast

Some thoughts on the Sutton interview

So Ilya Seskovor gave a talk a couple months ago that I thought was super interesting, and he compared pre-training data to fossil fuels.

214.276 View full episode →

← Previous Page 209 of 783 Next →

Report any issue

Dwarkesh Patel

Voice Profile Active

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment