Dwarkesh Patel

Speaker
15785 total appearances
Voice ID

Voice Profile Active

This person's voice can be automatically recognized across podcast episodes using AI voice matching.

Voice samples: 4
Confidence: High

Podcast Appearances

Dwarkesh Podcast
Fully autonomous robots are much closer than you think – Sergey Levine

So it sounds like the first fun thing to do is probably to start looking at what an order book actually looks like.

If this sounds interesting to you, you should consider working at Hudson River Trading.

I was talking to this researcher, Sander, at GDM, and he works on video and audio models.

And he made the interesting point that the reason, in his view, we aren't seeing that much transfer learning between different modalities (that is to say, training a language model on video and images doesn't seem to necessarily make it that much better at textual learning questions and tasks) is that images are represented at a different semantic level than text.

And so his argument is that text has this high-level semantic representation within the model, whereas images and videos are just like compressed pixels.

There's not really a semantic... When they're embedded, they don't represent some high-level semantic information.

They're just like compressed pixels.

And therefore, there's no transfer learning at the level at which they're going through the model.

And obviously, this is super relevant to the work you're doing because your hope is that by training the model both on the visual data that the robot sees, visual data generally, maybe even from YouTube or whatever eventually, plus language information, plus action information from the robot itself, all of this together will make it generally robust.

And then you had a really interesting blog post about why video models aren't as robust as language models.

Sorry, this is not a super well-formed question.

I just wanted you to react to that.

By the way, the fact that video models aren't as robust, is that bearish for robotics?

Because it will... so much of the data you will have to use will not... I guess some of... you're saying a lot of it will be labeled. But, like, ideally you just want to be able to, like, throw all of everything on YouTube, every video we ever recorded, and have it learn how the physical world works and how to, like, move about, et cetera, just see humans performing tasks and learn from that.

But if, yeah, I guess you're saying like it's hard to learn just from that and it actually needs to practice the task itself.

Famously, LLMs have all these emergent capabilities that were never engineered in, because somewhere in internet text is the data to train it and to give it the knowledge to do a certain kind of thing.

With robots, it seems like you are collecting all the data manually.

So there won't be this mysterious new capability that like is somewhere in the data set that you haven't purposefully collected, which seems like it should make it even harder to then have