Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Sergey Levine

๐Ÿ‘ค Speaker
700 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Fully autonomous robots are much closer than you think โ€“ Sergey Levine

Yeah.

Dwarkesh Podcast
Fully autonomous robots are much closer than you think โ€“ Sergey Levine

And I think we're seeing the beginnings of that with multimodal models.

Dwarkesh Podcast
Fully autonomous robots are much closer than you think โ€“ Sergey Levine

But I think that multimodality has so much more to it than just like image plus text.

Dwarkesh Podcast
Fully autonomous robots are much closer than you think โ€“ Sergey Levine

And I think that that's a place where there's a lot of room for really exciting innovation.

Dwarkesh Podcast
Fully autonomous robots are much closer than you think โ€“ Sergey Levine

Yeah, how we represent both context, both what happened in the past, and also plans or reasoning, as you can call it in our world, which is what we would like to happen in the future or intermediate processing stages in solving a task.

Dwarkesh Podcast
Fully autonomous robots are much closer than you think โ€“ Sergey Levine

I think doing that in a variety of modalities, including potentially learned modalities that are suitable for the job, is something that has, I think, enormous potential to overcome some of these challenges.

Dwarkesh Podcast
Fully autonomous robots are much closer than you think โ€“ Sergey Levine

Interesting.

Dwarkesh Podcast
Fully autonomous robots are much closer than you think โ€“ Sergey Levine

Yeah, that's a really good question.

Dwarkesh Podcast
Fully autonomous robots are much closer than you think โ€“ Sergey Levine

So I definitely don't know the answer to this.

Dwarkesh Podcast
Fully autonomous robots are much closer than you think โ€“ Sergey Levine

I am not by any means well-versed in neuroscience.

Dwarkesh Podcast
Fully autonomous robots are much closer than you think โ€“ Sergey Levine

But if I had to guess and also provide an answer that leans more on things I know, it's something like this, that the brain is extremely parallel.

Dwarkesh Podcast
Fully autonomous robots are much closer than you think โ€“ Sergey Levine

It kind of has to be just because of the biophysics.

Dwarkesh Podcast
Fully autonomous robots are much closer than you think โ€“ Sergey Levine

But

Dwarkesh Podcast
Fully autonomous robots are much closer than you think โ€“ Sergey Levine

It's even more parallel than your GPU.

Dwarkesh Podcast
Fully autonomous robots are much closer than you think โ€“ Sergey Levine

If you think about how a modern multimodal language model processes the input, if you give it some images and some text, first it reads in the images, then it reads in the text, and then proceeds one token at a time to generate the output.

Dwarkesh Podcast
Fully autonomous robots are much closer than you think โ€“ Sergey Levine

It makes a lot more sense to me for an embodied system to have parallel processes.

Dwarkesh Podcast
Fully autonomous robots are much closer than you think โ€“ Sergey Levine

Now, mathematically, you can actually make close equivalences between parallel and sequential stuff.

Dwarkesh Podcast
Fully autonomous robots are much closer than you think โ€“ Sergey Levine

Like transformers aren't actually fundamentally sequential.

Dwarkesh Podcast
Fully autonomous robots are much closer than you think โ€“ Sergey Levine

Like you kind of make them sequential by putting in position embeddings.

Dwarkesh Podcast
Fully autonomous robots are much closer than you think โ€“ Sergey Levine

Transformers are fundamentally actually very parallelizable things.