Sholto Douglas

So for example, most of the robotics companies are doing this kind of bi-level thing, where they have a motor policy that's running at, like, 60 hertz or whatever, and some higher-level visual language model.

4341.929

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

I'm pretty sure almost all the big robot companies are doing this.

4353.507

And they're doing this for a number of reasons.

4357.112

One of them is that they want something to act at a very high frequency.

4358.314

And two is they can't train the big visual language model.

4362.4

And so they rely on that for general world knowledge and this kind of stuff, and for constructing longer-running plans, but then they offload to the motor policy.

4365.425
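
[Editor's sketch] The two-level setup described above can be illustrated with a toy control loop: a slow planner picks subgoals a few times per second, and a fast motor policy tracks the current subgoal on every tick. This is a minimal sketch under stated assumptions, not any robotics company's actual stack. Only the 60 Hz figure comes from the conversation; the 2 Hz planner rate, the one-dimensional "robot", and the names `slow_planner`, `motor_policy`, `Subgoal`, and `run` are all hypothetical stand-ins.

```python
from dataclasses import dataclass

MOTOR_HZ = 60    # motor policy rate mentioned in the conversation
PLANNER_HZ = 2   # assumed planner rate, chosen only for illustration


@dataclass
class Subgoal:
    target: float  # desired 1-D position, a toy stand-in for one plan step


def slow_planner(position: float, goal: float) -> Subgoal:
    """Hypothetical stand-in for the large visual language model.

    It picks a waypoint at most 1.0 unit closer to the final goal.
    """
    step = max(-1.0, min(1.0, goal - position))
    return Subgoal(target=position + step)


def motor_policy(position: float, subgoal: Subgoal) -> float:
    """Hypothetical stand-in for the small high-frequency policy.

    A proportional controller that returns a velocity command.
    """
    return 4.0 * (subgoal.target - position)


def run(goal: float, seconds: float) -> tuple[float, int, int]:
    """Simulate the loop; return final position, planner calls, motor calls."""
    dt = 1.0 / MOTOR_HZ
    ticks_per_plan = MOTOR_HZ // PLANNER_HZ
    position = 0.0
    subgoal = Subgoal(target=position)
    planner_calls = 0
    motor_calls = 0
    for tick in range(int(seconds * MOTOR_HZ)):
        # The expensive planner only runs on every ticks_per_plan-th tick.
        if tick % ticks_per_plan == 0:
            subgoal = slow_planner(position, goal)
            planner_calls += 1
        # The cheap motor policy runs on every tick.
        position += motor_policy(position, subgoal) * dt
        motor_calls += 1
    return position, planner_calls, motor_calls


if __name__ == "__main__":
    final_position, planner_calls, motor_calls = run(goal=3.0, seconds=5.0)
    print(final_position, planner_calls, motor_calls)
```

In this toy version the planner is called 30 times less often than the motor policy, which is the point of the split: the expensive model supplies the plan, and the cheap one handles the high-frequency actuation.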

I'm very much of the opinion that if you are able to train the big model, eventually at some point in the future, the distinction between big models and small models should disappear because you should be able to use the amount of computation in a model that is necessary to complete the task.

4375.215

Ultimately, there's some amount of task complexity.

4391.733