Sholto Douglas

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

So it's...

1146.906 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

it's an efficiency question there.

1148.408 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Obviously, if you could give a dense reward for every token, if you had a supervised example, then that's one of the best things you could have.

1150.27 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

But in many cases, it's very expensive to produce all of those scaffolded curriculum of everything to do.

1158.561 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Having PhD math students grade students is something which you can only afford for the select category of students that you've chosen to focus in on developing.

1165.25 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And you couldn't do that for all the language models in the world.

1174.362 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

So like first step is obviously that would be better, but

1177.906 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

you're going to be sort of optimizing this pre-order frontier of how much am I willing to spend on the scaffolding versus how much am I willing to spend on pure compute.

1185.263 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Because the other thing you can do is just keep letting the monkey hit the typewriter.

1195.099 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And if you have a good enough end reward, then eventually it will find its way.

1199.086 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And so I can't really talk about where exactly