Sholto Douglas

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

OK.

37.652

So I think the biggest thing that's changed is RL and language models has finally worked.

38.072

And this is manifested in we finally have proof of an algorithm that can give us expert human reliability and performance given the right feedback loop.

41.716

And so I think this is only really being conclusively demonstrated in competitive programming and math, basically.

51.126

And so if you think of these two axes, one is the intellectual complexity of the task, and the other is the time horizon of which the task is being completed on.

57.153

And I think we have proof that we can reach the peaks of intellectual complexity along many dimensions.

66.288

But we haven't yet demonstrated long-running agentic performance.

72.678

And you're seeing the first stumbling steps of that now and should see much more conclusive evidence of that basically by the end of the year with real software engineering agents doing real work.

77.346

And I think, Trenton, you're like,

87.883

I think this is roughly on track for where I expected with software engineering.

122

I think I expected them to be a little bit better at computer use.

125.505

Yeah.

127.548

But I understand all the reasons for why that is, and I think that's well on track to be solved.

128.509

It's just a temporary lapse.

133.676

And holding me accountable for my predictions next year, I really do think end of this year, sort of like this time next year, we have software engineering agents that can do close to a day's worth of work for a junior engineer or a couple of hours of quite competent independent work.

138.524

I think my description there was, in retrospect, probably not what's limiting them.

185.149

I think what we're seeing now is closer to lack of context,

190.436

lack of ability to do complex, very multi-file changes and maybe scope of the change or scope of the task in some respects.

195.764

They can cope with high intellectual complexity in a focused context with a really scoped problem.

208.341

But when something's a bit more amorphous or requires a lot of discovery and iteration with the environment, this kind of stuff, they struggle more.

214.469
