And even there, like, especially once it's spotted, it's like, oh, this is a key part of its persona.
I see this Oxford paper.
What if I change Oxford to Stanford?
What if I now say Richard Feynman really likes this thing?
And it's, like, really carving out the hypothesis space and testing things in a way that I'm kind of surprised by.
Make number go down.
Just flip the sign.
One prediction I have is that we're going to move away from "can an agent do X, Y, Z" and more towards: can I efficiently deploy and launch 100 agents, give them the feedback they need, and even just be able to easily verify what they're up to, right?
There's this generator-verifier gap that people talk about, where it's much easier to check something than it is to produce the solution on your own.
Yeah.
It's very plausible to me that we'll be at the point where it's so easy to generate with these agents that the bottleneck is actually: can I, as the human, verify the answer?
And again, you're guaranteed to get an answer with these things.
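As a toy illustration of that asymmetry (my example, not from the conversation): checking a proposed answer can be a single cheap operation, while producing it requires a search.

```python
import math

def verify_factorization(n: int, factors: list[int]) -> bool:
    """Cheap check: multiply the proposed factors back together."""
    return math.prod(factors) == n and all(f > 1 for f in factors)

def generate_factorization(n: int) -> list[int]:
    """Expensive direction: trial division to actually find the factors."""
    factors, d = [], 2
    while d * d <= n:
        while n % d == 0:
            factors.append(d)
            n //= d
        d += 1
    if n > 1:
        factors.append(n)
    return factors

n = 2_147_483_647 * 999_983            # product of two primes
candidate = generate_factorization(n)  # slow: has to search
assert verify_factorization(n, candidate)  # fast: one multiplication
```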
And so ideally you have some automated way to evaluate it and assign a score for how well it worked, how well this thing generalized.
And at a minimum, you have a way to easily summarize what a bunch of agents are finding.
And it's like, OK, well, if 20 of my 100 agents all found this one thing, then it has a higher chance of being true.
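A minimal sketch of that fan-out-and-aggregate pattern, with hypothetical stand-ins (run_agent and score_result are placeholders, not any real agent API): launch many agents on the same task, keep only results that pass an automated check, and surface the findings that independent agents converge on.

```python
import asyncio
import random
from collections import Counter

# Hypothetical stand-ins: in practice run_agent would call your agent
# framework and score_result would be your automated eval / test harness.
async def run_agent(task: str, agent_id: int) -> str:
    await asyncio.sleep(0)  # placeholder for real agent work
    return random.choice(["finding A", "finding B", "finding C"])

def score_result(finding: str) -> float:
    return 1.0  # placeholder: e.g. fraction of tests passed

async def fan_out(task: str, n_agents: int = 100, min_agreement: int = 20):
    # Launch all agents on the same task concurrently.
    findings = await asyncio.gather(*(run_agent(task, i) for i in range(n_agents)))
    # Keep only results that pass the automated verifier.
    verified = [f for f in findings if score_result(f) >= 0.5]
    # Findings that many independent agents converge on are more likely real.
    counts = Counter(verified)
    return [(f, n) for f, n in counts.most_common() if n >= min_agreement]

print(asyncio.run(fan_out("investigate the regression in eval X")))
```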
Yeah, but just to be really concrete, or pedantic, about the bottlenecks here: a lot of it is, again, just tooling and whether the pipes are connected.
There are a lot of things where I can't just launch Claude and have it go and solve them, because maybe it needs a GPU,
or maybe I need very careful permissioning so that it can't just take over an entire cluster and launch a whole bunch of things, right?
So you really do need good sandboxing and the ability to use all of the tools that are necessary.
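To make the permissioning point concrete, here is a minimal sketch of the kind of gate you might put between an agent and the machine; the allowlist and blocked commands are purely illustrative assumptions, not any particular sandbox's real policy.

```python
import shlex
import subprocess

# Illustrative policy: only these commands may run inside the agent's
# sandbox, and cluster-level operations are explicitly blocked.
ALLOWED_COMMANDS = {"python", "pytest", "git", "ls", "cat"}
BLOCKED_SUBSTRINGS = ("sbatch", "srun", "kubectl")  # cluster schedulers

def run_in_sandbox(command: str, timeout_s: int = 300) -> subprocess.CompletedProcess:
    """Run an agent-issued command only if it passes the permission checks."""
    argv = shlex.split(command)
    if not argv or argv[0] not in ALLOWED_COMMANDS:
        raise PermissionError(f"command not allowlisted: {argv[:1]}")
    if any(s in command for s in BLOCKED_SUBSTRINGS):
        raise PermissionError("cluster-level operations are blocked")
    # A real sandbox would also isolate the filesystem, network, and GPUs;
    # this only gates which binaries the agent can invoke.
    return subprocess.run(argv, capture_output=True, text=True, timeout=timeout_s)
```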
But I think part of it is, is it async or not?