Trenton Bricken

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

I'm pretty sure there's nothing in the envelope.

2834.885 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

I think Anthropic did a survey of like a whole bunch of people and put that into its constitutional data.

2945.14 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

But yeah, I mean, there's a lot more to be done here.

2951.854 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

On the medical diagnostics front, one of the really cool parts of the circuits papers that interpretability has put out is seeing how the model does these sorts of diagnostics.

3297.98 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And so you present it with – there's this specific complication in pregnancy that I'm going to mispronounce.

3308.564 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

It presents a number of symptoms that are hard to diagnose, and you basically are like, human, we're in the emergency room.

3315.8 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Sorry, sorry, like human colon, like as in the human prompt is we're in the emergency room, and a woman 20 weeks into gestation is experiencing like these three symptoms.

3323.431 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Like what is the – you can only ask about one symptom.

3337.009 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

What is it?

3339.413 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And then you can see the circuit for the model and how it reasons.

3340.454 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

One, you can see it maps 20 weeks of gestation to that the woman's pregnant.

3345.723 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

You never explicitly said that.

3351.974 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And then you can see it extract each of these different symptoms early on in the circuit, map all of them to this specific medical case, which is the correct answer here that we were going for.

3353.657 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

and then project that out to all of the different possible other symptoms that weren't mentioned, and then have it decide to ask about one of those.

3365.378 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And so it's pretty cool to see this clean medical understanding of cause and effect inside the circuit.

3373.667 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

I think people are still sleeping on the circuits work that came out.

3392.868 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

If anything because it's just kind of hard to wrap your head around or we're like still getting used to the fact you can even get features for a single layer.

3397.873 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Yeah.

3403.539 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Like in another case there's this poetry example and by the end of the first sentence the model already knows what it wants to write in the poem at the end of the second sentence and it will like backfill and then plan out the whole thing.

3404.68 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Yeah.

3418.674 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment