Trenton Bricken

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

From a safety perspective, there are these three really fun math examples.

3418.754 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

So in one of them, you ask the model to do square root of 64, and it does it.

3423.261 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And you can look at the circuit for it and verify that it actually can perform this square root.

3428.229 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And in another example, it will add two numbers, and you can see that it has these really cool lookup table features that will do the computation.

3433.457 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

for like, the example's 59 plus 36.

3440.287 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

So it'll do the five plus nine and know that it's this modulo operation.

3443.712 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And then it will also at the same time do this fuzzy lookup of like, okay, I know one number is a 30 and one's a 50, so it's gonna be roughly 80.

3450.643 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And then it will combine the two, right?

3459.797 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Okay, so with the square root 64, it's the same thing.

3461.7 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

You can see every single part of the computation and that it's doing it.

3465.046 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And the model tells you what it's doing.

3468.451 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

It has its scratch pad and it goes through it.

3470.333 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And you can be like, yep, okay, you're telling the truth.

3472.316 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

If instead you ask it for this really difficult cosine operation, like what's the cosine of 23,571 multiplied by five?

3474.679 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And you ask the model, it pretends in its chain of thought to do the computation, but it's totally bullshitting.

3485.953 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And it gets the answer wrong.

3493.303 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And when you look at the circuit,

3494.865 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

it's totally meaningless.

3496.767 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

It's clearly not doing any of the right operations.

3498.329 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And then in the final case, you can ask it the same hard cosine question, and you say, I think the answer's 4, but I'm not sure.

3501.713 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment