Trenton Bricken

👤 Person

1589 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

I think they just waited.

5106.856 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

and then were able to take advantage of all the efficiency gains that everyone else was also seeing.

5108.397 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Yeah, and to go from like way behind the frontier to like, oh, this is like a real player.

5127.58 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Yeah, I don't know about fractions.

5418.408 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

It might be like you have a hunch for a core problem.

5419.75 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

You can think of 10 possible ways to solve it.

5422.515 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And then you just need to try them and see what works.

5424.959 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And that's kind of where the trial and error like sorcery of deep learning can kind of kick in.

5427.904 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

But Dorcas, you said, oh, well, the model can do the more straightforward things and not the deeper thought.

5492.557 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

I mean, I do want to push back on that a little bit.

5497.992 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

I think, again, if the model has the right context and scaffolding, it's starting to be able to do some really interesting things.

5499.917 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

The Interp agent has been a surprise to people, even internally, at how good it is at finding the needle in the haystack, like when it plays the auditing game, finding this reward model bias feature, and then reasoning about it, and then systematically testing its hypotheses.

5507.558 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

So it looks at that feature, then it looks at similar features.

5523.117 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

It finds one with a preference for chocolate.

5526.384 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

It's like, huh, that's really weird that the model wants to add chocolate to recipes.

5528.489 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Let me test it.

5531.977 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And so then it will make up like, hey, I'm trying to make a tomato soup.

5533.06 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

What would be a good ingredient for it?

5537.39 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And then sees that the model replies chocolate.

5539.354 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

reasons through it, and then keeps going, right?

5542 View full episode →

← Previous Page 22 of 80 Next →

Report any issue