Trenton Bricken

I wonder if it will emerge more once we allow agents to talk to each other in ways where currently it's kind of trained more in isolation or with a human.

4688.164 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Yeah, I mean, one scary thing, though, is like the way we render text, you can use hidden white space tokens that also encode information.

4717.285 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

That's true.

4726.328 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And so you can imagine a world where it looks like the agent's reasoning and it's scratchpad harmlessly, but it's actually hiding a bunch of data.

4726.79 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

5%.

4826.984 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

I mean, bringing it back to the there's so much low-hanging fruit, it's been wild seeing the efficiency gains that these models have experienced over the last two years.

5061.641 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And yeah, with respect to DeepSeq, I mean, just really hammering home, and Dario has a nice essay on this.

5071.859 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

It's good, yeah.

5077.508 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Deep Seek was nine months after Claude III saw it.

5078.41 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And if we retrained the same model today or at the same time as the Deep Seek work, we also could have trained it for five million or whatever the advertised amount was.

5084.275 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And so what's impressive or surprising is that Deep Seek has gotten to the frontier.

5094.985 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

But I think there's a common misconception still that they are above and beyond the frontier.

5101.01 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And I don't think that's right.

5105.775 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment