Sholto Douglas

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

You don't have to use 100% of your brain all the time.

4399.683 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

MARK MANDELBACH- Right.

4402.346 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Welcome to my world.

4405.229 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

FRANCESC CAMPOY- And so you should be able to run that faster and this kind of stuff, basically.

4405.79 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

So I think it's net-net, I think, typically the same model.

4409.154 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Because you want to be able to scale the understanding as the complexity and difficulty.

4412.919 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

You want to be able to do that dynamically.

4417.884 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

So, yeah.

4445.547 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

The residual screamers is like this operating RAM.

4449.312 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

You're doing stuff to it.

4451.034 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Right.

4452.817 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

It's like the mental model I think one takes away from interpretability work.

4452.957 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

There's a surprisingly strong bias so far towards token syntax.

4555.572 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

It seems to work very well.

4559.774 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

There already is some amount of neuralese, right?

4564.87 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

If you think about the residual stream for each token is like neuralese to some degree.

4567.212 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And so now we're just trading off axes, like how much neuralese are you doing versus how much actually is read out to tokens all the time.

4571.056 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Interpretability becomes dramatically more important as you shift in this direction of neuralese.

4657.463 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And there'll be like some selective pressure against it so long as the agents are working with humans because they'll want to sort of cooperate.

4697.592 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

But then like as agents begin to work more and more with each other, then that selective pressure like changes the other direction basically.

4705.067 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment