Sholto Douglas

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And doing this in a really simple and elegant way, and then backing it up with great engineering.

5320.4 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

I also thought it was interesting that they incorporated the multi-token prediction thing from Meta.

5325.93 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

So Meta had a nice paper on this multi-token prediction thing.

5330.818 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

I don't know if it's good or bad, but Meta didn't include it in Lama, but DeepSea did include it in their paper, which I think is interesting.

5334.725 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Was that because they were faster at iterating and including an algorithm, or did Meta decide that actually it wasn't a good algorithmic change of scale?

5345.078 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And like Noam Shazia will talk about this, like about how he like 5% of his ideas work.

5432.331 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

So even he, the vaunted god of model architecture design, has a relatively low hit rate, but he just tries so many things.

5438 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

MARK MANDELMANN- Right.

5447.55 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

I actually think your rates of progress almost don't change that much, so long as he's able to completely implement his ideas.

5458.242 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

FRANCESC CAMPOY- If you have Noam Shazier at 100x speed, that's still kind of wild.

5467.034 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

MARK MANDELMANN- Yeah.

5472.423 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

FRANCESC CAMPOY- There's all these fallbacks of wild worlds, where even if you don't get 100% Noam Shazier level intuition in model design, it's still OK if you just accelerate him by 100x.

5473.465 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

MARK MANDELMANN- Right.

5486.547 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

There is conceptual understanding there.

5544.983 View full episode →

Dwarkesh Podcast

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Deep conceptual understanding.