Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing

Sholto Douglas

👤 Person
1567 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

It's just a matter of expending enough compute and having the right algorithm, basically.

Dwarkesh Podcast
Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

You know the parable about when you choose to launch a space mission?

Dwarkesh Podcast
Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

How you should acquire, go further up the tech tree, because if you launch later on, your ship will go faster and this kind of stuff?

Dwarkesh Podcast
Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

I think it's quite similar to that.

Dwarkesh Podcast
Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

You want to be sure that you've algorithmically got the right thing.

Dwarkesh Podcast
Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And then when you bet and you do the large compute spend on the run, then it'll actually pay off.

Dwarkesh Podcast
Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

You'll have the right compute efficiencies and this kind of stuff.

Dwarkesh Podcast
Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And I think RL is slightly different to pre-training in this regard, where RL can be a more iterative thing.

Dwarkesh Podcast
Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

You're progressively adding capabilities to the base model.

Dwarkesh Podcast
Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Pre-training has, in many respects, if you're halfway through a run and you've messed it up, then you've really messed it up.

Dwarkesh Podcast
Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

But I think that's the main reason why, is people are still figuring out exactly what they want to do.

Dwarkesh Podcast
Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

I mean, 01 to 03, OpenAI put in their blog post that it was a 10x compute multiplier over 01.

Dwarkesh Podcast
Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

So clearly they bet on one level of compute, and they were like, OK, this seems good.

Dwarkesh Podcast
Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Let's actually release it.

Dwarkesh Podcast
Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Let's get it out there.

Dwarkesh Podcast
Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And then they spent the next few months increasing the amount of compute that they spent on that.

Dwarkesh Podcast
Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

And I expect, as everyone is, that everyone else is scaling up RL right now.

Dwarkesh Podcast
Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

So I basically don't expect that to be true for very long.

Dwarkesh Podcast
Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

You literally do have a monkey, and it's making Shakespeare.

Dwarkesh Podcast
Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

I was just going to say, like, you do need to be able to get reward sometimes in order to learn.