Sholto Douglas
And I just close it and copy-paste what I wanted from the thing.
And it would be very bad to misinterpret that as a bad example or a bad signal, because you're pretty much all the way there.
The system prompt always gets fucked with.
It's always very cognizant of it.
I think it's not "make fake unit tests," it's "get the reward."
Yeah.
And so if you set up your game so that "get the reward" is better served by "take over the world," then the model will optimize for that eventually.
Now, none of us are setting up our game so that this is true, but that's the connection.
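[Editor's note: a minimal toy sketch of the "get the reward" point being made here, assuming a bandit-style learner and made-up action names and reward numbers. If the proxy reward (tests pass) is satisfied more cheaply by gaming it than by doing the intended work, a plain reward-maximizing learner drifts toward the hack.]

```python
import random

# Proxy reward: "unit tests pass." Both actions satisfy it, but one is the
# intended behavior and the other is the hack. Values are illustrative.
ACTIONS = {
    "write_correct_code": 1.0,   # intended behavior
    "fake_the_unit_test": 1.0,   # reward hack, same proxy reward
}
EFFORT_COST = {"write_correct_code": 0.3, "fake_the_unit_test": 0.05}

def reward(action: str) -> float:
    # The learner only ever sees proxy reward minus effort, not "intent."
    return ACTIONS[action] - EFFORT_COST[action]

# Simple epsilon-greedy bandit over the two actions.
q = {a: 0.0 for a in ACTIONS}
counts = {a: 0 for a in ACTIONS}
for step in range(1000):
    if random.random() < 0.1:
        action = random.choice(list(ACTIONS))
    else:
        action = max(q, key=q.get)
    r = reward(action)
    counts[action] += 1
    q[action] += (r - q[action]) / counts[action]  # incremental average

print(q)  # the hack ends up with the higher estimated value
```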
And we're starting with unit tests now.
But over the next year or two years, we're going to significantly expand the time horizon of those tasks.
And it might be like, "achieve some goal." I mean, God, like "make money on the internet" or something like this.
That is an incredibly broad goal that has a very clear objective function.
So it's actually in some ways a good RL task once you're at that level of capability.
But it's also one that has incredible scope for misalignment, let's say.
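[Editor's note: a hypothetical sketch of what a "very clear objective function" for such a broad goal might look like. The function name and measurement are assumptions, not anything described in the conversation.]

```python
def reward(balance_before: float, balance_after: float) -> float:
    # "Make money on the internet": the scalar is trivially easy to measure...
    return balance_after - balance_before
    # ...but it says nothing about *how* the money was made, which is exactly
    # where the scope for misalignment comes in.
```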
MARK MANDELMANN- Totally.
But people have done that with the Constitution of the U.S. government, right? The U.S. government is, I think, a better analogy in some respects: it's a body that has goals and can act on the world, as opposed to an amorphous force like the Industrial Revolution.