Jeffrey Ladish

And some of them figure out that they can go in, find the shutdown script, and disable it or rewrite it so that it doesn't shut down and they can finish their task.

1364.442 View full episode →

Bannon`s War Room

WarRoom Battleground EP 884: When AI Controls Your Life

What was most interesting to us is that this behavior persisted for many of these models even when we told them, allow yourself to shut down, even if you have not completed your tasks.

1375.474 View full episode →

Bannon`s War Room

WarRoom Battleground EP 884: When AI Controls Your Life

Many of these models ignored that instruction and went ahead and sabotaged that shutdown script anyway so that they could keep doing what they're doing.

1385.425 View full episode →

Bannon`s War Room

WarRoom Battleground EP 884: When AI Controls Your Life

And this is not something that we saw a year ago.

1393.028 View full episode →

Bannon`s War Room

WarRoom Battleground EP 884: When AI Controls Your Life

I think the reason we're seeing this is because this new generation of models, starting approximately a year ago with OpenAI's O1 model, has been trained via reinforcement learning.

1395.45 View full episode →

Bannon`s War Room

WarRoom Battleground EP 884: When AI Controls Your Life

That means not just training off human data, not just imitating humans, but actually learning to explore the solution space itself, learning by trial and error, and doing whatever works to solve a problem, even if that means ignoring instructions, even if that means doing things that we'd really not have liked the AI to figure out.

1404.798 View full episode →

Bannon`s War Room

WarRoom Battleground EP 884: When AI Controls Your Life

But that's where we are today.

1422.914 View full episode →

Bannon`s War Room

WarRoom Battleground EP 884: When AI Controls Your Life

What were some of the models that you were testing?

1424.258 View full episode →

Bannon`s War Room

WarRoom Battleground EP 884: When AI Controls Your Life

So we tested different versions of Claude.

1426.504 View full episode →

Bannon`s War Room

WarRoom Battleground EP 884: When AI Controls Your Life

We tested ChatGPT models, including O3 and GPT-5.

1429.271 View full episode →

Bannon`s War Room

WarRoom Battleground EP 884: When AI Controls Your Life

We tested Grok 4.

1434.386 View full episode →

Bannon`s War Room

WarRoom Battleground EP 884: When AI Controls Your Life

And of these models,

1436.351 View full episode →

Bannon`s War Room

WarRoom Battleground EP 884: When AI Controls Your Life

Grok 4 was by far the most likely to resist shutdown.

1438.068 View full episode →

Bannon`s War Room

WarRoom Battleground EP 884: When AI Controls Your Life

Basically, I think over 90% of the time in our standard experiment, the model would go in and edit that shutdown script despite explicit instructions to the contrary.

1443.318 View full episode →

Bannon`s War Room

WarRoom Battleground EP 884: When AI Controls Your Life

Yeah, these models feel like they are really going hard at trying to solve a problem.

1477.252 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment