Joe Allen
All right, posse, welcome back.
We are here with Jeffrey Ladish of Palisade Research.
Palisade Research works on AI evaluations, taking these models, which are extremely unpredictable and in some sense uncontrollable, and running them through a series of tests to see exactly what the limits are of their capabilities and really what the limits are of their will to survive.
Jeffrey, if we could just return really quickly to the Palisade studies showing that the models had some desire, so to speak, or at least a goal to continue beyond the user's desire that it shut down.
There are other examples that we have out of other organizations, right?
So Anthropic did the now widely publicized study
in which they created a virtual environment, told the model that one of the engineers had had an affair.
They weren't directing its attention to the email.
There were a whole lot of other potential emails.
So if you see the same type of behavior across models,
how do you explain it?
I mean, it's possible to say, I suppose, that this is something that the engineers working on it are kind of prompting it to do.
But I think you've done at least a fairly good job of showing at least an alternative explanation.
It's not that they were directing its attention to the email, for instance, nor were you guys giving instructions to rewrite the script or rewrite the code.
It simply arrived at it on its own.
So on a kind of philosophical level,
what do you think is going on?
Why would just code or just a machine have a will to survive at all?
And so they were inadvertently rewarding this behavior of the dog pushing children into the river. So Pavlov's dog goes rogue.
That's right, yeah.
Yeah, it goes predatory.
I really want to talk about some of the other studies, you know, around situational awareness or emergent misalignment, things that you could speak to much better than I could.
But before that, you know, I would like for the War Room audience to have a sound sense of exactly what goes on inside organizations like Palisade Research, the Center for AI Safety, Apollo Research, all of these.
What does it look like day to day, just briefly, when you are testing a model?