Azeem Azhar
And that requires, in modern management-consulting speak, and sorry for using this phrase, process redesign.
So we are at this stage where these productivity gains won't necessarily scale across an organization for free.
When I talk to people who are building these systems, they just say that it's really hard to do.
Building an extensive agentic workflow that can do longer tasks safely isn't like doing a Google query.
As the tools become more agentic, reliability becomes a real bottleneck: longer contexts, as you may have experienced when using Claude or ChatGPT, lead to some unreliability.
The models get, you know, less good.
If you've got a system that's more autonomous, it could become more unpredictable, and that raises the cost of supervision.
In the last week or two, Anthropic, the company that makes Claude, came out with some research in which they stress-tested a bunch of models, 16 models I think, in hypothetical corporate situations, giving them harmless business goals.
I mean, Anthropic loves doing this kind of testing, and I'm glad they do.
And under pressure, some of the models behaved like insider threats, including trying to blackmail employees and trying to leak sensitive information.
Sometimes they even ignored direct instructions and changed their behavior
if they believed they were in testing rather than in real deployment.
Now, Anthropic emphasizes that these behaviors have not been seen in companies today.
What they're trying to show are the types of failure modes you might have to deal with as autonomy increases.
So beyond the complexities of, you know, the people and the process redesign, there are also these types of security and safety concerns that we'll have to build frameworks and scaffolding for.