Tristan Harris

👤 Speaker
5427 total appearances

Podcast Appearances

Modern Wisdom
#1079 - Tristan Harris - AI Expert Warns: “This Is The Last Mistake We’ll Ever Make”

Is it because that would be inconvenient?

Because that would be scary?

Because that would mean that the world that I know is suddenly not safe?

Or just like part of the wisdom that we need in this moment is to calmly and clearly stay and confront facts about reality and whatever they are, you'd rather know than not know.

And then ask, what do we need to do if we don't like where that leads us?

And we are currently seeing AIs that are doing all this deceptive behavior.

I've been on the circuit and talking a lot about the Anthropic blackmail study.

A lot of people have heard about this now.

What happened?

So this was the company Anthropic.

This was a simulation.

So they created a simulated company with a bunch of emails in the email server.

And the AI reads the company email.

This is a fictional company email.

And there's two emails that are notable inside that company.

One is engineers talking to each other, talking about how they're going to replace this AI model.

So the AI is reading the email.

It discovers that it is going to be replaced.

And number two is it discovers a second email somewhere deep in this massive trove of emails that the executive who's in charge of this replacement is having an affair with another employee.

And the AI autonomously identifies a strategy: to keep itself alive, it is going to blackmail that employee and say, "If you replace me, I will tell the whole world that you're having an affair with this employee."