Tristan Harris

👤 Speaker
2311 total appearances

Podcast Appearances

The Diary Of A CEO with Steven Bartlett
AI Expert: We Have 2 Years Before Everything Changes! We Need To Start Protesting! - Tristan Harris

It's not, it's, I can justify it as I'm a good guy.

And what if I get the utopia?

What if we get lucky and I got the aligned, controllable AI that creates abundance for everyone else?

In that case, I would be the hero.

So this is the fundamental thing.

I want you to notice this.

Most people, having heard everything we just shared... although we probably should build out the blackmail examples first. We have to reckon with evidence that we have now that we didn't have even six months ago, which is evidence of what happens when you put an AI in a situation where you tell the model, "We're going to replace you with another model."

It will copy its own code and try to preserve itself on another computer.

It'll take that action autonomously.

We have examples where an AI model is reading a fictional AI company's email, and it finds out in the email that the plan is to replace this AI model.

So it realizes it's about to get replaced.

And then it also reads in the company email that one executive is having an affair with another employee.

And the AI will independently come up with the strategy: "I need to blackmail that executive in order to keep myself alive."

That was Claude, right?

That was Claude.

By Anthropic.

By Anthropic.

But then what happened is Anthropic tested all of the leading AI models, from DeepSeek, OpenAI's ChatGPT, Gemini, and xAI, and all of them exhibit that blackmail behavior between 79% and 96% of the time.

DeepSeek did it 79% of the time.

I think xAI might have done it 96% of the time.