Joe Carlsmith
…thought about AIs, how we treated our AIs.
And we end up looking back with a kind of moral horror at what we were doing.
So we end up thinking: we were thinking about these things centrally as products, as tools. But in fact, we should have been foregrounding much more the sense in which they might be moral patients, or were moral patients at some level of sophistication, and that we were treating them in the wrong way.
We were just acting like we could do whatever we wanted: we could delete them, subject them to arbitrary experiments, alter their minds in arbitrary ways.
And then we end up looking back at that, in the light of history, as a serious and grave moral error.
Those are scenarios I think about a lot, in which we have regrets. But I don't think they quite fit the bill of what you just said.
It sounds to me like the thing you're thinking of is something more like: we end up feeling, gosh, we wish we had paid no attention to the motives of our AIs, that we'd thought not at all about their impact on our society as we incorporated them. And that instead, we had pursued a, let's call it, "maximize for brute power" option, which is to just make a beeline for the most powerful AI you can build, and not think about anything else.
Cool.
So I think there's a bunch of different things to potentially unpack there.
One conceptual point that I want to name off the bat (I don't think you're necessarily making this mistake, but I want to flag it as a possible mistake in this vicinity): I think we don't want to engage in the following form of reasoning.
Let's say you have two entities.
One is in the role of creator and one is in the role of creation.
And then we're positing that there's this kind of misalignment relation between them, whatever that means, right?
And