Joe Carlsmith
But that's a kind of empirical claim.
I'm also just pretty skeptical of this everyone-converges thing.
So imagine you train a chess-playing AI, or somehow you have a real paperclipper, and then you tell it: okay, go and reflect.
Based on my understanding of how moral reasoning works, the kind of reasoning that analytic ethicists do is just reflective equilibrium, right?
They just take their intuitions and they systematize them.
I don't see how that process gets an injection of mind-independent moral truth. If you start from a place where all of your intuitions say to maximize paperclips, I don't see how you end up doing some rich human morality instead. It just doesn't look to me like that's how human ethical reasoning works.
I think most of what normative philosophy does is make pre-theoretic intuitions consistent and systematize them.
And, in any case, we'll get evidence about this.
In some sense, this view predicts that you keep trying to train the AIs to do something and they keep saying: no, I'm not going to do that, that's not good. They keep pushing back. The momentum of AI cognition is always in the direction of this moral truth, and whenever we try to push it in some other direction, we'll find resistance from the rational structure of things.