Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Joe Carlsmith

๐Ÿ‘ค Speaker
1218 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

But that's a kind of empirical claim.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

I'm also just, like, kind of low on this, like, everyone converges thing.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

So, you know, if you imagine, like,

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

you train a chess-playing AI or you have a real paper clipper, right?

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

Somehow you had a real paper clipper.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

And then you're like, okay, go and reflect.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

Based on my understanding of how moral reasoning works, if you look at the type of moral reasoning that analytic ethicists do, it's just reflective equilibrium, right?

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

They just take their intuitions and they systematize them.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

I don't see how that process gets a sort of injection

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

of like the kind of mind independent moral truth, or like, I guess it, like if you sort of start with like only all of your intuition say to maximize paperclips, I don't see how you end up maximizing or like doing some like rich human morality.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

I just don't like, it doesn't look to me like that's how human ethical reasoning works.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

I think like most of what normative philosophy does is make consistent and kind of systematize pre-theoretic intuitions.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

And so,

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

And I think, but we'll get evidence about this.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

Like, you know, in some sense, I think this view predicts like, you know, you keep trying to train the AIs to like do something and they keep being like, no, I'm not going to.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

like do that.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

It's like, no, that's not good.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

Or so they keep like pushing back.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

Like the sort of momentum of like AI cognition is like always in the direction of this like moral truth.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

And whenever we like try to push it in some other direction, we'll find kind of resistance from like the rational structure of things.