Joe Carlsmith

๐Ÿ‘ค Speaker
1218 total appearances

Podcast Appearances

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

Where does that fit into this picture?

I think it's a good question.

I mean, I think...

I think it's like some guess about like, if there's like no part of me that recognizes it as good, then I think I'm not sure that it's good according to me in some sense.

Like, so yeah, I mean, it is a question of like what it takes for it to be the case that a part of you recognizes it as good.

But I think if there's really none of that, then I'm not sure, yeah, it's a reflection of my values at all.

Yeah, I mean, you definitely don't want to be like, you know, if you transform me into a paper clipper gradually, then I will eventually be like, and then I saw the light.

I saw the true paper clips.

But that's part of what's complicated about this thing about reflection.

You have to find some way of differentiating between the sort of development processes that preserve what you care about and the development processes that don't.

And that in itself is this fraught question, which itself requires taking some stand on what you care about and what sorts of meta processes you endorse and all sorts of things.

But you definitely shouldn't just be like, it is not a sufficient criteria that the thing at the end thinks it got it right.

Because that's compatible with having gone like wildly off the rails.

Yeah, so the context on that post is I'm talking about this hazy cluster, which I call in the essay, niceness slash liberalism slash boundaries, which is this sort of like somewhat more minimal set of like cooperative norms involved in like respecting the boundaries of others and kind of cooperation and peace amongst differences and like tolerance and stuff like that, as opposed to like your favorite structure of matter, which is sort of sometimes the paradigm of like values that people use in the context of AI risk.

And, you know, I talk for a while about the sort of ethical virtues of these like norms, but it's pretty clear that

Also, like, why do we have these norms?