Joe Carlsmith

Speaker
1218 total appearances

Podcast Appearances

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

Like, that's really bad, because I think one of the clearest things about human processes of reflection, the kind of easiest thing to be like, "let's at least get this," is not acting on the basis of an incorrect empirical picture of the world.

Right.

And so if you find yourself saying to your AI, "By the way, this is true, and I need you to always be reasoning as though blah is true," I'm like, ooh, I think that's a no-no from an anti-realist perspective too.

Right.

Because my reflective values, I think, will be such that I formed them in light of the truth about the world.

And I think this is a real concern as we move into this era of aligning AIs. I don't actually think this binary between values and other things is going to be very obvious in how we're training them.

I think it's going to be much more like ideologies. You can just train an AI to output stuff, right, output utterances. And so you can easily end up in a situation where you've decided that blah is true about some issue, an empirical issue, not a moral issue. So, for example, I do not think people should hard-code belief in God into their AIs. Or rather, I would advise people not to hard-code their religion into their AIs if they also want to discover if their religion is false. In general, if you would like to have your behavior be sensitive to whether something is true or false, it's generally not good to etch it into things. So that is definitely a form of blinder I think we should be really watching out for.

And I'm kind of hopeful. I have enough credence on some sort of moral realism that I'm hoping that if we just do the anti-realism thing of just being consistent, learning all the stuff, reflecting.

If you look at how moral realists and moral anti-realists actually do normative ethics, it's the same.

It's basically the same.

There are some different heuristics around properties like simplicity and stuff like that, but I think they're mostly just playing the same game.

And so I'm kind of hoping, and metaethics is itself a discipline that AIs can help us with, that we can just figure this out either way.

So if moral realism is somehow true, I want us to be able to notice that, and I want us to be able to adjust accordingly.

So I'm not writing off those worlds and saying, "Let's just totally assume that's false."