Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing

Joe Carlsmith

👤 Person
1218 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

The runs were structurally quite similar.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

Everyone was using the same techniques.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

Maybe someone just stole the weights.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

So

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

Yeah, I guess I think it's really important, this idea that to the extent you haven't solved alignment, you likely haven't solved it anywhere.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

And if someone has solved it and someone hasn't, then I think it's a better question.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

But if everyone's building systems that are going to go rogue, then I don't think that's much comfort, as we talked about.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

Yeah, I mean, I'll just say on that front, I mean, I do think the otherness and control series is...

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

you know, I think kind of in some sense separable.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

I mean, it has a lot, it has a lot to do with like misalignment stuff, but I think it's not.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

I think a lot of those issues are relevant, even if, if even given various degrees of skepticism about some of the stuff I've been saying here.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

Yeah.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

I think like, yeah, in terms of,

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

why is it possible that I guys could take over from a given position in one of these projects I've been describing or something.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

I think Carl's discussion is pretty good and gets into a bunch of kind of the weeds that I think might give a more concrete sense.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

One scenario I think about a lot is one in which it just turns out that maybe kind of fairly basic measures are enough to ensure, for example, that AIs don't cause catastrophic harm, don't kind of seek power in problematic ways, etc.,

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

And it could turn out that we learned that it was easy in a way that, such that we regret, you know, we wish we had prioritized differently.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

We end up thinking, oh, you know, I wish we could have cured cancer sooner.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

We could have handled some geopolitical dynamic differently.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

There's another scenario where we end up looking back at some period of our history and how we