Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Joe Carlsmith

๐Ÿ‘ค Speaker
1218 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

I mean, look, it will be a very interesting fact if it's like, man, we keep training these AIs in all sorts of different ways.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

Like, we're doing all this crazy stuff and they keep...

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

acting like bourgeois liberals.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

It's like, wow.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

They keep professing this weird alien reality.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

They all converge on this one thing.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

They're like, can't you see?

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

It's like Zorgle.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

And all the AIs.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

Interesting.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

Very interesting.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

I think my personal prediction is that that's not what we see.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

And my actual prediction is that the AIs are going to be very malleable.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

We're going to be like...

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

you know, if you push an AI towards evil, like it'll just go.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

Um, and, and I think that's, um, uh, obviously, or sort of reflectively consistent evil.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

I mean, I think there's also a question with some of these AIs.

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

It's like, um, uh,

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

will they even be consistent in their values, right?

Dwarkesh Podcast
Joe Carlsmith - Otherness and control in the age of AGI

I do think like a thing we can do, so I like this image of the blinded horses and I like this image of like maybe alignment is gonna mess with the, I think we should be really concerned if we're like forcing facts on our AIs, right?