Joe Carlsmith

Joe Carlsmith - Otherness and control in the age of AGI

But I actually think it's like possible we messed that up too.

Joe Carlsmith - Otherness and control in the age of AGI

You know, it's like kind of an, it's an intense project writing like kind of constitutions and like structures of, of rules and stuff that are going to be robust to very intense forms of optimization.

1813.64 View full episode →

Dwarkesh Podcast

Joe Carlsmith - Otherness and control in the age of AGI

So that's, that's a final one that I'll just flag, which I think is like, um, uh,

1822.491 View full episode →

Dwarkesh Podcast

Joe Carlsmith - Otherness and control in the age of AGI

it comes up even if you've sort of solved all these other problems.

1827.948 View full episode →

Dwarkesh Podcast

Joe Carlsmith - Otherness and control in the age of AGI

Yeah, totally.

1863.553 View full episode →

Dwarkesh Podcast

Joe Carlsmith - Otherness and control in the age of AGI

I'm not trying to say, like... Mostly the thing I wanted to do there was just give any... Sure.

1864.054 View full episode →

Dwarkesh Podcast

Joe Carlsmith - Otherness and control in the age of AGI

Like, giving some sense of, like, what might the model's motivations be?

1871.165 View full episode →

Dwarkesh Podcast

Joe Carlsmith - Otherness and control in the age of AGI

Like, what are ways this could be?

1874.209 View full episode →

Dwarkesh Podcast

Joe Carlsmith - Otherness and control in the age of AGI

I mean, as I said, my...

1875.11 View full episode →

Dwarkesh Podcast

Joe Carlsmith - Otherness and control in the age of AGI

my best guess is that it's partly the like alien thing.

1876.953 View full episode →

Dwarkesh Podcast

Joe Carlsmith - Otherness and control in the age of AGI

And, you know, not necessarily, but the, but insofar as you were, you know, also interested in like, what does the model do later?

1880.738 View full episode →

Dwarkesh Podcast

Joe Carlsmith - Otherness and control in the age of AGI

And kind of like how, what sort of future would you expect if models did take over?

1889.229 View full episode →

Dwarkesh Podcast

Joe Carlsmith - Otherness and control in the age of AGI

Then, yeah, I think it can at least be helpful to have some like set of hypotheses on the table instead of just saying like, it has some set of motivations.

1894.776 View full episode →

Dwarkesh Podcast

Joe Carlsmith - Otherness and control in the age of AGI

But in fact, I am like, a lot of the work here is being done by our ignorance about what those motivations are.

1901.684 View full episode →

Dwarkesh Podcast

Joe Carlsmith - Otherness and control in the age of AGI

You know, my best guess when I really think about what do I feel good about, and I think this is probably true of a lot of people, is there's some sort of more organic...

1936.544 View full episode →

Dwarkesh Podcast

Joe Carlsmith - Otherness and control in the age of AGI

decentralized process of like civilizational, incremental civilizational growth.

1948.317 View full episode →

Dwarkesh Podcast

Joe Carlsmith - Otherness and control in the age of AGI

The type of thing we trust most and the type of thing we have most experience with right now as a civilization is some sort of like, okay, we change things a little bit.

1954.475 View full episode →

Dwarkesh Podcast

Joe Carlsmith - Otherness and control in the age of AGI

A lot of people have, there's a lot of like processes of adjustment and reaction and kind of a decentralized sense of like what's changing.

1962.526 View full episode →

Dwarkesh Podcast

Joe Carlsmith - Otherness and control in the age of AGI

You know, was that good?

1972.803 View full episode →

Dwarkesh Podcast

Joe Carlsmith - Otherness and control in the age of AGI

Was that bad?

1973.685 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment