Joe Carlsmith
The runs were structurally quite similar.
Everyone was using the same techniques.
Maybe someone just stole the weights.
So, yeah, I guess I think it's really important, this idea that to the extent you haven't solved alignment, you likely haven't solved it anywhere.
And if someone has solved it and someone else hasn't, then I think it's a better question.
But if everyone's building systems that are going to go rogue, then I don't think that's much comfort, as we talked about.
Yeah, I'll just say on that front, I do think the Otherness and Control series is, in some sense, separable. It has a lot to do with misalignment stuff, but I think a lot of those issues are relevant even given various degrees of skepticism about some of the stuff I've been saying here.
Yeah.
Yeah, in terms of why it's possible that AIs could take over from a given position in one of these projects I've been describing, I think Carl's discussion is pretty good and gets into a bunch of the weeds that might give a more concrete sense.
One scenario I think about a lot is one in which it just turns out that fairly basic measures are enough to ensure, for example, that AIs don't cause catastrophic harm, don't seek power in problematic ways, and so on. And it could turn out that we learn it was easy, in a way such that we regret how we prioritized, you know, we wish we had prioritized differently.
We end up thinking, oh, you know, I wish we had cured cancer sooner, or handled some geopolitical dynamic differently.
There's another scenario where we end up looking back at some period of our history and how we