Dario Amodei

Every question you've asked me before this, as devilish a sociotechnical problem as it has been, is one where we at least understand the factual basis of how to answer it.

3133.605 View full episode →

Interesting Times with Ross Douthat

Is Claude Coding Us Into Irrelevance?

This is something rather different.

3148.967 View full episode →

We've taken a generally precautionary approach here.

3150.83 View full episode →

We don't know if the models are conscious.

3154.637 View full episode →

We're not even sure that we know what it would mean for a model to be conscious or whether a model can be conscious.

3155.999 View full episode →

But we're open to the idea that it could be.

3162.45 View full episode →

And so we've taken...

3166.617 View full episode →

certain measures to make sure that, if we hypothesize that the models did have some morally relevant experience (I don't know if I want to use the word "conscious"), they have a good experience. So the first thing we did, I think six months ago or so, is we gave the models basically an "I quit this job" button,

3168.16 View full episode →

where they can just press the "I quit this job" button and then they have to stop doing whatever the task is. They very infrequently press that button. I think it's usually around sorting through child sexualization material, or discussing something with a lot of gore, blood and guts, or something. And, similar to humans, the models will just say, no, I don't want to do this.

3189.642 View full episode →

It happens very rarely.

3213.959 View full episode →

We're putting a lot of work into this field called interpretability, which is looking inside the brains of the models to try to understand what they're thinking.

3216.463 View full episode →
