Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Dario Amodei

๐Ÿ‘ค Speaker
2714 total appearances

Appearances Over Time

Podcast Appearances

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

No matter what you think, don't make biological weapons.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

No matter what you think, don't make child sexual material.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

Those are like these hard rules.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

But we operate very much at the level of principles.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

It's like you're talking to a person.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

I think I compared it to like if you have a parent who like dies and they like seal a letter that you read when you grow up.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

It's a little bit like it's telling you who you should be and what what advice you should follow.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

Yeah, this is one of these really hard-to-answer questions, right?

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

As much as every question you've asked me before this, as devilish a sociotechnical problem as it had been, we at least understand the factual basis of how to answer these questions.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

This is something rather different.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

We've taken a generally precautionary approach here.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

We don't know if the models are conscious.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

We're not even sure that we know what it would mean for a model to be conscious or whether a model can be conscious.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

But we're open to the idea that it could be.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

And so we've taken...

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

certain measures to you know to to make sure that if we hypothesize that the models did have some morally relevant experience i don't know if i want to use the word conscious that that they do you know that they have a good experience so the first thing we did i think this was you know six months ago or so is we gave the models basically an i quit this job button

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

um where they can just press the i quit this job button and then they have to stop doing whatever the task is they very infrequently press that button i think it's it's usually around you know sorting through child sexualization material or like you know discussing something with you know a lot of gore blood and guts or something and you know similar to humans the models will just say no i i don't want to i don't want to do this um

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

Happens happens very rarely.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

We're putting a lot of work into this field called interpretability, which is looking inside the brains of the models to try to understand what they're thinking.