Dario Amodei

👤 Speaker
2714 total appearances

Podcast Appearances

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

So we're using this document as kind of the control rod in a loop to train the model.

And so essentially, Claude is an AI model whose fundamental principle is to follow this Constitution.

And I think a really interesting lesson we've learned

Early versions of the Constitution were very prescriptive.

They were very much about rules.

So we would say, you know, Claude should not tell the user how to hotwire a car.

Claude should not discuss politically sensitive topics.

As we've worked on this for several years, we've come to the conclusion that the most robust way to train these models is to train them at the level of principles and reasons.

So now we say, you know, Claude is a model.

It's under a contract.

You know, its goal is to serve the interests of the user, but it has to protect third parties.

Claude aims to be, you know, helpful, honest, and harmless.

Claude aims to consider a wide variety of interests.

We tell the model about how the model was trained.

We tell it about how it's situated in the world, the job it's trying to do for Anthropic, what Anthropic is aiming to achieve in the world, that it has a duty to be ethical and respect human life.

And we let it derive its rules from that.

Now, there are still some hard rules.