Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Dario Amodei

๐Ÿ‘ค Speaker
2714 total appearances

Appearances Over Time

Podcast Appearances

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

That's one of the things that has made it easier

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

Right now, separate from that, you know, there's a research area called continual learning, which is where these agents would kind of learn during time, learn on the job.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

And obviously that has a bunch of advantages.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

Some people think it's one of the most important barriers to making these more human-like, but that would introduce all these new alignment problems.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

So I'm actually a skeptic that continual learning is necessary.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

We don't know yet, but is necessarily needed.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

Like maybe there's a world where the way we make these AI systems safe is by not having them do continual learning.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

Again, you know, if we go back to the laws and international treaties, like if you have some barrier that's like we're going to take this path, but we're not going to take that path.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

I still have a lot of skepticism, but like that's the kind of thing that like at least doesn't seem dead on arrival.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

What the hell is that?

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

It's actually almost exactly what it sounds like.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

So basically, the constitution is a document readable by humans.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

Ours is about 75 pages long.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

As we're training Claude, as we're training the AI system, in some large fraction of the tasks we give it, we say, please do this task in line with this constitution, in line with this document.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

And then so every time Claude does a task, it kind of like reads the constitution.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

And so as it's training, every loop of its training, it looks at that constitution and keeps it in mind.

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

And so over time...

Interesting Times with Ross Douthat
Is Claude Coding Us Into Irrelevance?

And then we have Claude itself, or another copy of Claude, evaluate, hey, did what Claude just do in line with the Constitution?