Rob Wiblin
And right now, companies do release benchmark results when they release models.
So they say, for example, Claude Opus 4 was released, and it has a model card that says it got this score on this hacking benchmark, this score on the software engineering benchmark, and so on, as part of a report about whether it's dangerous.
Or GPT-5 had the same thing.
I think it's great that they do that.
But in my ideal world, they would release their highest internal benchmark scores on some regular calendar cadence.
So every three months, they would say: we've achieved this score on this hacking benchmark, this score on the software engineering benchmark, this score on an autonomy benchmark.
And that's because, as you said, danger could manifest from purely internal deployment.
Because if they have an AI agent that's sufficiently good at AI R&D, they could use that to go much faster internally.
And then other capabilities and therefore other risks might come online much faster than people were previously expecting.
So it's not ideal to have your report card for the model come out when you release it to the public, unless there's some sort of guarantee that you're not sitting on a product that's substantially more powerful than the public product.
So maybe it's fine to release your model card and system card along with the product if you also separately have a guarantee that there won't be too much of a gap between the internal and external models.
So that's on the end of things that are currently discussed: it's how I would tweak the information that's already reported to make it somewhat more helpful for this concern.
But then there's a bunch of other stuff that is not currently reported that I think would ideally be really great to know.
Stuff like: how much, and how, are they using AI systems internally?
So one thing I'm very interested in: companies will sometimes report, kind of as a brag, the percentage of lines of code that are written by their AI systems.
Various CEOs have said things like, internally, 90% of our lines of code are written by AIs.
I think it'd be great to have systematic reporting of those kinds of metrics, but they aren't the ideal metric I'd be interested in.