Rob Wiblin

Every AI Company's Safety Plan is 'Use AI to Make AI Safe'. Is That Crazy? | Ajeya Cotra

And then the ultimate real world measure is actually just observed productivity, right?

Every AI Company's Safety Plan is 'Use AI to Make AI Safe'. Is That Crazy? | Ajeya Cotra

Like if they are seeing internally that they're discovering insights like faster than they were before, then that's a very like late but also very clear signal.

2209.568 View full episode →

80,000 Hours Podcast

Every AI Company's Safety Plan is 'Use AI to Make AI Safe'. Is That Crazy? | Ajeya Cotra

And that's the point at which they should definitely sound the alarm and like we should sort of

2220.421 View full episode →

80,000 Hours Podcast

Every AI Company's Safety Plan is 'Use AI to Make AI Safe'. Is That Crazy? | Ajeya Cotra

know what's happening.

2226.383 View full episode →

80,000 Hours Podcast

Every AI Company's Safety Plan is 'Use AI to Make AI Safe'. Is That Crazy? | Ajeya Cotra

So yeah.

2228.605 View full episode →

80,000 Hours Podcast

Every AI Company's Safety Plan is 'Use AI to Make AI Safe'. Is That Crazy? | Ajeya Cotra

Yeah, I think that...

2260.335 View full episode →

80,000 Hours Podcast

Every AI Company's Safety Plan is 'Use AI to Make AI Safe'. Is That Crazy? | Ajeya Cotra

The response just tends to differ based on the actual information that's being asked for.

2262.952 View full episode →

80,000 Hours Podcast

Every AI Company's Safety Plan is 'Use AI to Make AI Safe'. Is That Crazy? | Ajeya Cotra

So benchmark scores, they already release.

2269.902 View full episode →

80,000 Hours Podcast

Every AI Company's Safety Plan is 'Use AI to Make AI Safe'. Is That Crazy? | Ajeya Cotra

Like I said, they release it at the point of releasing a product, which I think is fine for now.

2273.207 View full episode →

80,000 Hours Podcast

Every AI Company's Safety Plan is 'Use AI to Make AI Safe'. Is That Crazy? | Ajeya Cotra

But I would like to move it to a regime where they release benchmark scores at some sort of fixed cadence, even if they don't have a product release.

2277.973 View full episode →

80,000 Hours Podcast

Every AI Company's Safety Plan is 'Use AI to Make AI Safe'. Is That Crazy? | Ajeya Cotra

Benchmark scores are not considered sensitive information.

2287.327 View full episode →

80,000 Hours Podcast

Every AI Company's Safety Plan is 'Use AI to Make AI Safe'. Is That Crazy? | Ajeya Cotra

But this other stuff that I think is a lot more informative on the margin is much more fraught, right?

2291.032 View full episode →

80,000 Hours Podcast

Every AI Company's Safety Plan is 'Use AI to Make AI Safe'. Is That Crazy? | Ajeya Cotra

They don't necessarily want to share with the world the rate at which they're gaining algorithmic insights because you want to maintain some mystery about that for competitive reasons.

2298.463 View full episode →

80,000 Hours Podcast

Every AI Company's Safety Plan is 'Use AI to Make AI Safe'. Is That Crazy? | Ajeya Cotra

It's risky for you if it's a little bit too fast because then, I don't know, competitors will start...

2311.282 View full episode →

80,000 Hours Podcast

Every AI Company's Safety Plan is 'Use AI to Make AI Safe'. Is That Crazy? | Ajeya Cotra

paying more attention to you and trying to copy you and trying to find out what's going on.

2318.352 View full episode →

80,000 Hours Podcast

Every AI Company's Safety Plan is 'Use AI to Make AI Safe'. Is That Crazy? | Ajeya Cotra

It's also risky for you if it's too slow, because then that's kind of embarrassing.

2323.662 View full episode →

80,000 Hours Podcast

Every AI Company's Safety Plan is 'Use AI to Make AI Safe'. Is That Crazy? | Ajeya Cotra

Yeah, investors lose heart.

2330.234 View full episode →

80,000 Hours Podcast

Every AI Company's Safety Plan is 'Use AI to Make AI Safe'. Is That Crazy? | Ajeya Cotra

And another thing I didn't mention earlier is that I would really like them to be reporting their

2331.937 View full episode →

80,000 Hours Podcast

Every AI Company's Safety Plan is 'Use AI to Make AI Safe'. Is That Crazy? | Ajeya Cotra

most concerning misalignment related safety incidents.

2337.067 View full episode →

80,000 Hours Podcast

Every AI Company's Safety Plan is 'Use AI to Make AI Safe'. Is That Crazy? | Ajeya Cotra

So like, has it ever been the case that in real life use within the company, the model lied about something important and covered up the logs?

2341.235 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment