Rob Wiblin
And they would be working on, often, interpretability or some kind of adversarial robustness. And they seemed like reasonable research bets.
But I felt kind of unsatisfied, and I think this is going to be a theme of my career. I felt unsatisfied that the theory of change hadn't really been ground out and spelled out: how this type of interpretability research would lead to this type of technique or ability, and how that could then fit into a plan to prevent AI takeover in this way. The same went for any of the other research streams we were funding.
And this had actually been the big thing that deterred me from getting involved in Open Phil's technical AI safety grantmaking for a long time, even though I was one of the few people on staff outside that team who thought about technical AI safety. In the end, it seemed like most grant decisions in this 2015 to 2022 period turned on heuristics like "this person is a cool researcher and they care about AI safety," which is totally reasonable.
But I wanted more of a story: this line of research is addressing this critical problem, this is why we think it's plausibly likely to succeed, and this is what it would mean if it succeeded. And we never really had that kind of very built-out strategy, because it's very hard. It's a lot to invest in building out a strategy like that.
But having been thrown headfirst into grantmaking with the FTX crisis, I was like, maybe I do want to try and take on the AI safety grantmaking portfolio, which at the time didn't have a leader, because all the people who had worked on that portfolio had left by that point. Some to go to the FTX Foundation, actually.
Okay.
And so it was a portfolio that had been somewhat orphaned within the organization, and it was clearly a very important thing. And I was like, maybe we could approach it in a way that was novel for us in this area: really try to form our own inside views about the priorities of different technical research directions, and really connect them to how they would address the problems we most cared about.