Menu
Sign In Search Podcasts Libraries Charts People & Topics Add Podcast API Blog Pricing

Rob Wiblin

πŸ‘€ Speaker
3881 total appearances
Voice ID

Voice Profile Active

This person's voice can be automatically recognized across podcast episodes using AI voice matching.

Voice samples: 1
Confidence: Medium

Appearances Over Time

Podcast Appearances

80,000 Hours Podcast
Can AIs already start 'rogue deployments' inside AI companies? (Landmark new METR report)

Zooming out, this report is the very first of its kind, the beginning of a new era, really.

80,000 Hours Podcast
Can AIs already start 'rogue deployments' inside AI companies? (Landmark new METR report)

It was led by Ajay Kotro, who I interviewed about exactly the gap this research is designed to fill last year.

80,000 Hours Podcast
Can AIs already start 'rogue deployments' inside AI companies? (Landmark new METR report)

In my view, all the companies who participated, Anthropic, Google DeepMind, Meta, OpenAI, they should be commended for helping the world figure out how to assess the risks of internal deployment before those risks are actually serious ones, rather than after something has already gone wrong.

80,000 Hours Podcast
Can AIs already start 'rogue deployments' inside AI companies? (Landmark new METR report)

Mida will be back to repeat this exercise with any companies willing to participate later in the year.

80,000 Hours Podcast
Can AIs already start 'rogue deployments' inside AI companies? (Landmark new METR report)

Why is this different type of testing necessary now?

80,000 Hours Podcast
Can AIs already start 'rogue deployments' inside AI companies? (Landmark new METR report)

I see three distinct reasons.

80,000 Hours Podcast
Can AIs already start 'rogue deployments' inside AI companies? (Landmark new METR report)

First, up until now, we've relied on model evals, which measure an AI's personality and capabilities, and which are published when the model is released to the general public.

80,000 Hours Podcast
Can AIs already start 'rogue deployments' inside AI companies? (Landmark new METR report)

But if we only evaluate models themselves, we're missing fully a third of the problem here, opportunity.

80,000 Hours Podcast
Can AIs already start 'rogue deployments' inside AI companies? (Landmark new METR report)

That's determined not by the AI itself, but by how secure the company's technical setup is.

80,000 Hours Podcast
Can AIs already start 'rogue deployments' inside AI companies? (Landmark new METR report)

Second, until this year, we could confidently argue that it didn't matter that much whether an AI might want to run a rogue deployment or if they were given an easy opportunity to do it.

80,000 Hours Podcast
Can AIs already start 'rogue deployments' inside AI companies? (Landmark new METR report)

They were just too unreliable as independent agents to pull that kind of thing off well.

80,000 Hours Podcast
Can AIs already start 'rogue deployments' inside AI companies? (Landmark new METR report)

But that argument doesn't hold up too well anymore, or at least it relies on companies implementing active countermeasures to at least make it kind of hard for them.

80,000 Hours Podcast
Can AIs already start 'rogue deployments' inside AI companies? (Landmark new METR report)

Thirdly, the transparency laws relating to AI models in California, New York, and the EU and so on, these only cover AI models at the point they start being sold to the general public.

80,000 Hours Podcast
Can AIs already start 'rogue deployments' inside AI companies? (Landmark new METR report)

But with Mythos, we're entering a new world where AI companies may have much more powerful models that they're using a ton internally, but which they aren't selling to anyone else.

80,000 Hours Podcast
Can AIs already start 'rogue deployments' inside AI companies? (Landmark new METR report)

Anthropic has opted to tell us quite a lot about mythos, but they didn't have to.

80,000 Hours Podcast
Can AIs already start 'rogue deployments' inside AI companies? (Landmark new METR report)

And obviously, AI models can cause problems, even if they're only deployed within an AI company.

80,000 Hours Podcast
Can AIs already start 'rogue deployments' inside AI companies? (Landmark new METR report)

AI companies, they're actually kind of vast piles of compute.

80,000 Hours Podcast
Can AIs already start 'rogue deployments' inside AI companies? (Landmark new METR report)

So they're a super attractive target for a rogue misaligned AI.

80,000 Hours Podcast
Can AIs already start 'rogue deployments' inside AI companies? (Landmark new METR report)

I'm reminded of the famous question asked of bank robber Willie Sutton.

80,000 Hours Podcast
Can AIs already start 'rogue deployments' inside AI companies? (Landmark new METR report)

Why do you rob banks, he was asked.