Menu
Sign In Search Podcasts Libraries Charts People & Topics Add Podcast API Blog Pricing

Rob Wiblin

πŸ‘€ Speaker
3881 total appearances
Voice ID

Voice Profile Active

This person's voice can be automatically recognized across podcast episodes using AI voice matching.

Voice samples: 1
Confidence: Medium

Appearances Over Time

Podcast Appearances

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

People can Google and read the blog post if they want.

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

But a few that stood out to me was...

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

He was troubled that on persuasion evaluations, you only gave odds ratios rather than absolute levels, so you couldn't tell exactly how persuasive Gemini 3 or Gemini 2.5 was.

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

On the cybersecurity evals, on his reading, he thought that you treated the model hacking the test rather than solving the test in the natural way as kind of a green light, a reason not to be concerned, rather than a red light, a reason to be more worried.

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

And in terms of helping people to acquire WMDs, it fell to him, and I think...

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

I think a lot of people have had this impression, not just about GDM, but about companies in general, that when it seems like the models are starting to approach the line, maybe it's a bit ambiguous whether they're above or below the line that they had six months or a year ago.

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

It feels like the goalposts kind of shift and the standards rise over time so that always the model is basically acceptable to put out.

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

And that's what he was worried about was happening here with Gemini 3 as well.

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

How do you respond to these kinds of objections?

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

Thanks.

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

I guess you don't know whether it could have done the harder thing because it might have done it because it couldn't do the harder thing or it might have just done it because it was easier.

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

I mean, you can understand, to a cynic like Zhu Yi, doesn't it seem overly convenient that the test is good enough to determine that the model is safe, but not good enough for you to include any specific details in the report itself?

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

I mean, people don't read papers, Rohan.

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

They read the model card because they're psyched about a new model launch.

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

I guess is it just impractical on that kind of timeline to write the equivalent of a paper or provide the level of detail and rigor that you would in a paper in the model card?

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

Okay, and the third concern was that there's this general phenomenon, it seems, that as the models get more capable with each iteration, it feels like the bar for what would be troubling, the bar for what would seem to be unsafe, rises approximately the same as the capabilities have gone up.

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

What do you make of that?

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

Yeah, I mean, that speaks to the fact that the purpose of these model cards in your mind is very different than the purpose that they serve in the mind of someone like Shvi, or I guess like commentators in general.

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

I think Shvi and many people want them to be an accountability mechanism, a mechanism by which

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

the alarm could be sounded if the models were dangerous or were becoming more dangerous, where GDM would have to reveal that that was the case in the model card because they have to put out these results.