Menu
Sign In Search Podcasts Libraries Charts People & Topics Add Podcast API Blog Pricing

Rob Wiblin

πŸ‘€ Speaker
3881 total appearances
Voice ID

Voice Profile Active

This person's voice can be automatically recognized across podcast episodes using AI voice matching.

Voice samples: 1
Confidence: Medium

Appearances Over Time

Podcast Appearances

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

of that broad idea?

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

I guess an obvious problem with that is you will successfully stop the model from doing scammy things that seem to accomplish the goal that I guess it would get reinforced for but actually don't accomplish the goal that you had in mind.

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

But on the other hand, you would also block it off from doing stuff that would have accomplished the goal that was actually a brilliant insight that you never would have had.

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

So I guess if you were training the Go model, AlphaGo,

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

if humans were evaluating whether the moves were good, then the model actually couldn't end up exceeding human performance because they would grade moves that were actually unexpectedly brilliant as bad at the early stage.

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

How do you get around that?

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

So you're saying the overseer could see that the hypothetical AI running the business made a lot of money, but they evaluate the process that it went through.

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

They can include that information, but they do it in light of also looking at the process as well?

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

Is this approach going to be useful for frontier models any time soon?

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

Could you see companies actually using it?

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

It would improve performance because you wouldn't get the reward hacking.

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

Okay, so that's an increase in efficiency that you get from this approach to RL that might allow it to remain competitive in terms of its raw performance with other, I guess, less myopic approaches.

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

In theory.

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

Okay, the second paper from GDM that didn't get a ton of attention is called An Approach to Technical AGI Safety and Security.

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

It was written by about 30 GDM staff members, or there's 30 bylines on it.

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

I guess it describes as far as... It's like a position paper, as far as I can tell, of broadly what does GDM think it's going to do as it develops AGI, which makes it...

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

DeepMind plausibly is the organization that is most likely to do this.

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

It makes it a bit surprising that people are not interested in this incredibly thorough description of what you think you are and aren't going to do and why.

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

It is quite long, but it has a nice, I guess, 10-minute long summary at the start that you could use to get an overview if people are interested.

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

So if you want to be ahead of the curve on understanding GDM's approach to developing AGI, then you could spend that 10 minutes doing that.