
Rob Wiblin

Speaker
3881 total appearances
Podcast Appearances

80,000 Hours Podcast
What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

One could be that they don't capture the full range of performance, that you end up either, I guess, floored at the bottom or capped at the top, especially because we're talking about models here over many years doing many, many different things.

There's definitely lots of gaming that goes on or like lots of teaching to the test that occurs with people trying to make their models look good on these benchmarks.

I guess it's also a question of like what actually matters.

There's benchmarks for all kinds of different skills and maybe you should give some of these things much more weight than others.

I guess you could also have non-linearities in the effect, you know, between the performance of a model and its economic effect; that could be, indeed probably is, quite non-linear.

How good do you think this whole approach that Epoch and you have been using to figure out whether progress is speeding up or remaining about the same pace is at getting at the kind of ground reality of what's going on?

I guess an example might be, you would say, well, the models are being gamed.

There's a bunch of teaching to the test happening now, but there was a bunch of teaching to the test happening two years ago and four years ago.

And so as long as that's not getting progressively worse, then the line is still reasonable.

I guess you're saying increasing rates of progress would be quite striking on the graph.

It probably would jump out at you and these effects wouldn't be enough to make it disappear.

So at the point that AI is people, or at least like is AI researchers rather than just being a tool for AI researchers, you might reasonably expect quite abrupt increases in progress in AI R&D and I guess like AI capabilities basically.

Are you a down vote on how abrupt that will be or whether that will occur at all?

Or do you buy that maybe it will take a bit longer than people are imagining, but you still think that that will happen?

But that's how it ends up happening over quite a number of years rather than months or something crazy.

Yeah, I think there's been a general phenomenon, I guess, over the years.

I guess every couple of years, there's kind of a freak out about AI timelines, and people start expecting a recursive self-improvement loop really quite soon, within a few years of that point.

My impression is that you've just been unmoved in either direction.

Why haven't you updated based on events that have occurred, results that have come out?

And why doesn't the general performance of reasoning models, I mean, I think that's one way of characterizing the update was people were shocked that RL was being applied to these models into reasoning.