Rob Wiblin

👤 Speaker

3881 total appearances

Podcast Appearances

80,000 Hours Podcast

I Know How to Build Safe Superintelligence | Yoshua Bengio, the most-cited AI researcher

That's right.

1055.646

So when I heard about this idea nine to 12 months ago, I think the gloss that I got was that the core thing is that the scientist AI is not an agent, that it is indifferent about states of the world.

1248.983

Like a weather forecasting model doesn't care what the weather is.

1258.338

It just tries to predict what the weather is going to be.

1260.701

And this kind of model, it would spit out probabilities of things being true or false, but it wouldn't care what state of the world it is in.

1262.784

And it wouldn't be able to take actions by design.

1268.133

Is that kind of a core part in your mind?

1270.677

As I understand it, you think this is actually maybe more consistent with agency than people have appreciated.

1272.159

I think at one point a criticism of the plan was that it would be too easy to convert this kind of oracle into an agent, because you would just be able to ask the oracle, "Well, would we accomplish this goal if we took this action?"

1428.82

And it would give you the probability and you could just try to increase that probability and choose that action.

1437.789
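
[Editor's illustration, not from the episode] The conversion described in the two snippets above can be sketched in a few lines: ask a predictor for the probability that a goal is achieved under each candidate action, then pick the action with the highest probability. Everything here is an assumption made for illustration: `toy_oracle`, `choose_action`, the goal and action strings, and the hard-coded probabilities are hypothetical stand-ins, not anything from the scientist AI proposal itself.

```python
from typing import Callable, Iterable

# Hypothetical oracle type: maps (goal, action) to an estimated
# probability that the goal is achieved if the action is taken.
Oracle = Callable[[str, str], float]


def toy_oracle(goal: str, action: str) -> float:
    """Stand-in predictor with made-up numbers, for illustration only."""
    table = {
        ("stay dry", "take umbrella"): 0.9,
        ("stay dry", "wear sandals"): 0.2,
        ("stay dry", "stay home"): 0.99,
    }
    # Unknown (goal, action) pairs get probability 0.0 in this toy table.
    return table.get((goal, action), 0.0)


def choose_action(oracle: Oracle, goal: str, actions: Iterable[str]) -> str:
    """Return the candidate action the oracle rates most likely to achieve the goal."""
    return max(actions, key=lambda a: oracle(goal, a))


best = choose_action(
    toy_oracle, "stay dry", ["take umbrella", "wear sandals", "stay home"]
)
print(best)
```

The oracle itself only reports probabilities; the wrapper around it does the choosing. That is why the criticism treats a pure predictor as easy to turn into an agent.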

Is the idea that you would do something like that basically, but you would be able to preserve some of the safety characteristics of the original model?

1441.553

So the concern is that the predictor might be able to figure out basically gaps in the guardrail or weaknesses in the guardrail and exploit those if it... [interjection, apparently from the guest] Not the predictor. A policy that is trained separately, and in a way that is adversarial, could do that in principle.

1621.938

And you were saying the guardrail, if it can assess its own uncertainty, then it will usually be able to identify cases where it might be at risk of getting tricked in this case.

1673.729

But perhaps it's an adversarial example?

1682.119

Is that basically...?

1683.621

Yes, exactly.

1685.023

And you feel like you've proven this mathematically?

1686.204

Yeah, so it might be more trusted, but if it's much less intelligent in general, just because less compute went into it, less data went into it, then I guess it could just still be outfoxed by the original model, right?

1839.153

That's right.

1849.311

You said that over the last six months, I guess you were keen on this idea a year ago, but you've become a lot more optimistic about it over the last six months.

1904.414
