Rob Wiblin

Speaker
3881 total appearances
Voice ID

Voice Profile Active

This person's voice can be automatically recognized across podcast episodes using AI voice matching.

Voice samples: 1
Confidence: Medium

Podcast Appearances

80,000 Hours Podcast
I Know How to Build Safe Superintelligence | Yoshua Bengio, the most-cited AI researcher

And even if you can get massively greater guarantees of safety using much better alternative approaches.

In the logarithm domain, it's infinitely better.

Sure, sure.

But in the expected value domain.

And I think that's kind of the difference in the two mentalities here.

Do you think current models internally represent truth?

I guess you're saying one advantage of this model is that it's focused on representing ground truth as a latent variable.

My guess is that current LLMs do that as well because that is very useful to have some sense of what's actually correct.

And then they distort that.

They basically start with that and then they distort it in order to accomplish the goals, including manipulating people or lying or whatever else.

I guess some people doubt that.

Some people doubt whether there is any connection or that they are actually trying to model truth.

Do you have a view?

Yeah, I completely agree with you.

Okay, so as far as I can tell, there's three big approaches here.

One is we're going to use this model as a monitor, as a guardrail.

Another would be we're going to just train it from scratch and make this be the whole approach.

Another would be we could take the current models and try to make them more honest, make them more like a scientist AI.

Do you want to talk at all about whether that approach has any good prospects?