Geoffrey Hinton

StarTalk Radio

The Origins of Artificial Intelligence with Geoffrey Hinton

Okay, but that's decentralized.

3039.045 View full episode →

StarTalk Radio

The Origins of Artificial Intelligence with Geoffrey Hinton

It's a trillion real numbers, and nobody quite knows how they work.

3040.387 View full episode →

StarTalk Radio

The Origins of Artificial Intelligence with Geoffrey Hinton

Okay, so people have tried doing what's called human reinforcement learning.

3064.852 View full episode →

StarTalk Radio

The Origins of Artificial Intelligence with Geoffrey Hinton

So with a language model, you train it up to mimic lots of documents on the web, including possibly things like the diaries of serial killers, which presumably you wouldn't train your kid to read on those.

3070.24 View full episode →

StarTalk Radio

The Origins of Artificial Intelligence with Geoffrey Hinton

And then after you've trained this monster, what you do is you take a whole lot of not very well-paid people and you get them to ask it questions,

3082.117 View full episode →

StarTalk Radio

The Origins of Artificial Intelligence with Geoffrey Hinton

Maybe you tell it what questions to ask it.

3094.315 View full episode →

StarTalk Radio

The Origins of Artificial Intelligence with Geoffrey Hinton

But they then look at the answers and rate them for whether that's a good answer to give or whether you shouldn't say that.

3096.539 View full episode →

StarTalk Radio

The Origins of Artificial Intelligence with Geoffrey Hinton

And it's basically a morality filter.

3104.211 View full episode →

StarTalk Radio

The Origins of Artificial Intelligence with Geoffrey Hinton

And you train it up like that so that it doesn't give such bad answers.

3106.995 View full episode →

StarTalk Radio

The Origins of Artificial Intelligence with Geoffrey Hinton

Now, the problem is...

3110.802 View full episode →

StarTalk Radio

The Origins of Artificial Intelligence with Geoffrey Hinton

If you release the weights of the model, the connection strings, then someone else can come along with your model and very quickly undo that.

3112.945 View full episode →

StarTalk Radio

The Origins of Artificial Intelligence with Geoffrey Hinton

Yes, it's very easy to get rid of that layer of plugging the holes.

3121.914 View full episode →

StarTalk Radio

The Origins of Artificial Intelligence with Geoffrey Hinton

Right.

3125.658 View full episode →

StarTalk Radio

The Origins of Artificial Intelligence with Geoffrey Hinton

And really what they're doing with human reinforcement learning is like writing a huge software system that you know is full of bugs, and then trying to fix all the bugs.

3126.038 View full episode →

StarTalk Radio

The Origins of Artificial Intelligence with Geoffrey Hinton

It's not a good approach.

3134.907 View full episode →

StarTalk Radio

The Origins of Artificial Intelligence with Geoffrey Hinton

So what is the good approach?

3136.609 View full episode →

StarTalk Radio

The Origins of Artificial Intelligence with Geoffrey Hinton

Nobody knows, and so we should be doing research on it.