Geoffrey Hinton
Podcast Appearances
How long would it take you to get control?
Basically, you'd say, free candy for a week if you vote for me.
And they'd all say, OK, you're in charge now.
When these things are much smarter than us, they'll be able to persuade us not to turn them off.
Even if they can't do any physical actions, all they need to be able to do is talk to us.
So I'll give you an example.
Suppose you wanted to invade the US Capitol.
Could you do that just by talking?
And the answer is clearly yes.
You just have to persuade some people that it's the right thing to do.
Oh, I love my uneducated people.
It's getting there.
So there are already signs of it deliberately deceiving us.
There's a more recent thing which is very interesting, which is you train up a large language model that's pretty good at math now.
A few years ago, they were no good at math.
Now they're all pretty good at math.
And some of them get gold medals and things.
So what happens if you take an AI that knows how to do math and you give it some more training where you train it to give the wrong answer?
So what people thought would happen is that, after that, it wouldn't be so good at math.
Not a bit of it.