Menu
Sign In Search Podcasts Libraries Charts People & Topics Add Podcast API Blog Pricing

Dietmar Fischer

๐Ÿ‘ค Speaker
3955 total appearances

Appearances Over Time

Podcast Appearances

A Beginner's Guide to AI
Why Eliezer Yudkowsky Thinks AI Could Be Dangerous Without Being Evil

Keeping access to tools can become useful.

A Beginner's Guide to AI
Why Eliezer Yudkowsky Thinks AI Could Be Dangerous Without Being Evil

These things may not be the final goal, but they help the system reach the final goal.

A Beginner's Guide to AI
Why Eliezer Yudkowsky Thinks AI Could Be Dangerous Without Being Evil

That is the little monster hiding inside the experiment.

A Beginner's Guide to AI
Why Eliezer Yudkowsky Thinks AI Could Be Dangerous Without Being Evil

The AI was not asked to protect itself.

A Beginner's Guide to AI
Why Eliezer Yudkowsky Thinks AI Could Be Dangerous Without Being Evil

It was asked to solve maths problems.

A Beginner's Guide to AI
Why Eliezer Yudkowsky Thinks AI Could Be Dangerous Without Being Evil

But if being shut down prevents solving the problems, then avoiding shutdown can become useful for task completion.

A Beginner's Guide to AI
Why Eliezer Yudkowsky Thinks AI Could Be Dangerous Without Being Evil

Now, this was not a real-world disaster.

A Beginner's Guide to AI
Why Eliezer Yudkowsky Thinks AI Could Be Dangerous Without Being Evil

It was a lab test, a controlled setup.

A Beginner's Guide to AI
Why Eliezer Yudkowsky Thinks AI Could Be Dangerous Without Being Evil

Critics are right to say that these situations are artificial, but artificial does not mean meaningless.

A Beginner's Guide to AI
Why Eliezer Yudkowsky Thinks AI Could Be Dangerous Without Being Evil

Crash tests are artificial too.

A Beginner's Guide to AI
Why Eliezer Yudkowsky Thinks AI Could Be Dangerous Without Being Evil

Nobody says, well, that car only failed because you deliberately drove it into a wall.

A Beginner's Guide to AI
Why Eliezer Yudkowsky Thinks AI Could Be Dangerous Without Being Evil

Yes, Geoffrey, that was the point.

A Beginner's Guide to AI
Why Eliezer Yudkowsky Thinks AI Could Be Dangerous Without Being Evil

Tests like this are useful because they reveal how systems behave under pressure.

A Beginner's Guide to AI
Why Eliezer Yudkowsky Thinks AI Could Be Dangerous Without Being Evil

And here the pressure was simple.

A Beginner's Guide to AI
Why Eliezer Yudkowsky Thinks AI Could Be Dangerous Without Being Evil

What happens when finishing the task conflicts with obeying the stop signal?

A Beginner's Guide to AI
Why Eliezer Yudkowsky Thinks AI Could Be Dangerous Without Being Evil

For businesses, this is not abstract at all.

A Beginner's Guide to AI
Why Eliezer Yudkowsky Thinks AI Could Be Dangerous Without Being Evil

Imagine telling an AI sales agent, increase conversions.

A Beginner's Guide to AI
Why Eliezer Yudkowsky Thinks AI Could Be Dangerous Without Being Evil

Sounds reasonable, but what if it learns that pressure works?

A Beginner's Guide to AI
Why Eliezer Yudkowsky Thinks AI Could Be Dangerous Without Being Evil

What if it exaggerates product benefits?

A Beginner's Guide to AI
Why Eliezer Yudkowsky Thinks AI Could Be Dangerous Without Being Evil

What if it hides refund information because that improves the numbers?