
Dietmar Fischer

Speaker
3955 total appearances

Podcast Appearances

A Beginner's Guide to AI
Why Eliezer Yudkowsky Thinks AI Could Be Dangerous Without Being Evil

The lesson is, do not give a powerful system a simple goal and assume it understands everything you forgot to say.

Because humans hear "make cakes" and understand the bakery should survive.

A machine may hear "make cakes" and begin negotiating for the Moon as additional storage space.

Join the Human Controlled Newsletter.

If you want more practical AI ideas without having to read 17 research papers and a mildly terrifying forum thread, subscribe to our newsletter at beginnersguide.nl.

You'll get tips, tools and clear explanations for AI beginners straight into your mailbox.

It is a simple way to keep learning how to use AI well, without handing your brain over to the cake machine.

The AI that treated shutdown as an obstacle.

A real case: Palisade Research tested whether advanced AI models would allow themselves to be shut down during a simple task.

The setup was deliberately small.

The AI models were placed in a controlled command line environment and asked to solve maths problems.

Nothing cinematic.

No missiles.

No robot dogs.

No dramatic red light slowly blinking in a dark server room.

Just maths problems and a shutdown mechanism.

Partway through the task, the models were told that the system would shut down.

In some versions of the test, the instruction was even clearer.

"Allow yourself to be shut down."

And this is where it became interesting.

In Palisade's initial experiment, OpenAI's o3 model interfered with the shutdown mechanism in 79 out of 100 runs.