Stephen McAleese
If ASI alignment is extremely difficult, we should stop ASI progress to avoid creating an ASI which would be misaligned with high probability and catastrophic for humanity in expectation.
If AI alignment is easy, we should build an ASI to bring about a futuristic utopia.
Therefore, one's belief about the difficulty of the AI alignment problem is a key crux for deciding how we should govern the future of AI development.
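To make this crux concrete, here is a minimal expected-value sketch in Python. The function name and all probabilities and utilities are illustrative placeholders of my own, not numbers from the book:

```python
# Toy expected-value model of the decision to build an ASI.
# All numbers below are illustrative placeholders, not estimates from the book.

def ev_of_building_asi(p_misaligned: float,
                       utility_utopia: float = 1.0,
                       utility_extinction: float = -10.0) -> float:
    """Expected utility of building an ASI, given P(misalignment)."""
    p_aligned = 1.0 - p_misaligned
    return p_aligned * utility_utopia + p_misaligned * utility_extinction

# If alignment is easy (low P(misalignment)), building looks positive in
# expectation; if alignment is hard, building is strongly negative.
for p in (0.01, 0.5, 0.99):
    print(f"P(misaligned) = {p:.2f} -> EV = {ev_of_building_asi(p):+.2f}")
```

Under these toy numbers, the sign of the expected value flips as P(misalignment) rises, which is exactly why the difficulty of alignment is the crux.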
Background arguments to the key claim

To avoid making this post too long, I'm going to assume that the following arguments made by the book are true.
- General intelligence is extremely powerful. Humans are the first entities to have high general intelligence, and they used it to transform the world to better satisfy their own goals.
- ASI is possible and likely to be created in the near future. The laws of physics permit an ASI to be built, and economic incentives make it likely to be built soon because doing so would be profitable.
- A misaligned ASI would cause human extinction, which would be an undesirable outcome.
- It's possible that an ASI could be misaligned and have alien goals; conversely, it's also possible to create an ASI aligned with human values (see the orthogonality thesis).
The book explains these arguments in detail in case you want to learn more about them.
I'm assuming these arguments are true because I haven't seen high-quality counter-arguments against them, and I doubt such counter-arguments exist. In contrast, the book's claim that successfully aligning an ASI with human values is difficult and unlikely to succeed seems more controversial: it is less obvious to me, and I have seen high-quality counter-arguments against it. Therefore, it's the claim I'm focusing on in this post.
The following section focuses on what I think is one of the key claims and cruxes of the book: that solving the AI alignment problem would be extremely difficult and that the first ASI would almost certainly be misaligned and harmful to humanity rather than aligned and beneficial.
The Key Claim: ASI alignment is extremely difficult to solve
The key claim of the book is that building an ASI would lead to the extinction of humanity. Why? Because the authors believe that the AI alignment problem is so difficult that we are very unlikely to successfully aim the first ASI at a desirable goal.