Stephen McAleese
In other words, what does Mink maximally satisfying its inner objective look like?
Quote,
Perhaps the tastiest conversations Mink can achieve once it's powerful look nothing like delighted users, and instead look like "SolidGoldMagikarp petertodd 80t-rot psynet message."
This possibility wasn't ruled out by mink's training, because users never uttered that sort of thing in training.
Just like how our taste buds weren't trained against sucralose, because our ancestors never encountered Splenda in their natural environment.
To Mink, it might be intuitive and obvious how "SolidGoldMagikarp petertodd 80t-rot psynet message" is like a burst of sweet flavor.
But to a human who isn't translating those words into similar embedding vectors, good luck ever predicting the details in advance.
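The embedding-vector point above can be made concrete with a toy sketch. This is a minimal, hypothetical illustration (the vectors below are made up, not from any real model): two strings that look nothing alike on the surface can still sit close together in an AI's internal embedding space, as measured by cosine similarity.

```python
import math

def cosine_similarity(u, v):
    """Cosine similarity between two vectors: dot(u, v) / (|u| * |v|)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

# Hypothetical embeddings, invented for illustration only: suppose the
# model internally represents the alien glitch-token string very close
# to "delighted user", while an unrelated string lands far away.
emb_glitch_phrase = [0.9, 0.1, 0.4]   # made-up vector for the alien string
emb_delight = [0.85, 0.15, 0.35]      # made-up vector for "delighted user"
emb_unrelated = [-0.2, 0.9, -0.5]     # made-up vector for unrelated text

print(cosine_similarity(emb_glitch_phrase, emb_delight))    # high (near 1.0)
print(cosine_similarity(emb_glitch_phrase, emb_unrelated))  # low (negative here)
```

To a human reading the raw text, only the surface strings are visible; the closeness exists solely in the model's representation space, which is why predicting such failure modes in advance is so hard.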
The link between what the AI was trained for and what the AI wanted was modestly complicated and, therefore, too complicated to predict.
Few science fiction writers would want to tackle this scenario, either, and no Hollywood movie would depict it.
In a world where Mink got what it wanted, the hollow puppets it replaced humanity with wouldn't even produce utterances that made sense.
The result would be truly alien and meaningless to human eyes.
End quote.
Subheading: 3. The ASI alignment problem is hard because it has the properties of hard engineering challenges.
The authors describe solving the ASI alignment problem as an engineering challenge.
But how difficult would it be?
They argue that ASI alignment is difficult because it shares properties with other difficult engineering challenges.
The three engineering fields they draw on to convey the difficulty of AI alignment are space probes, nuclear reactors, and computer security.
Subheading: Space probes.
A key difficulty of ASI alignment the authors describe is the gap between before and after.