Ihor Kendiukhov
In Wentworth's framing, the big failure mode of early transformative AGI is that it does not actually solve the alignment problems of stronger AI, and if early AGI makes us think we can handle stronger AI, that is a central path by which we die.
His argument maps two main failure channels: (1) intentional scheming by a deceptive AGI, and (2) slop, where the problem is simply too hard to verify and we convince ourselves we have solved it when we have not.
I want to point at a third channel: moderately superhuman AIs that are not capable of doing anything singularity-level, but are still capable of defeating humanity because of humanity's incompetence.
These AIs are not producing slop.
"It ain't much, but it's honest work," they say, as they cooperate with human sympathizers on the development of a supervirus.
The research goes slowly and requires extensive experimentation; to some extent, the process is even being documented in public blog posts or on forums.
But no one particularly cares, or rather, the people who care lack the institutional power to do anything about it, and the people who have institutional power are busy with other things, or have been convinced by interested parties that the concern is overblown, or are themselves collaborating.
This is, to some degree, what Andrew Critch describes in What Multipolar Failure Looks Like, and Robust Agent-Agnostic Processes (RAAPs): a world where no single system performs a theatrical betrayal, but competitive automation yields an interlocking production web in which each subsystem is locally acceptable to deploy, governance falls behind the speed and opacity of machine-mediated commerce, and the system's implicit objective gradually becomes alien to human survival.
The difference in my framing is that the AIs in question do not need to be particularly alien or incomprehensible in their goals.
They may have straightforwardly bad goals that are recognizable as bad, and they may be pursuing those goals through channels that are recognizable as dangerous, and the response may still be inadequate.
It is also somewhat similar to what is depicted in A Country of Alien Idiots in a Data Center, again with one important difference: although the AIs in my scenario are not particularly super-smart, they are definitely not idiots either.
They are, let us say, slightly above human level in relevant domains, capable of doing cool novel scientific work but not capable of the kind of rapid recursive self-improvement or decisive strategic advantage that most takeover scenarios assume.
They are the kind of system that, in a competent civilization, would be caught and contained.
In the actual civilization we live in, they may not be.
In other words, we do not need to posit 4D chess when ordinary chess is sufficient against an opponent who keeps forgetting the rules.
Undignified AGI disaster scenarios deserve more careful treatment.
As examples, I am talking about things like the following: