Steven Byrnes
So that's consequentialism, one possible answer for how an AI might accomplish impressive feats, and it's an answer that brings in ruthlessness by default.
And then there's a second, different possible answer to how an AI might accomplish impressive feats.
Imitative learning from humans.
You train an AI to predict what actions a skilled human would take in many different contexts and then have the AI take that same action itself.
I claim that LLMs get their impressive capabilities almost entirely from imitative learning.
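The "predict what a skilled human would do, then do that" idea described here is what the machine-learning literature calls behavioral cloning. A minimal tabular sketch, with an entirely hypothetical toy domain and made-up state/action names, just to illustrate the shape of the technique:

```python
from collections import Counter, defaultdict

def fit_imitation_policy(demonstrations):
    """Tabular behavioral cloning: for each observed context (state),
    count which actions the expert took there, then have the policy
    output the expert's most common action for that context."""
    counts = defaultdict(Counter)
    for state, action in demonstrations:
        counts[state][action] += 1
    # Policy = "imitate the majority expert action per context".
    return {s: c.most_common(1)[0][0] for s, c in counts.items()}

# Toy expert demonstrations as (context, action) pairs (hypothetical).
demos = [
    ("light_red", "stop"),
    ("light_red", "stop"),
    ("light_green", "go"),
    ("light_red", "stop"),
    ("light_green", "go"),
]
policy = fit_imitation_policy(demos)
```

Real imitative learners (LLMs included) replace the lookup table with a learned predictive model that generalizes across contexts, but the objective is the same: reproduce the demonstrator's behavior, not optimize some independent goal.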
By contrast, true imitative learning is entirely absent and impossible in humans and animals.
Imitative-learning AIs are not ruthless sociopaths by default, because the thing they're imitating is, of course, non-ruthless humans.
Optimist.
Who?
Wait.
So you're an optimist about superintelligence, ASI, being non-ruthless, as long as people stick to LLMs?
Me?
Alas, no.
I think that the full power of consequentialism is super dangerous by default. I also think it's the only way to get ASI, and so AI researchers are going to keep working until they eventually learn to fully tap that power.
In other words, I see a disjunction.
Either LLMs will always get their powers primarily from imitative learning, as I claim they do today, in which case they will never be able to figure things out far beyond the human-created training data, and will thus never reach ASI.
And then eventually we'll get ASI via a different AI paradigm, one that can rocket arbitrarily far past any human data.
And that paradigm will have to draw its powers from consequentialism, which brings in ruthlessness by default.
Or someone will figure out how to get LLMs themselves to rocket arbitrarily far past their human training data and into ASI.
But the only way to do that is to somehow modify LLMs to draw on the full powers of consequentialism.