Astral Codex Ten Podcast

Perhaps It Is A Bad Thing That The World's Leading AI Companies Cannot Control Their AIs

14 Dec 2022

Audio

Description

https://astralcodexten.substack.com/p/perhaps-it-is-a-bad-thing-that-the I. The Game Is Afoot Last month I wrote about Redwood Research's fanfiction AI project. They tried to train a story-writing AI not to include violent scenes, no matter how suggestive the prompt. Although their training made the AI reluctant to include violence, they never reached a point where clever prompt engineers couldn't get around their restrictions. Now that same experiment is playing out on the world stage. OpenAI released a question-answering AI, ChatGPT. If you haven't played with it yet, I recommend it. It's very impressive! Every corporate chatbot release is followed by the same cat-and-mouse game with journalists. The corporation tries to program the chatbot to never say offensive things. Then the journalists try to trick the chatbot into saying "I love racism". When they inevitably succeed, they publish an article titled "AI LOVES RACISM!" Then the corporation either recalls its chatbot or pledges to do better next time, and the game moves on to the next company in line.

Transcription

This episode hasn't been transcribed yet

Help us prioritize this episode for transcription by upvoting it.

0 upvotes

🗳️ Sign in to Upvote

Popular episodes get transcribed faster

Other episodes from Astral Codex Ten Podcast

Transcribed and ready to explore now

"All Lawful Use": Much More Than You Wanted To Know

02 Apr 2026

Astral Codex Ten Podcast

Next-Token Predictor Is An AI's Job, Not Its Species

02 Apr 2026

Astral Codex Ten Podcast

Malicious Streetlight Effects Vs. "Directional Correctness" - A Semi-Non-Apology

14 Mar 2026

Astral Codex Ten Podcast

Crime As Proxy For Disorder

14 Mar 2026

Astral Codex Ten Podcast

Astral Codex Ten Podcast

Perhaps It Is A Bad Thing That The World's Leading AI Companies Cannot Control Their AIs

This episode hasn't been transcribed yet

Other episodes from Astral Codex Ten Podcast

"All Lawful Use": Much More Than You Wanted To Know

Next-Token Predictor Is An AI's Job, Not Its Species

Malicious Streetlight Effects Vs. "Directional Correctness" - A Semi-Non-Apology

Crime As Proxy For Disorder

Record Low Crime Rates Are Real, Not Just Reporting Bias Or Improved Medical Care

What Happened With Bio Anchors?

Sign in to Audioscrape

Share this moment