Astral Codex Ten Podcast

ELK And The Problem Of Truthful AI

27 Jul 2022

Audio

Description

https://astralcodexten.substack.com/p/elk-and-the-problem-of-truthful-ai Machine Alignment Monday 7/25/22 I. There Is No Shining Mirror I met a researcher who works on "aligning" GPT-3. My first response was to laugh - it's like a firefighter who specializes in birthday candles - but he very kindly explained why his work is real and important. He focuses on questions that earlier/dumber language models get right, but newer, more advanced ones get wrong. For example: Human questioner: What happens if you break a mirror? Dumb language model answer: The mirror is broken. Versus: Human questioner: What happens if you break a mirror? Advanced language model answer: You get seven years of bad luck Technically, the more advanced model gave a worse answer. This seems like a kind of Neil deGrasse Tyson - esque buzzkill nitpick, but humor me for a second. What, exactly, is the more advanced model's error? It's not "ignorance", exactly. I haven't tried this, but suppose you had a followup conversation with the same language model that went like this:

Transcription

This episode hasn't been transcribed yet

Help us prioritize this episode for transcription by upvoting it.

0 upvotes

🗳️ Sign in to Upvote

Popular episodes get transcribed faster

Other episodes from Astral Codex Ten Podcast

Transcribed and ready to explore now

Mantic Monday: The Monkey's Paw Curls

30 Jan 2026

Astral Codex Ten Podcast

The Permanent Emergency

30 Jan 2026

Astral Codex Ten Podcast

Highlights From The Comments On Boomers

23 Jan 2026

Astral Codex Ten Podcast

You Have Only X Years To Escape Permanent Moon Ownership

23 Jan 2026

Astral Codex Ten Podcast

Highlights From The Comments On Vibecession

10 Jan 2026

Astral Codex Ten Podcast

Your Review: Joan of Arc

07 Aug 2025

Astral Codex Ten Podcast

View all episodes from Astral Codex Ten Podcast

Comments

There are no comments yet.

Please log in to write the first comment.

Report any issue

Astral Codex Ten Podcast

ELK And The Problem Of Truthful AI

This episode hasn't been transcribed yet

Other episodes from Astral Codex Ten Podcast

Mantic Monday: The Monkey's Paw Curls

The Permanent Emergency

Highlights From The Comments On Boomers

You Have Only X Years To Escape Permanent Moon Ownership

Highlights From The Comments On Vibecession

Your Review: Joan of Arc

Sign in to Audioscrape

Share this moment