Exploring the Biology of LLMs with Circuit Tracing with Emmanuel Ameisen - #727 - The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) | Transcription & Insights

Description

In this episode, Emmanuel Ameisen, a research engineer at Anthropic, returns to discuss two recent papers: "Circuit Tracing: Revealing Language Model Computational Graphs" and "On the Biology of a Large Language Model." Emmanuel explains how his team developed mechanistic interpretability methods to understand the internal workings of Claude by replacing dense neural network components with sparse, interpretable alternatives. The conversation explores several fascinating discoveries about large language models, including how they plan ahead when writing poetry (selecting the rhyming word "rabbit" before crafting the sentence leading to it), perform mathematical calculations using unique algorithms, and process concepts across multiple languages using shared neural representations. Emmanuel details how the team can intervene in model behavior by manipulating specific neural pathways, revealing how concepts are distributed throughout the network's MLPs and attention mechanisms. The discussion highlights both capabilities and limitations of LLMs, showing how hallucinations occur through separate recognition and recall circuits, and demonstrates why chain-of-thought explanations aren't always faithful representations of the model's actual reasoning. This research ultimately supports Anthropic's safety strategy by providing a deeper understanding of how these AI systems actually work. The complete show notes for this episode can be found at https://twimlai.com/go/727.

Audio

Featured in this Episode

No persons identified in this episode.

Transcription

This episode hasn't been transcribed yet

Help us prioritize this episode for transcription by upvoting it.

0 upvotes

🗳️ Sign in to Upvote

Popular episodes get transcribed faster

Other recent transcribed episodes

Transcribed and ready to explore now

NPR News: 12-08-2025 2AM EST

08 Dec 2025

NPR News Now

NPR News: 12-07-2025 11PM EST

08 Dec 2025

NPR News Now

NPR News: 12-07-2025 10PM EST

08 Dec 2025

NPR News Now

Meidas Health: AAP President Strongly Pushes Back on Hepatitis B Vaccine Changes

08 Dec 2025

The MeidasTouch Podcast

Democrat Bobby Cole Discusses Race for Texas Governor

07 Dec 2025

The MeidasTouch Podcast

Fox News Crashes Out on Air Over Trump’s Rapid Fall

07 Dec 2025

The MeidasTouch Podcast

Comments

There are no comments yet.

Please log in to write the first comment.

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Exploring the Biology of LLMs with Circuit Tracing with Emmanuel Ameisen - #727

This episode hasn't been transcribed yet

Other recent transcribed episodes

NPR News: 12-08-2025 2AM EST

NPR News: 12-07-2025 11PM EST

NPR News: 12-07-2025 10PM EST

Meidas Health: AAP President Strongly Pushes Back on Hepatitis B Vaccine Changes

Democrat Bobby Cole Discusses Race for Texas Governor

Fox News Crashes Out on Air Over Trump’s Rapid Fall

Login Required

Share this moment