Inside s1: An o1-Style Reasoning Model That Cost Under $50 to Train with Niklas Muennighoff - #721 - The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) | Transcription & Insights

Description

Today, we're joined by Niklas Muennighoff, a PhD student at Stanford University, to discuss his paper, “S1: Simple Test-Time Scaling.” We explore the motivations behind S1, as well as how it compares to OpenAI's O1 and DeepSeek's R1 models. We dig into the different approaches to test-time scaling, including parallel and sequential scaling, as well as S1’s data curation process, its training recipe, and its use of model distillation from Google Gemini and DeepSeek R1. We explore the novel "budget forcing" technique developed in the paper, allowing it to think longer for harder problems and optimize test-time compute for better performance. Additionally, we cover the evaluation benchmarks used, the comparison between supervised fine-tuning and reinforcement learning, and similar projects like the Hugging Face Open R1 project. Finally, we discuss the open-sourcing of S1 and its future directions. The complete show notes for this episode can be found at https://twimlai.com/go/721.

Audio

Featured in this Episode

No persons identified in this episode.

Transcription

This episode hasn't been transcribed yet

Help us prioritize this episode for transcription by upvoting it.

0 upvotes

🗳️ Sign in to Upvote

Popular episodes get transcribed faster

Other recent transcribed episodes

Transcribed and ready to explore now

NPR News: 12-08-2025 2AM EST

08 Dec 2025

NPR News Now

NPR News: 12-07-2025 11PM EST

08 Dec 2025

NPR News Now

NPR News: 12-07-2025 10PM EST

08 Dec 2025

NPR News Now

Meidas Health: AAP President Strongly Pushes Back on Hepatitis B Vaccine Changes

08 Dec 2025

The MeidasTouch Podcast

Democrat Bobby Cole Discusses Race for Texas Governor

07 Dec 2025

The MeidasTouch Podcast

Fox News Crashes Out on Air Over Trump’s Rapid Fall

07 Dec 2025

The MeidasTouch Podcast

Comments

There are no comments yet.

Please log in to write the first comment.

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Inside s1: An o1-Style Reasoning Model That Cost Under $50 to Train with Niklas Muennighoff - #721

This episode hasn't been transcribed yet

Other recent transcribed episodes

NPR News: 12-08-2025 2AM EST

NPR News: 12-07-2025 11PM EST

NPR News: 12-07-2025 10PM EST

Meidas Health: AAP President Strongly Pushes Back on Hepatitis B Vaccine Changes

Democrat Bobby Cole Discusses Race for Texas Governor

Fox News Crashes Out on Air Over Trump’s Rapid Fall

Login Required

Share this moment