Next in AI: Your Daily News Podcast
Episodes
Google Gemini 3 Deep Think: Advancing Science and Engineering Reasoning
17 Feb 2026
Contributed by Lukas
This discussion revolves around the release of Gemini 3 Deep Think, highlighting its record-breaking performance on the ARC-AGI-2 benchmark. Users ...
Vibe Citing: The Hallucination Crisis at NeurIPS 2025
24 Jan 2026
Contributed by Lukas
Recent investigations by GPTZero uncovered over 100 fabricated citations in research papers accepted for the NeurIPS 2025 conference. These "...
Brain Surgery for LLMs: Scaling Transformers with Embedding Modules
21 Jan 2026
Contributed by Lukas
The provided research introduces STEM (Scaling Transformers with Embedding Modules), a novel architecture designed to enhance the efficiency and know...
Open Responses: An Interoperable LLM Interface Specification
17 Jan 2026
Contributed by Lukas
Open Responses is a community-governed, vendor-neutral specification designed to standardize how developers interact with large language models. By ...
Silicon Supremacy: Nvidia and Apple Fight for TSMC Chips
16 Jan 2026
Contributed by Lukas
A significant shift in power is occurring at the semiconductor giant TSMC as Nvidia challenges Apple's long-standing status as the foundry...
Introducing Cowork: Claude for the Rest of Your Work
14 Jan 2026
Contributed by Lukas
This discussion explores the launch of Claude Cowork, an AI agent designed to automate general office tasks by managing local files and applications....
ChatGPT and Humans Solve an Erdős Problem
12 Jan 2026
Contributed by Lukas
Recent progress in artificial intelligence has enabled the autonomous solution of Erdős Problem #728, marking a significant milestone in computation...
ChatGPT Health: AI, Medicine, and the Privacy Frontier
08 Jan 2026
Contributed by Lukas
The podcast features a wide-ranging debate regarding ChatGPT Health, a new marketplace and diagnostic tool, and the broader implications of AI in me...
Claude Code LSP Support and the IDE Identity Crisis
24 Dec 2025
Contributed by Lukas
The provided podcast features a discussion regarding Claude Code's new native LSP support and its implications for the software development indu...
The Dawn of Reasoning: AI Reflections at the end of 2025
22 Dec 2025
Contributed by Lukas
In this reflective analysis, the podcast examines the evolving landscape of artificial intelligence by the end of 2025, noting a significant shift i...
Anthropic Agent Skills: A New Paradigm for Universal AI Expertise
20 Dec 2025
Contributed by Lukas
Anthropic researchers propose a shift from creating specialized AI agents to developing modular "skills" that provide domain-specific expe...
GPT Image 1.5: ChatGPT Images Strategic Shift
17 Dec 2025
Contributed by Lukas
The podcast provides an overview of GPT Image 1.5, a new flagship image generation model released by OpenAI, detailing its features and performance. ...
Introducing GPT-5.2: The New Frontier Model
15 Dec 2025
Contributed by Lukas
The podcast provides an overview of the new GPT-5.2 model release from OpenAI, detailing its improved performance across various professional and aca...
LLM Stock Market Showdown: Eight-Month Backtest
05 Dec 2025
Contributed by Lukas
The podcast describes an experiment called the AI Trade Arena, which was created to evaluate the predictive and analytical capabilities of large lan...
Anthropic Bought Bun: Why They Need It
03 Dec 2025
Contributed by Lukas
The podcast, which includes excerpts from the Bun Blog and a corresponding online discussion, focuses on the acquisition of the Bun JavaScript runtim...
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models
01 Dec 2025
Contributed by Lukas
This podcast introduces DeepSeek-V3.2, a novel open Large Language Model engineered to balance high computational efficiency with cutting-edge reas...
Elon Musk: X, Starlink, and the Singularity's Edge
01 Dec 2025
Contributed by Lukas
The provided podcast captures excerpts from a wide-ranging conversation between Elon Musk and Nikhil Kamath, concentrating on advice for aspiring entr...
Ilya Sutskever says AI scaling is over
26 Nov 2025
Contributed by Lukas
The podcast provides an extensive dialogue with Ilya Sutskever concerning the trajectory of artificial intelligence, arguing that the industry is shif...
The TPU vs GPU Battle for AI Dominance
26 Nov 2025
Contributed by Lukas
The podcast examines the ongoing strategic rivalry in the AI accelerator market between the ubiquitous Graphics Processing Units (GPUs), primarily ...
AI Agent design is still hard
24 Nov 2025
Contributed by Lukas
The podcast provides an extensive technical overview of challenges and best practices in building large language model agents. The author shares less...
Emergent Reasoning in Google's New AI Model: Unreleased AI Cracks Historical Handwriting Reasoning
15 Nov 2025
Contributed by Lukas
The podcast discusses a seemingly new Google AI model, potentially Gemini-3, that is showing unprecedented capabilities during A/B testing in AI Stu...
AI-Driven Shortages in Global Storage and Memory
12 Nov 2025
Contributed by Lukas
The podcast discusses a rapidly escalating global shortage across both memory and storage components, directly attributed to the aggressive expansion ...
Terminal Bench Deep Dive: Why the Command Line is the Only Way to Measure Real AI Intelligence and Economic Value
09 Nov 2025
Contributed by Lukas
The podcast features the creators of Terminal-Bench, a new benchmark designed to evaluate large language model agents by testing their ability to ...
DreamGym Decoded: How LLM Reasoning Smashes the 80,000-Step Data Bottleneck with Synthetic Experience
08 Nov 2025
Contributed by Lukas
The podcast introduces DreamGym, a novel framework designed to overcome the challenges of applying reinforcement learning (RL) to large language mode...
Perplexity MoE Deployment Deep Dive: The Custom Kernels and Network Secrets That Make Massive AI Models Run 5X Faster
06 Nov 2025
Contributed by Lukas
The podcast describes the development of high-performance, portable communication kernels specifically designed to handle the challenging sparse exper...
Stop Vibe Coding! Cognition's Windsurf Codemaps Battles the "Comprehension Tax" to Turn Engineers' Brains On
05 Nov 2025
Contributed by Lukas
The provided podcast introduces and discusses Windsurf Codemaps, a new AI-powered feature developed by Cognition.ai for code comprehension, designed ...
OpenAI's $38 Billion AWS Deal: How a Sovereign AI Power Built a $700 Billion Multi-Cloud Empire and the Financial Bubble That Could Pop It All
04 Nov 2025
Contributed by Lukas
The podcast provides an extensive analysis of OpenAI's infrastructure strategy, highlighted by a new multi-year, $38 billion partnership with Am...
Karpathy's AI Divide: Why We're Summoning "Ghosts," Agents Will Take a Decade, and the Brutal "March of Nines"
18 Oct 2025
Contributed by Lukas
The podcast provides an extensive interview transcript with Andrej Karpathy, discussing his views on the future of Large Language Models (LLMs) and...
30 Gigawatts and the AI Race: Inside OpenAI's Custom Chip Alliance with Broadcom to Build Compute Abundance
14 Oct 2025
Contributed by Lukas
The podcast provides excerpts from an OpenAI podcast episode announcing a major partnership between OpenAI and Broadcom to develop custom artificial...
AI's Tectonic Shift: The State of AI 2025—Superintelligence Race, Open Source Tsunami, and the Looming Cybersecurity Crisis
11 Oct 2025
Contributed by Lukas
The podcast provides an extensive overview of the State of AI for 2025, presented by Nathan Benaich, General Partner of Air Street Capital. This mate...
Gemini 2.5 Computer Use Model: How Google's New AI Agent Is Learning to 'Live' Inside Your Browser and Conquer the Messy Web
09 Oct 2025
Contributed by Lukas
The podcast discusses the launch and implications of Google's Gemini 2.5 Computer Use model, a specialized AI built on Gemini 2.5 Pro designed to...
ChatGPT’s New Apps SDK: The Universal UI Dream vs. The Developer's Walled Garden
07 Oct 2025
Contributed by Lukas
The podcast provides an extensive overview of guidelines for developers building applications that integrate with ChatGPT, which are referred to as "...
End AI Amnesia: Anthropic's Context Editing and Memory Tool Solve LLM Forgetfulness and Token Limits
06 Oct 2025
Contributed by Lukas
The podcast discusses new features on the Claude Developer Platform to enhance agents' ability to manage long-running tasks by addressing contex...
OpenAI's Money Furnace: How $13.5 Billion in Losses Fuels the AI Arms Race and the Inevitable Ad Strategy
04 Oct 2025
Contributed by Lukas
The podcast focuses heavily on the financial health and long-term viability of OpenAI, particularly given its substantial revenue of $4.3 billion c...
OpenAI Sora 2: Video Generation Advancements and Deployment
01 Oct 2025
Contributed by Lukas
The podcast discusses the launch of Sora 2, OpenAI's advanced video and audio generation model, highlighting its improved capabilities in rea...
Claude Sonnet 4.5: Best AI Coder or Vibe Coder? Deep Diving Anthropic's Agent Autonomy, Price Wars, and the 30-Hour Task Breakthrough
30 Sep 2025
Contributed by Lukas
The podcast discusses an announcement from Anthropic introducing Claude Sonnet 4.5, which is presented as the world's best model for coding and buil...
The Synergy Secret: How Gemini Robotics' Dual-Model Agent (GR 1.5 & GR-ER 1.5) Solves the General-Purpose Robot Problem
27 Sep 2025
Contributed by Lukas
The podcast introduces and explains the capabilities of the Gemini Robotics 1.5 model family from Google DeepMind, focusing on the Vision-Language-A...
OpenAI: Why the GDPval Benchmark Reveals Near-Human Parity and Catastrophic Failure Rates
26 Sep 2025
Contributed by Lukas
The podcast introduces GDPval, a new benchmark created by OpenAI to evaluate AI models on real-world economically valuable tasks across major secto...
Alibaba's $53 Billion AI War: Unpacking the Qwen3 'Yunqi Declaration' and the New Global Race for ASI
24 Sep 2025
Contributed by Lukas
The podcast provides an extensive analysis of Alibaba's Qwen3 AI strategy, describing it as a meticulous, multi-front assault on the global AI la...
The Great AI Coding Paradox: Mastering Context Engineering to Beat 'Slop' on 500k-Line Codebases
23 Sep 2025
Contributed by Lukas
The podcast discusses a GitHub repository titled "advanced-context-engineering-for-coding-agents" under the "humanlayer" profi...
OpenAI's 10 Gigawatt Gamble: The $100 Billion NVIDIA AI Deal, Energy Crisis, and the "Round Tripping" Debate
23 Sep 2025
Contributed by Lukas
The podcast centers on a significant NVIDIA-OpenAI partnership to deploy at least ten gigawatts (10GW) of AI data centers, which is raising serious ...
When AI Breaks: Anthropic's Postmortem Reveals the Three Infrastructure Bugs That Tanked Claude's Quality
22 Sep 2025
Contributed by Lukas
The podcast discusses a technical postmortem from Anthropic detailing three infrastructure bugs that intermittently degraded the quality of Claude...
98% Cost Revolution: How xAI's Grok 4 Fast Rewrites the Economics of Frontier AI
21 Sep 2025
Contributed by Lukas
The podcast discusses the launch of Grok 4 Fast, a new model from xAI designed for maximum cost-efficiency and intelligence density. This model achie...
NVIDIA's $5 Billion Intel Bet: How the Arc-Rival NVLink Fusion Rewires PCs and AI with Uniform Memory Access
20 Sep 2025
Contributed by Lukas
The podcast discusses a major strategic partnership between NVIDIA and Intel, highlighted by NVIDIA’s $5 billion equity investment in Intel. This...
AI vs. VC: How LLMs Surpassed Human Experts in Spotting Unicorn Startups
19 Sep 2025
Contributed by Lukas
The podcast introduces VCBench, the first standardized, anonymized benchmark designed to evaluate Large Language Models (LLMs) in the challenging d...
AI Outsmarts World's Best Programmers: The ICPC Revolution and the Future of Human-AI Collaboration
18 Sep 2025
Contributed by Lukas
The podcast discusses the significant achievement of AI models from DeepMind and OpenAI in the 2025 International Collegiate Programming Contest (ICPC) Wo...
GPT-5 Codex Unveiled: Your AI Co-Worker Revolutionizing Software Development
16 Sep 2025
Contributed by Lukas
This podcast discusses the evolution and future of AI in coding, particularly focusing on OpenAI's Codex and GPT-5 models. It explains...
Stop Overthinking: How AI is Learning to Think Smarter, Not Just Longer
15 Sep 2025
Contributed by Lukas
This podcast provides a comprehensive overview of efficient reasoning in Large Language Models (LLMs), identifying the "overthinking phenomenon"...
Seedream 4.0: The AI Image Game Changer for Creative Pros
14 Sep 2025
Contributed by Lukas
The podcast introduces Seedream 4.0, a new AI model from ByteDance released in September 2025, which is presented as the definitive leader in AI ima...
Qwen3-Next: Decoupling LLM Knowledge from Compute for Sustainable AI Performance
13 Sep 2025
Contributed by Lukas
The podcast introduces Qwen3-Next, a new generation of large language models developed by Alibaba, emphasizing its innovative hybrid architecture ...
From LLMs to LRMs: Reinforcement Learning's Quest for Truly Reasoning AI
12 Sep 2025
Contributed by Lukas
This podcast explores the integration of Reinforcement Learning (RL) with Large Reasoning Models (LRMs), highlighting its foundational components, cu...
ChatGPT Developer Mode: Unleashing AI Power & Unpacking the "Lethal Trifecta" of Security Risks
11 Sep 2025
Contributed by Lukas
The podcast discusses the recent release of ChatGPT's "Developer Mode", which grants full Model Context Protocol (MCP) client access, enabling the ...
Trillion-Parameter Titans: Alibaba's Qwen3-Max-Preview vs. Kimi K2's Agentic AI Showdown
10 Sep 2025
Contributed by Lukas
Unpack the latest breakthroughs in AI with our podcast. We delve into trillion-parameter language models like Alibaba's Qwen3-Max-Preview, which marks...
Meta REFRAG: 30x Faster and Smarter Knowledge Access
09 Sep 2025
Contributed by Lukas
Tune into "REFRAG: Rethinking RAG Decoding" to discover a cutting-edge framework revolutionizing Retrieval-Augmented Generation (RAG) in Lar...
OpenAI: Why LLM Hallucinates and How Our Tests Make It Worse
07 Sep 2025
Contributed by Lukas
Why do AI chatbots confidently make up facts? This podcast explores the surprising reasons language models 'hallucinate'. We'll uncover ho...
Beyond Chatbots: Building Robust LLM Agents with LangGraph
06 Sep 2025
Contributed by Lukas
Dive into LangGraph, the production-ready agent runtime designed to give you control and durability over your AI agents. Discover how LangGraph addres...
The Gemmaverse Unleashed: Private, Powerful AI in Your Pocket
05 Sep 2025
Contributed by Lukas
Welcome to the "Gemmaverse Unlocked" podcast! Dive into the world of Google's Gemma family of open models, where State-of-the-Art AI mee...
Unpacking Implicit Reasoning: The Silent, Speedy Revolution in LLM Thinking
05 Sep 2025
Contributed by Lukas
Decoding the Silent Mind: Implicit Reasoning in LLMs. Discover Implicit Reasoning, the cutting-edge method where Large Language Models (LLMs) solve comp...
LLMs Unleashed: How GLM-4.5, vLLM, and Cognitive Load Shape the Future of AI Software
04 Sep 2025
Contributed by Lukas
Explore the future of AI software development with a look into advanced LLMs, high-performance inference systems, and the human element of cognitive l...