The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Relational Foundation Models for Enterprise Data with Jure Leskovec - #768

21 May 2026

Contributed by Lukas

In this episode, Jure Leskovec, co-founder and chief scientist at Kumo and professor of computer science at Stanford, joins us to explore two fronts o...

How to Find the Agent Failures Your Evals Miss with Scott Clark - #767

07 May 2026

Contributed by Lukas

In this episode, Scott Clark, co-founder and CEO of Distributional, joins us to explore how teams can reliably operate and improve complex LLM systems...

How to Engineer AI Inference Systems with Philip Kiely - #766

30 Apr 2026

Contributed by Lukas

In this episode, Philip Kiely, head of AI education at Baseten, joins us to unpack the fast-evolving discipline of inference engineering. We explore w...

How Capital One Delivers Multi-Agent Systems with Rashmi Shetty - #765

16 Apr 2026

Contributed by Lukas

In this episode, Rashmi Shetty, senior director of enterprise generative AI platform at Capital One, joins us to explore how the company is designing,...

The Race to Production-Grade Diffusion LLMs with Stefano Ermon - #764

26 Mar 2026

Contributed by Lukas

Today, we're joined by Stefano Ermon, associate professor at Stanford University and CEO of Inception Labs to discuss diffusion language models. We di...

Agent Swarms and Knowledge Graphs for Autonomous Software Development with Siddhant Pardeshi - #763

10 Mar 2026

Contributed by Lukas

In this episode, Sid Pardeshi, co-founder and CTO of Blitzy, joins us to discuss building autonomous development systems able to deliver production-re...

AI Trends 2026: OpenClaw Agents, Reasoning LLMs, and More with Sebastian Raschka - #762

26 Feb 2026

Contributed by Lukas

In this episode, Sebastian Raschka, independent LLM researcher and author, joins us to break down how the LLM landscape has changed over the past year...

The Evolution of Reasoning in Small Language Models with Yejin Choi - #761

29 Jan 2026

Contributed by Lukas

Today, we're joined by Yejin Choi, professor and senior fellow at Stanford University in the Computer Science Department and the Institute for Human-C...

Intelligent Robots in 2026: Are We There Yet? with Nikita Rudin - #760

08 Jan 2026

Contributed by Lukas

Today, we're joined by Nikita Rudin, co-founder and CEO of Flexion Robotics to discuss the gap between current robotic capabilities and what’s requi...

Rethinking Pre-Training for Agentic AI with Aakanksha Chowdhery - #759

17 Dec 2025

Contributed by Lukas

Today, we're joined by Aakanksha Chowdhery, member of technical staff at Reflection, to explore the fundamental shifts required to build true agentic ...

Why Vision Language Models Ignore What They See with Munawar Hayat - #758

09 Dec 2025

Contributed by Lukas

In this episode, we’re joined by Munawar Hayat, researcher at Qualcomm AI Research, to discuss a series of papers presented at NeurIPS 2025 focusing...

Scaling Agentic Inference Across Heterogeneous Compute with Zain Asgar - #757

02 Dec 2025

Contributed by Lukas

In this episode, Zain Asgar, co-founder and CEO of Gimlet Labs, joins us to discuss the heterogeneous AI inference across diverse hardware. Zain argue...

Proactive Agents for the Web with Devi Parikh - #756

19 Nov 2025

Contributed by Lukas

Today, we're joined by Devi Parikh, co-founder and co-CEO of Yutori, to discuss browser use models and a future where we interact with the web through...

AI Orchestration for Smart Cities and the Enterprise with Robin Braun and Luke Norris - #755

12 Nov 2025

Contributed by Lukas

Today, we're joined by Robin Braun, VP of AI business development for hybrid cloud at HPE, and Luke Norris, co-founder and CEO of Kamiwaza, to discuss...

Building an AI Mathematician with Carina Hong - #754

04 Nov 2025

Contributed by Lukas

In this episode, Carina Hong, founder and CEO of Axiom, joins us to discuss her work building an "AI Mathematician." Carina explains why this is a piv...

High-Efficiency Diffusion Models for On-Device Image Generation and Editing with Hung Bui - #753

28 Oct 2025

Contributed by Lukas

In this episode, Hung Bui, Technology Vice President at Qualcomm, joins us to explore the latest high-efficiency techniques for running generative AI,...

Vibe Coding's Uncanny Valley with Alexandre Pesant - #752

22 Oct 2025

Contributed by Lukas

Today, we're joined by Alexandre Pesant, AI lead at Lovable, who joins us to discuss the evolution and practice of vibe coding. Alex shares his take o...

Dataflow Computing for AI Inference with Kunle Olukotun - #751

14 Oct 2025

Contributed by Lukas

In this episode, we're joined by Kunle Olukotun, professor of electrical engineering and computer science at Stanford University and co-founder and ch...

Recurrence and Attention for Long-Context Transformers with Jacob Buckman - #750

07 Oct 2025

Contributed by Lukas

Today, we're joined by Jacob Buckman, co-founder and CEO of Manifest AI to discuss achieving long context in transformers. We discuss the bottlenecks ...

The Decentralized Future of Private AI with Illia Polosukhin - #749

30 Sep 2025

Contributed by Lukas

In this episode, Illia Polosukhin, a co-author of the seminal "Attention Is All You Need" paper and co-founder of Near AI, joins us to discuss his vis...

Inside Nano Banana 🍌 and the Future of Vision-Language Models with Oliver Wang - #748

23 Sep 2025

Contributed by Lukas

Today, we’re joined by Oliver Wang, principal scientist at Google DeepMind and tech lead for Gemini 2.5 Flash Image—better known by its code name,...

Is It Time to Rethink LLM Pre-Training? with Aditi Raghunathan - #747

16 Sep 2025

Contributed by Lukas

Today, we're joined by Aditi Raghunathan, assistant professor at Carnegie Mellon University, to discuss the limitations of LLMs and how we can build m...

Building an Immune System for AI Generated Software with Animesh Koratana - #746

09 Sep 2025

Contributed by Lukas

Today, we're joined by Animesh Koratana, founder and CEO of PlayerZero to discuss his team’s approach to making agentic and AI-assisted coding tools...

Autoformalization and Verifiable Superintelligence with Christian Szegedy - #745

02 Sep 2025

Contributed by Lukas

In this episode, Christian Szegedy, Chief Scientist at Morph Labs, joins us to discuss how the application of formal mathematics and reasoning enables...

Multimodal AI Models on Apple Silicon with MLX with Prince Canuma - #744

26 Aug 2025

Contributed by Lukas

Today, we're joined by Prince Canuma, an ML engineer and open-source developer focused on optimizing AI inference on Apple Silicon devices. Prince sha...

Genie 3: A New Frontier for World Models with Jack Parker-Holder and Shlomi Fruchter - #743

19 Aug 2025

Contributed by Lukas

Today, we're joined by Jack Parker-Holder and Shlomi Fruchter, researchers at Google DeepMind, to discuss the recent release of Genie 3, a model capab...

Closing the Loop Between AI Training and Inference with Lin Qiao - #742

12 Aug 2025

Contributed by Lukas

In this episode, we're joined by Lin Qiao, CEO and co-founder of Fireworks AI. Drawing on key lessons from her time building PyTorch, Lin shares her p...

Context Engineering for Productive AI Agents with Filip Kozera - #741

29 Jul 2025

Contributed by Lukas

In this episode, Filip Kozera, founder and CEO of Wordware, explains his approach to building agentic workflows where natural language serves as the n...

Infrastructure Scaling and Compound AI Systems with Jared Quincy Davis - #740

22 Jul 2025

Contributed by Lukas

In this episode, Jared Quincy Davis, founder and CEO at Foundry, introduces the concept of "compound AI systems," which allows users to create powerfu...

Building Voice AI Agents That Don’t Suck with Kwindla Kramer - #739

15 Jul 2025

Contributed by Lukas

In this episode, Kwindla Kramer, co-founder and CEO of Daily and creator of the open source Pipecat framework, joins us to discuss the architecture an...

Distilling Transformers and Diffusion Models for Robust Edge Use Cases with Fatih Porikli - #738

09 Jul 2025

Contributed by Lukas

Today, we're joined by Fatih Porikli, senior director of technology at Qualcomm AI Research for an in-depth look at several of Qualcomm's accepted pap...

Building the Internet of Agents with Vijoy Pandey - #737

24 Jun 2025

Contributed by Lukas

Today, we're joined by Vijoy Pandey, SVP and general manager at Outshift by Cisco to discuss a foundational challenge for the enterprise: how do we ma...

LLMs for Equities Feature Forecasting at Two Sigma with Ben Wellington - #736

17 Jun 2025

Contributed by Lukas

Today, we're joined by Ben Wellington, deputy head of feature forecasting at Two Sigma. We dig into the team’s end-to-end approach to leveraging AI ...

Zero-Shot Auto-Labeling: The End of Annotation for Computer Vision with Jason Corso - #735

10 Jun 2025

Contributed by Lukas

Today, we're joined by Jason Corso, co-founder of Voxel51 and professor at the University of Michigan, to explore automated labeling in computer visio...

Grokking, Generalization Collapse, and the Dynamics of Training Deep Neural Networks with Charles Martin - #734

05 Jun 2025

Contributed by Lukas

Today, we're joined by Charles Martin, founder of Calculation Consulting, to discuss Weight Watcher, an open-source tool for analyzing and improving D...

Google I/O 2025 Special Edition - #733

28 May 2025

Contributed by Lukas

Today, I’m excited to share a special crossover edition of the podcast recorded live from Google I/O 2025! In this episode, I join Shawn Wang aka Sw...

RAG Risks: Why Retrieval-Augmented LLMs are Not Safer with Sebastian Gehrmann - #732

21 May 2025

Contributed by Lukas

Today, we're joined by Sebastian Gehrmann, head of responsible AI in the Office of the CTO at Bloomberg, to discuss AI safety in retrieval-augmented g...

From Prompts to Policies: How RL Builds Better AI Agents with Mahesh Sathiamoorthy - #731

13 May 2025

Contributed by Lukas

Today, we're joined by Mahesh Sathiamoorthy, co-founder and CEO of Bespoke Labs, to discuss how reinforcement learning (RL) is reshaping the way we bu...

How OpenAI Builds AI Agents That Think and Act with Josh Tobin - #730

06 May 2025

Contributed by Lukas

Today, we're joined by Josh Tobin, member of technical staff at OpenAI, to discuss the company’s approach to building AI agents. We cover OpenAI's t...

CTIBench: Evaluating LLMs in Cyber Threat Intelligence with Nidhi Rastogi - #729

30 Apr 2025

Contributed by Lukas

Today, we're joined by Nidhi Rastogi, assistant professor at Rochester Institute of Technology to discuss Cyber Threat Intelligence (CTI), focusing on...

Generative Benchmarking with Kelly Hong - #728

23 Apr 2025

Contributed by Lukas

In this episode, Kelly Hong, a researcher at Chroma, joins us to discuss "Generative Benchmarking," a novel approach to evaluating retrieval systems, ...

Exploring the Biology of LLMs with Circuit Tracing with Emmanuel Ameisen - #727

14 Apr 2025

Contributed by Lukas

In this episode, Emmanuel Ameisen, a research engineer at Anthropic, returns to discuss two recent papers: "Circuit Tracing: Revealing Language Model ...

Teaching LLMs to Self-Reflect with Reinforcement Learning with Maohao Shen - #726

08 Apr 2025

Contributed by Lukas

Today, we're joined by Maohao Shen, PhD student at MIT to discuss his paper, “Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances L...

Waymo's Foundation Model for Autonomous Driving with Drago Anguelov - #725

31 Mar 2025

Contributed by Lukas

Today, we're joined by Drago Anguelov, head of AI foundations at Waymo, for a deep dive into the role of foundation models in autonomous driving. Drag...

Dynamic Token Merging for Efficient Byte-level Language Models with Julie Kallini - #724

24 Mar 2025

Contributed by Lukas

Today, we're joined by Julie Kallini, PhD student at Stanford University to discuss her recent papers, “MrT5: Dynamic Token Merging for Efficient By...

Scaling Up Test-Time Compute with Latent Reasoning with Jonas Geiping - #723

17 Mar 2025

Contributed by Lukas

Today, we're joined by Jonas Geiping, research group leader at Ellis Institute and the Max Planck Institute for Intelligent Systems to discuss his rec...

Imagine while Reasoning in Space: Multimodal Visualization-of-Thought with Chengzu Li - #722

10 Mar 2025

Contributed by Lukas

Today, we're joined by Chengzu Li, PhD student at the University of Cambridge to discuss his recent paper, “Imagine while Reasoning in Space: Multim...

Inside s1: An o1-Style Reasoning Model That Cost Under $50 to Train with Niklas Muennighoff - #721

03 Mar 2025

Contributed by Lukas

Today, we're joined by Niklas Muennighoff, a PhD student at Stanford University, to discuss his paper, “S1: Simple Test-Time Scaling.” We explore ...

Accelerating AI Training and Inference with AWS Trainium2 with Ron Diamant - #720

24 Feb 2025

Contributed by Lukas

Today, we're joined by Ron Diamant, chief architect for Trainium at Amazon Web Services, to discuss hardware acceleration for generative AI and the de...

π0: A Foundation Model for Robotics with Sergey Levine - #719

18 Feb 2025

Contributed by Lukas

Today, we're joined by Sergey Levine, associate professor at UC Berkeley and co-founder of Physical Intelligence, to discuss π0 (pi-zero), a general-...

AI Trends 2025: AI Agents and Multi-Agent Systems with Victor Dibia - #718

10 Feb 2025

Contributed by Lukas

Today we’re joined by Victor Dibia, principal research software engineer at Microsoft Research, to explore the key trends and advancements in AI age...

Speculative Decoding and Efficient LLM Inference with Chris Lott - #717

04 Feb 2025

Contributed by Lukas

Today, we're joined by Chris Lott, senior director of engineering at Qualcomm AI Research to discuss accelerating large language model inference. We e...

Ensuring Privacy for Any LLM with Patricia Thaine - #716

28 Jan 2025

Contributed by Lukas

Today, we're joined by Patricia Thaine, co-founder and CEO of Private AI to discuss techniques for ensuring privacy, data minimization, and compliance...

AI Engineering Pitfalls with Chip Huyen - #715

21 Jan 2025

Contributed by Lukas

Today, we're joined by Chip Huyen, independent researcher and writer to discuss her new book, “AI Engineering.” We dig into the definition of AI e...

Evolving MLOps Platforms for Generative AI and Agents with Abhijit Bose - #714

13 Jan 2025

Contributed by Lukas

Today, we're joined by Abhijit Bose, head of enterprise AI and ML platforms at Capital One to discuss the evolution of the company’s approach and in...

Why Agents Are Stupid & What We Can Do About It with Dan Jeffries - #713

16 Dec 2024

Contributed by Lukas

Today, we're joined by Dan Jeffries, founder and CEO of Kentauros AI to discuss the challenges currently faced by those developing advanced AI agents....

Automated Reasoning to Prevent LLM Hallucination with Byron Cook - #712

09 Dec 2024

Contributed by Lukas

Today, we're joined by Byron Cook, VP and distinguished scientist in the Automated Reasoning Group at AWS to dig into the underlying technology behind...

AI at the Edge: Qualcomm AI Research at NeurIPS 2024 with Arash Behboodi - #711

03 Dec 2024

Contributed by Lukas

Today, we're joined by Arash Behboodi, director of engineering at Qualcomm AI Research to discuss the papers and workshops Qualcomm will be presenting...

AI for Network Management with Shirley Wu - #710

19 Nov 2024

Contributed by Lukas

Today, we're joined by Shirley Wu, senior director of software engineering at Juniper Networks to discuss how machine learning and artificial intellig...

Why Your RAG System Is Broken, and How to Fix It with Jason Liu - #709

11 Nov 2024

Contributed by Lukas

Today, we're joined by Jason Liu, freelance AI consultant, advisor, and creator of the Instructor library to discuss all things retrieval-augmented ge...

An Agentic Mixture of Experts for DevOps with Sunil Mallya - #708

04 Nov 2024

Contributed by Lukas

Today we're joined by Sunil Mallya, CTO and co-founder of Flip AI. We discuss Flip’s incident debugging system for DevOps, which was built using a c...

Building AI Voice Agents with Scott Stephenson - #707

28 Oct 2024

Contributed by Lukas

Today, we're joined by Scott Stephenson, co-founder and CEO of Deepgram to discuss voice AI agents. We explore the importance of perception, understan...

Is Artificial Superintelligence Imminent? with Tim Rocktäschel - #706

21 Oct 2024

Contributed by Lukas

Today, we're joined by Tim Rocktäschel, senior staff research scientist at Google DeepMind, professor of Artificial Intelligence at University Colleg...

ML Models for Safety-Critical Systems with Lucas García - #705

14 Oct 2024

Contributed by Lukas

Today, we're joined by Lucas García, principal product manager for deep learning at MathWorks to discuss incorporating ML models into safety-critical...

AI Agents: Substance or Snake Oil with Arvind Narayanan - #704

07 Oct 2024

Contributed by Lukas

Today, we're joined by Arvind Narayanan, professor of Computer Science at Princeton University to discuss his recent works, AI Agents That Matter and ...

AI Agents for Data Analysis with Shreya Shankar - #703

30 Sep 2024

Contributed by Lukas

Today, we're joined by Shreya Shankar, a PhD student at UC Berkeley to discuss DocETL, a declarative system for building and optimizing LLM-powered da...

Stealing Part of a Production Language Model with Nicholas Carlini - #702

23 Sep 2024

Contributed by Lukas

Today, we're joined by Nicholas Carlini, research scientist at Google DeepMind to discuss adversarial machine learning and model security, focusing on...

Supercharging Developer Productivity with ChatGPT and Claude with Simon Willison - #701

16 Sep 2024

Contributed by Lukas

Today, we're joined by Simon Willison, independent researcher and creator of Datasette to discuss the many ways software developers and engineers can ...

Automated Design of Agentic Systems with Shengran Hu - #700

02 Sep 2024

Contributed by Lukas

Today, we're joined by Shengran Hu, a PhD student at the University of British Columbia, to discuss Automated Design of Agentic Systems (ADAS), an app...

The EU AI Act and Mitigating Bias in Automated Decisioning with Peter van der Putten - #699

27 Aug 2024

Contributed by Lukas

Today, we're joined by Peter van der Putten, director of the AI Lab at Pega and assistant professor of AI at Leiden University. We discuss the newly a...

The Building Blocks of Agentic Systems with Harrison Chase - #698

19 Aug 2024

Contributed by Lukas

Today, we're joined by Harrison Chase, co-founder and CEO of LangChain to discuss LLM frameworks, agentic systems, RAG, evaluation, and more. We dig i...

Simplifying On-Device AI for Developers with Siddhika Nevrekar - #697

12 Aug 2024

Contributed by Lukas

Today, we're joined by Siddhika Nevrekar, AI Hub head at Qualcomm Technologies, to discuss on-device AI and how to make it easier for developers to ta...

Genie: Generative Interactive Environments with Ashley Edwards - #696

05 Aug 2024

Contributed by Lukas

Today, we're joined by Ashley Edwards, a member of technical staff at Runway, to discuss Genie: Generative Interactive Environments, a system for crea...

Bridging the Sim2real Gap in Robotics with Marius Memmel - #695

30 Jul 2024

Contributed by Lukas

Today, we're joined by Marius Memmel, a PhD student at the University of Washington, to discuss his research on sim-to-real transfer approaches for de...

Building Real-World LLM Products with Fine-Tuning and More with Hamel Husain - #694

23 Jul 2024

Contributed by Lukas

Today, we're joined by Hamel Husain, founder of Parlance Labs, to discuss the ins and outs of building real-world products using large language models...

Mamba, Mamba-2 and Post-Transformer Architectures for Generative AI with Albert Gu - #693

17 Jul 2024

Contributed by Lukas

Today, we're joined by Albert Gu, assistant professor at Carnegie Mellon University, to discuss his research on post-transformer architectures for mul...

Decoding Animal Behavior to Train Robots with EgoPet with Amir Bar - #692

09 Jul 2024

Contributed by Lukas

Today, we're joined by Amir Bar, a PhD candidate at Tel Aviv University and UC Berkeley to discuss his research on visual-based learning, including hi...

How Microsoft Scales Testing and Safety for Generative AI with Sarah Bird - #691

01 Jul 2024

Contributed by Lukas

Today, we're joined by Sarah Bird, chief product officer of responsible AI at Microsoft. We discuss the testing and evaluation techniques Microsoft ap...

Long Context Language Models and their Biological Applications with Eric Nguyen - #690

25 Jun 2024

Contributed by Lukas

Today, we're joined by Eric Nguyen, PhD student at Stanford University. In our conversation, we explore his research on long context foundation models...

Accelerating Sustainability with AI with Andres Ravinet - #689

18 Jun 2024

Contributed by Lukas

Today, we're joined by Andres Ravinet, sustainability global black belt at Microsoft, to discuss the role of AI in sustainability. We explore real-wor...

Gen AI at the Edge: Qualcomm AI Research at CVPR 2024 with Fatih Porikli - #688

10 Jun 2024

Contributed by Lukas

Today we’re joined by Fatih Porikli, senior director of technology at Qualcomm AI Research. In our conversation, we covered several of the Qualcomm ...

Energy Star Ratings for AI Models with Sasha Luccioni - #687

03 Jun 2024

Contributed by Lukas

Today, we're joined by Sasha Luccioni, AI and Climate lead at Hugging Face, to discuss the environmental impact of AI models. We dig into her recent r...

Language Understanding and LLMs with Christopher Manning - #686

27 May 2024

Contributed by Lukas

Today, we're joined by Christopher Manning, the Thomas M. Siebel professor in Machine Learning at Stanford University and a recent recipient of the 20...

Chronos: Learning the Language of Time Series with Abdul Fatir Ansari - #685

20 May 2024

Contributed by Lukas

Today we're joined by Abdul Fatir Ansari, a machine learning scientist at AWS AI Labs in Berlin, to discuss his paper, "Chronos: Learning the Language...

Powering AI with the World's Largest Computer Chip with Joel Hestness - #684

13 May 2024

Contributed by Lukas

Today we're joined by Joel Hestness, principal research scientist and lead of the core machine learning team at Cerebras. We discuss Cerebras’ custo...

AI for Power & Energy with Laurent Boinot - #683

07 May 2024

Contributed by Lukas

Today we're joined by Laurent Boinot, power and utilities lead for the Americas at Microsoft, to discuss the intersection of AI and energy infrastruct...

Controlling Fusion Reactor Instability with Deep Reinforcement Learning with Aza Jalalvand - #682

29 Apr 2024

Contributed by Lukas

Today we're joined by Azarakhsh (Aza) Jalalvand, a research scholar at Princeton University, to discuss his work using deep reinforcement learning to ...

GraphRAG: Knowledge Graphs for AI Applications with Kirk Marple - #681

22 Apr 2024

Contributed by Lukas

Today we're joined by Kirk Marple, CEO and founder of Graphlit, to explore the emerging paradigm of "GraphRAG," or Graph Retrieval Augmented Generatio...

Teaching Large Language Models to Reason with Reinforcement Learning with Alex Havrilla - #680

16 Apr 2024

Contributed by Lukas

Today we're joined by Alex Havrilla, a PhD student at Georgia Tech, to discuss "Teaching Large Language Models to Reason with Reinforcement Learning."...

Localizing and Editing Knowledge in LLMs with Peter Hase - #679

08 Apr 2024

Contributed by Lukas

Today we're joined by Peter Hase, a fifth-year PhD student at the University of North Carolina NLP lab. We discuss "scalable oversight", and the impor...

Coercing LLMs to Do and Reveal (Almost) Anything with Jonas Geiping - #678

01 Apr 2024

Contributed by Lukas

Today we're joined by Jonas Geiping, a research group leader at the ELLIS Institute, to explore his paper: "Coercing LLMs to Do and Reveal (Almost) An...

V-JEPA, AI Reasoning from a Non-Generative Architecture with Mido Assran - #677

25 Mar 2024

Contributed by Lukas

Today we’re joined by Mido Assran, a research scientist at Meta’s Fundamental AI Research (FAIR). In this conversation, we discuss V-JEPA, a new m...

Video as a Universal Interface for AI Reasoning with Sherry Yang - #676

18 Mar 2024

Contributed by Lukas

Today we’re joined by Sherry Yang, senior research scientist at Google DeepMind and a PhD student at UC Berkeley. In this interview, we discuss her ...

Assessing the Risks of Open AI Models with Sayash Kapoor - #675

11 Mar 2024

Contributed by Lukas

Today we’re joined by Sayash Kapoor, a Ph.D. student in the Department of Computer Science at Princeton University. Sayash walks us through his pape...

OLMo: Everything You Need to Train an Open Source LLM with Akshita Bhagia - #674

04 Mar 2024

Contributed by Lukas

Today we’re joined by Akshita Bhagia, a senior research engineer at the Allen Institute for AI. Akshita joins us to discuss OLMo, a new open source ...

Training Data Locality and Chain-of-Thought Reasoning in LLMs with Ben Prystawski - #673

26 Feb 2024

Contributed by Lukas

Today we’re joined by Ben Prystawski, a PhD student in the Department of Psychology at Stanford University working at the intersection of cognitive ...

Reasoning Over Complex Documents with DocLLM with Armineh Nourbakhsh - #672

19 Feb 2024

Contributed by Lukas

Today we're joined by Armineh Nourbakhsh of JP Morgan AI Research to discuss the development and capabilities of DocLLM, a layout-aware large language...

Are Emergent Behaviors in LLMs an Illusion? with Sanmi Koyejo - #671

12 Feb 2024

Contributed by Lukas

Today we’re joined by Sanmi Koyejo, assistant professor at Stanford University, to continue our NeurIPS 2024 series. In our conversation, Sanmi disc...

AI Trends 2024: Reinforcement Learning in the Age of LLMs with Kamyar Azizzadenesheli - #670

05 Feb 2024

Contributed by Lukas

Today we’re joined by Kamyar Azizzadenesheli, a staff researcher at Nvidia, to continue our AI Trends 2024 series. In our conversation, Kamyar updat...

Building and Deploying Real-World RAG Applications with Ram Sriharsha - #669

29 Jan 2024

Contributed by Lukas

Today we’re joined by Ram Sriharsha, VP of engineering at Pinecone. In our conversation, we dive into the topic of vector databases and retrieval au...

Activity Overview

Episodes

Relational Foundation Models for Enterprise Data with Jure Leskovec - #768

How to Find the Agent Failures Your Evals Miss with Scott Clark - #767

How to Engineer AI Inference Systems with Philip Kiely - #766

How Capital One Delivers Multi-Agent Systems with Rashmi Shetty - #765

The Race to Production-Grade Diffusion LLMs with Stefano Ermon - #764

Agent Swarms and Knowledge Graphs for Autonomous Software Development with Siddhant Pardeshi - #763

AI Trends 2026: OpenClaw Agents, Reasoning LLMs, and More with Sebastian Raschka - #762

The Evolution of Reasoning in Small Language Models with Yejin Choi - #761

Intelligent Robots in 2026: Are We There Yet? with Nikita Rudin - #760

Rethinking Pre-Training for Agentic AI with Aakanksha Chowdhery - #759

Why Vision Language Models Ignore What They See with Munawar Hayat - #758

Scaling Agentic Inference Across Heterogeneous Compute with Zain Asgar - #757

Proactive Agents for the Web with Devi Parikh - #756

AI Orchestration for Smart Cities and the Enterprise with Robin Braun and Luke Norris - #755

Building an AI Mathematician with Carina Hong - #754

High-Efficiency Diffusion Models for On-Device Image Generation and Editing with Hung Bui - #753

Vibe Coding's Uncanny Valley with Alexandre Pesant - #752

Dataflow Computing for AI Inference with Kunle Olukotun - #751

Recurrence and Attention for Long-Context Transformers with Jacob Buckman - #750

The Decentralized Future of Private AI with Illia Polosukhin - #749

Inside Nano Banana 🍌 and the Future of Vision-Language Models with Oliver Wang - #748

Is It Time to Rethink LLM Pre-Training? with Aditi Raghunathan - #747

Building an Immune System for AI Generated Software with Animesh Koratana - #746

Autoformalization and Verifiable Superintelligence with Christian Szegedy - #745

Multimodal AI Models on Apple Silicon with MLX with Prince Canuma - #744

Genie 3: A New Frontier for World Models with Jack Parker-Holder and Shlomi Fruchter - #743

Closing the Loop Between AI Training and Inference with Lin Qiao - #742

Context Engineering for Productive AI Agents with Filip Kozera - #741

Infrastructure Scaling and Compound AI Systems with Jared Quincy Davis - #740

Building Voice AI Agents That Don’t Suck with Kwindla Kramer - #739

Distilling Transformers and Diffusion Models for Robust Edge Use Cases with Fatih Porikli - #738

Building the Internet of Agents with Vijoy Pandey - #737

LLMs for Equities Feature Forecasting at Two Sigma with Ben Wellington - #736

Zero-Shot Auto-Labeling: The End of Annotation for Computer Vision with Jason Corso - #735

Grokking, Generalization Collapse, and the Dynamics of Training Deep Neural Networks with Charles Martin - #734

Google I/O 2025 Special Edition - #733

RAG Risks: Why Retrieval-Augmented LLMs are Not Safer with Sebastian Gehrmann - #732

From Prompts to Policies: How RL Builds Better AI Agents with Mahesh Sathiamoorthy - #731

How OpenAI Builds AI Agents That Think and Act with Josh Tobin - #730

CTIBench: Evaluating LLMs in Cyber Threat Intelligence with Nidhi Rastogi - #729

Generative Benchmarking with Kelly Hong - #728

Exploring the Biology of LLMs with Circuit Tracing with Emmanuel Ameisen - #727

Teaching LLMs to Self-Reflect with Reinforcement Learning with Maohao Shen - #726

Waymo's Foundation Model for Autonomous Driving with Drago Anguelov - #725

Dynamic Token Merging for Efficient Byte-level Language Models with Julie Kallini - #724

Scaling Up Test-Time Compute with Latent Reasoning with Jonas Geiping - #723

Imagine while Reasoning in Space: Multimodal Visualization-of-Thought with Chengzu Li - #722

Inside s1: An o1-Style Reasoning Model That Cost Under $50 to Train with Niklas Muennighoff - #721

Accelerating AI Training and Inference with AWS Trainium2 with Ron Diamant - #720

π0: A Foundation Model for Robotics with Sergey Levine - #719

AI Trends 2025: AI Agents and Multi-Agent Systems with Victor Dibia - #718

Speculative Decoding and Efficient LLM Inference with Chris Lott - #717

Ensuring Privacy for Any LLM with Patricia Thaine - #716

AI Engineering Pitfalls with Chip Huyen - #715

Evolving MLOps Platforms for Generative AI and Agents with Abhijit Bose - #714

Why Agents Are Stupid & What We Can Do About It with Dan Jeffries - #713

Automated Reasoning to Prevent LLM Hallucination with Byron Cook - #712

AI at the Edge: Qualcomm AI Research at NeurIPS 2024 with Arash Behboodi - #711

AI for Network Management with Shirley Wu - #710

Why Your RAG System Is Broken, and How to Fix It with Jason Liu - #709

An Agentic Mixture of Experts for DevOps with Sunil Mallya - #708

Building AI Voice Agents with Scott Stephenson - #707

Is Artificial Superintelligence Imminent? with Tim Rocktäschel - #706

ML Models for Safety-Critical Systems with Lucas García - #705

AI Agents: Substance or Snake Oil with Arvind Narayanan - #704

AI Agents for Data Analysis with Shreya Shankar - #703

Stealing Part of a Production Language Model with Nicholas Carlini - #702

Supercharging Developer Productivity with ChatGPT and Claude with Simon Willison - #701

Automated Design of Agentic Systems with Shengran Hu - #700

The EU AI Act and Mitigating Bias in Automated Decisioning with Peter van der Putten - #699

The Building Blocks of Agentic Systems with Harrison Chase - #698

Simplifying On-Device AI for Developers with Siddhika Nevrekar - #697

Genie: Generative Interactive Environments with Ashley Edwards - #696

Bridging the Sim2real Gap in Robotics with Marius Memmel - #695

Building Real-World LLM Products with Fine-Tuning and More with Hamel Husain - #694

Mamba, Mamba-2 and Post-Transformer Architectures for Generative AI with Albert Gu - #693

Decoding Animal Behavior to Train Robots with EgoPet with Amir Bar - #692