Scaling Agentic Inference Across Heterogeneous Compute with Zain Asgar - #757
02 Dec 2025
Contributed by Lukas
In this episode, Zain Asgar, co-founder and CEO of Gimlet Labs, joins us to discuss the heterogeneou...
Proactive Agents for the Web with Devi Parikh - #756
19 Nov 2025
Contributed by Lukas
Today, we're joined by Devi Parikh, co-founder and co-CEO of Yutori, to discuss browser use models a...
AI Orchestration for Smart Cities and the Enterprise with Robin Braun and Luke Norris - #755
12 Nov 2025
Contributed by Lukas
Today, we're joined by Robin Braun, VP of AI business development for hybrid cloud at HPE, and Luke ...
Building an AI Mathematician with Carina Hong - #754
04 Nov 2025
Contributed by Lukas
In this episode, Carina Hong, founder and CEO of Axiom, joins us to discuss her work building an "AI...
High-Efficiency Diffusion Models for On-Device Image Generation and Editing with Hung Bui - #753
28 Oct 2025
Contributed by Lukas
In this episode, Hung Bui, Technology Vice President at Qualcomm, joins us to explore the latest hig...
Vibe Coding's Uncanny Valley with Alexandre Pesant - #752
22 Oct 2025
Contributed by Lukas
Today, we're joined by Alexandre Pesant, AI lead at Lovable, who joins us to discuss the evolution a...
Dataflow Computing for AI Inference with Kunle Olukotun - #751
14 Oct 2025
Contributed by Lukas
In this episode, we're joined by Kunle Olukotun, professor of electrical engineering and computer sc...
Recurrence and Attention for Long-Context Transformers with Jacob Buckman - #750
07 Oct 2025
Contributed by Lukas
Today, we're joined by Jacob Buckman, co-founder and CEO of Manifest AI to discuss achieving long co...
The Decentralized Future of Private AI with Illia Polosukhin - #749
30 Sep 2025
Contributed by Lukas
In this episode, Illia Polosukhin, a co-author of the seminal "Attention Is All You Need" paper and ...
Inside Nano Banana 🍌 and the Future of Vision-Language Models with Oliver Wang - #748
23 Sep 2025
Contributed by Lukas
Today, we’re joined by Oliver Wang, principal scientist at Google DeepMind and tech lead for Gemin...
Is It Time to Rethink LLM Pre-Training? with Aditi Raghunathan - #747
16 Sep 2025
Contributed by Lukas
Today, we're joined by Aditi Raghunathan, assistant professor at Carnegie Mellon University, to disc...
Building an Immune System for AI Generated Software with Animesh Koratana - #746
09 Sep 2025
Contributed by Lukas
Today, we're joined by Animesh Koratana, founder and CEO of PlayerZero to discuss his team’s appro...
Autoformalization and Verifiable Superintelligence with Christian Szegedy - #745
02 Sep 2025
Contributed by Lukas
In this episode, Christian Szegedy, Chief Scientist at Morph Labs, joins us to discuss how the appli...
Multimodal AI Models on Apple Silicon with MLX with Prince Canuma - #744
26 Aug 2025
Contributed by Lukas
Today, we're joined by Prince Canuma, an ML engineer and open-source developer focused on optimizing...
Genie 3: A New Frontier for World Models with Jack Parker-Holder and Shlomi Fruchter - #743
19 Aug 2025
Contributed by Lukas
Today, we're joined by Jack Parker-Holder and Shlomi Fruchter, researchers at Google DeepMind, to di...
Closing the Loop Between AI Training and Inference with Lin Qiao - #742
12 Aug 2025
Contributed by Lukas
In this episode, we're joined by Lin Qiao, CEO and co-founder of Fireworks AI. Drawing on key lesson...
Context Engineering for Productive AI Agents with Filip Kozera - #741
29 Jul 2025
Contributed by Lukas
In this episode, Filip Kozera, founder and CEO of Wordware, explains his approach to building agenti...
Infrastructure Scaling and Compound AI Systems with Jared Quincy Davis - #740
22 Jul 2025
Contributed by Lukas
In this episode, Jared Quincy Davis, founder and CEO at Foundry, introduces the concept of "compound...
Building Voice AI Agents That Don’t Suck with Kwindla Kramer - #739
15 Jul 2025
Contributed by Lukas
In this episode, Kwindla Kramer, co-founder and CEO of Daily and creator of the open source Pipecat ...
Distilling Transformers and Diffusion Models for Robust Edge Use Cases with Fatih Porikli - #738
09 Jul 2025
Contributed by Lukas
Today, we're joined by Fatih Porikli, senior director of technology at Qualcomm AI Research for an i...
Building the Internet of Agents with Vijoy Pandey - #737
24 Jun 2025
Contributed by Lukas
Today, we're joined by Vijoy Pandey, SVP and general manager at Outshift by Cisco to discuss a found...
LLMs for Equities Feature Forecasting at Two Sigma with Ben Wellington - #736
17 Jun 2025
Contributed by Lukas
Today, we're joined by Ben Wellington, deputy head of feature forecasting at Two Sigma. We dig into ...
Zero-Shot Auto-Labeling: The End of Annotation for Computer Vision with Jason Corso - #735
10 Jun 2025
Contributed by Lukas
Today, we're joined by Jason Corso, co-founder of Voxel51 and professor at the University of Michiga...
Grokking, Generalization Collapse, and the Dynamics of Training Deep Neural Networks with Charles Martin - #734
05 Jun 2025
Contributed by Lukas
Today, we're joined by Charles Martin, founder of Calculation Consulting, to discuss Weight Watcher,...
Google I/O 2025 Special Edition - #733
28 May 2025
Contributed by Lukas
Today, I’m excited to share a special crossover edition of the podcast recorded live from Google I...
RAG Risks: Why Retrieval-Augmented LLMs are Not Safer with Sebastian Gehrmann - #732
21 May 2025
Contributed by Lukas
Today, we're joined by Sebastian Gehrmann, head of responsible AI in the Office of the CTO at Bloomb...
From Prompts to Policies: How RL Builds Better AI Agents with Mahesh Sathiamoorthy - #731
13 May 2025
Contributed by Lukas
Today, we're joined by Mahesh Sathiamoorthy, co-founder and CEO of Bespoke Labs, to discuss how rein...
How OpenAI Builds AI Agents That Think and Act with Josh Tobin - #730
06 May 2025
Contributed by Lukas
Today, we're joined by Josh Tobin, member of technical staff at OpenAI, to discuss the company’s a...
CTIBench: Evaluating LLMs in Cyber Threat Intelligence with Nidhi Rastogi - #729
30 Apr 2025
Contributed by Lukas
Today, we're joined by Nidhi Rastogi, assistant professor at Rochester Institute of Technology to di...
Generative Benchmarking with Kelly Hong - #728
23 Apr 2025
Contributed by Lukas
In this episode, Kelly Hong, a researcher at Chroma, joins us to discuss "Generative Benchmarking," ...
Exploring the Biology of LLMs with Circuit Tracing with Emmanuel Ameisen - #727
14 Apr 2025
Contributed by Lukas
In this episode, Emmanuel Ameisen, a research engineer at Anthropic, returns to discuss two recent p...
Teaching LLMs to Self-Reflect with Reinforcement Learning with Maohao Shen - #726
08 Apr 2025
Contributed by Lukas
Today, we're joined by Maohao Shen, PhD student at MIT to discuss his paper, “Satori: Reinforcemen...
Waymo's Foundation Model for Autonomous Driving with Drago Anguelov - #725
31 Mar 2025
Contributed by Lukas
Today, we're joined by Drago Anguelov, head of AI foundations at Waymo, for a deep dive into the rol...
Dynamic Token Merging for Efficient Byte-level Language Models with Julie Kallini - #724
24 Mar 2025
Contributed by Lukas
Today, we're joined by Julie Kallini, PhD student at Stanford University to discuss her recent paper...
Scaling Up Test-Time Compute with Latent Reasoning with Jonas Geiping - #723
17 Mar 2025
Contributed by Lukas
Today, we're joined by Jonas Geiping, research group leader at Ellis Institute and the Max Planck In...
Imagine while Reasoning in Space: Multimodal Visualization-of-Thought with Chengzu Li - #722
10 Mar 2025
Contributed by Lukas
Today, we're joined by Chengzu Li, PhD student at the University of Cambridge to discuss his recent ...
Inside s1: An o1-Style Reasoning Model That Cost Under $50 to Train with Niklas Muennighoff - #721
03 Mar 2025
Contributed by Lukas
Today, we're joined by Niklas Muennighoff, a PhD student at Stanford University, to discuss his pape...
Accelerating AI Training and Inference with AWS Trainium2 with Ron Diamant - #720
24 Feb 2025
Contributed by Lukas
Today, we're joined by Ron Diamant, chief architect for Trainium at Amazon Web Services, to discuss ...
π0: A Foundation Model for Robotics with Sergey Levine - #719
18 Feb 2025
Contributed by Lukas
Today, we're joined by Sergey Levine, associate professor at UC Berkeley and co-founder of Physical ...
AI Trends 2025: AI Agents and Multi-Agent Systems with Victor Dibia - #718
10 Feb 2025
Contributed by Lukas
Today we’re joined by Victor Dibia, principal research software engineer at Microsoft Research, to...
Speculative Decoding and Efficient LLM Inference with Chris Lott - #717
04 Feb 2025
Contributed by Lukas
Today, we're joined by Chris Lott, senior director of engineering at Qualcomm AI Research to discuss...
Ensuring Privacy for Any LLM with Patricia Thaine - #716
28 Jan 2025
Contributed by Lukas
Today, we're joined by Patricia Thaine, co-founder and CEO of Private AI to discuss techniques for e...
AI Engineering Pitfalls with Chip Huyen - #715
21 Jan 2025
Contributed by Lukas
Today, we're joined by Chip Huyen, independent researcher and writer to discuss her new book, “AI ...
Evolving MLOps Platforms for Generative AI and Agents with Abhijit Bose - #714
13 Jan 2025
Contributed by Lukas
Today, we're joined by Abhijit Bose, head of enterprise AI and ML platforms at Capital One to discus...
Why Agents Are Stupid & What We Can Do About It with Dan Jeffries - #713
16 Dec 2024
Contributed by Lukas
Today, we're joined by Dan Jeffries, founder and CEO of Kentauros AI to discuss the challenges curre...
Automated Reasoning to Prevent LLM Hallucination with Byron Cook - #712
09 Dec 2024
Contributed by Lukas
Today, we're joined by Byron Cook, VP and distinguished scientist in the Automated Reasoning Group a...
AI at the Edge: Qualcomm AI Research at NeurIPS 2024 with Arash Behboodi - #711
03 Dec 2024
Contributed by Lukas
Today, we're joined by Arash Behboodi, director of engineering at Qualcomm AI Research to discuss th...
AI for Network Management with Shirley Wu - #710
19 Nov 2024
Contributed by Lukas
Today, we're joined by Shirley Wu, senior director of software engineering at Juniper Networks to di...
Why Your RAG System Is Broken, and How to Fix It with Jason Liu - #709
11 Nov 2024
Contributed by Lukas
Today, we're joined by Jason Liu, freelance AI consultant, advisor, and creator of the Instructor li...
An Agentic Mixture of Experts for DevOps with Sunil Mallya - #708
04 Nov 2024
Contributed by Lukas
Today we're joined by Sunil Mallya, CTO and co-founder of Flip AI. We discuss Flip’s incident debu...
Building AI Voice Agents with Scott Stephenson - #707
28 Oct 2024
Contributed by Lukas
Today, we're joined by Scott Stephenson, co-founder and CEO of Deepgram to discuss voice AI agents. ...
Is Artificial Superintelligence Imminent? with Tim Rocktäschel - #706
21 Oct 2024
Contributed by Lukas
Today, we're joined by Tim Rocktäschel, senior staff research scientist at Google DeepMind, profess...
ML Models for Safety-Critical Systems with Lucas García - #705
14 Oct 2024
Contributed by Lukas
Today, we're joined by Lucas García, principal product manager for deep learning at MathWorks to di...
AI Agents: Substance or Snake Oil with Arvind Narayanan - #704
07 Oct 2024
Contributed by Lukas
Today, we're joined by Arvind Narayanan, professor of Computer Science at Princeton University to di...
AI Agents for Data Analysis with Shreya Shankar - #703
30 Sep 2024
Contributed by Lukas
Today, we're joined by Shreya Shankar, a PhD student at UC Berkeley to discuss DocETL, a declarative...
Stealing Part of a Production Language Model with Nicholas Carlini - #702
23 Sep 2024
Contributed by Lukas
Today, we're joined by Nicholas Carlini, research scientist at Google DeepMind to discuss adversaria...
Supercharging Developer Productivity with ChatGPT and Claude with Simon Willison - #701
16 Sep 2024
Contributed by Lukas
Today, we're joined by Simon Willison, independent researcher and creator of Datasette to discuss th...
Automated Design of Agentic Systems with Shengran Hu - #700
02 Sep 2024
Contributed by Lukas
Today, we're joined by Shengran Hu, a PhD student at the University of British Columbia, to discuss ...
The EU AI Act and Mitigating Bias in Automated Decisioning with Peter van der Putten - #699
27 Aug 2024
Contributed by Lukas
Today, we're joined by Peter van der Putten, director of the AI Lab at Pega and assistant professor ...
The Building Blocks of Agentic Systems with Harrison Chase - #698
19 Aug 2024
Contributed by Lukas
Today, we're joined by Harrison Chase, co-founder and CEO of LangChain to discuss LLM frameworks, ag...
Simplifying On-Device AI for Developers with Siddhika Nevrekar - #697
12 Aug 2024
Contributed by Lukas
Today, we're joined by Siddhika Nevrekar, AI Hub head at Qualcomm Technologies, to discuss on-device...
Genie: Generative Interactive Environments with Ashley Edwards - #696
05 Aug 2024
Contributed by Lukas
Today, we're joined by Ashley Edwards, a member of technical staff at Runway, to discuss Genie: Gene...
Bridging the Sim2real Gap in Robotics with Marius Memmel - #695
30 Jul 2024
Contributed by Lukas
Today, we're joined by Marius Memmel, a PhD student at the University of Washington, to discuss his ...
Building Real-World LLM Products with Fine-Tuning and More with Hamel Husain - #694
23 Jul 2024
Contributed by Lukas
Today, we're joined by Hamel Husain, founder of Parlance Labs, to discuss the ins and outs of buildi...
Mamba, Mamba-2 and Post-Transformer Architectures for Generative AI with Albert Gu - #693
17 Jul 2024
Contributed by Lukas
Today, we're joined by Albert Gu, assistant professor at Carnegie Mellon University, to discuss his ...
Decoding Animal Behavior to Train Robots with EgoPet with Amir Bar - #692
09 Jul 2024
Contributed by Lukas
Today, we're joined by Amir Bar, a PhD candidate at Tel Aviv University and UC Berkeley to discuss h...
How Microsoft Scales Testing and Safety for Generative AI with Sarah Bird - #691
01 Jul 2024
Contributed by Lukas
Today, we're joined by Sarah Bird, chief product officer of responsible AI at Microsoft. We discuss ...
Long Context Language Models and their Biological Applications with Eric Nguyen - #690
25 Jun 2024
Contributed by Lukas
Today, we're joined by Eric Nguyen, PhD student at Stanford University. In our conversation, we expl...
Accelerating Sustainability with AI with Andres Ravinet - #689
18 Jun 2024
Contributed by Lukas
Today, we're joined by Andres Ravinet, sustainability global black belt at Microsoft, to discuss the...
Gen AI at the Edge: Qualcomm AI Research at CVPR 2024 with Fatih Porikli - #688
10 Jun 2024
Contributed by Lukas
Today we’re joined by Fatih Porikli, senior director of technology at Qualcomm AI Research. In our...
Energy Star Ratings for AI Models with Sasha Luccioni - #687
03 Jun 2024
Contributed by Lukas
Today, we're joined by Sasha Luccioni, AI and Climate lead at Hugging Face, to discuss the environme...
Language Understanding and LLMs with Christopher Manning - #686
27 May 2024
Contributed by Lukas
Today, we're joined by Christopher Manning, the Thomas M. Siebel professor in Machine Learning at St...
Chronos: Learning the Language of Time Series with Abdul Fatir Ansari - #685
20 May 2024
Contributed by Lukas
Today we're joined by Abdul Fatir Ansari, a machine learning scientist at AWS AI Labs in Berlin, to ...
Powering AI with the World's Largest Computer Chip with Joel Hestness - #684
13 May 2024
Contributed by Lukas
Today we're joined by Joel Hestness, principal research scientist and lead of the core machine learn...
AI for Power & Energy with Laurent Boinot - #683
07 May 2024
Contributed by Lukas
Today we're joined by Laurent Boinot, power and utilities lead for the Americas at Microsoft, to dis...
Controlling Fusion Reactor Instability with Deep Reinforcement Learning with Aza Jalalvand - #682
29 Apr 2024
Contributed by Lukas
Today we're joined by Azarakhsh (Aza) Jalalvand, a research scholar at Princeton University, to disc...
GraphRAG: Knowledge Graphs for AI Applications with Kirk Marple - #681
22 Apr 2024
Contributed by Lukas
Today we're joined by Kirk Marple, CEO and founder of Graphlit, to explore the emerging paradigm of ...
Teaching Large Language Models to Reason with Reinforcement Learning with Alex Havrilla - #680
16 Apr 2024
Contributed by Lukas
Today we're joined by Alex Havrilla, a PhD student at Georgia Tech, to discuss "Teaching Large Langu...
Localizing and Editing Knowledge in LLMs with Peter Hase - #679
08 Apr 2024
Contributed by Lukas
Today we're joined by Peter Hase, a fifth-year PhD student at the University of North Carolina NLP l...
Coercing LLMs to Do and Reveal (Almost) Anything with Jonas Geiping - #678
01 Apr 2024
Contributed by Lukas
Today we're joined by Jonas Geiping, a research group leader at the ELLIS Institute, to explore his ...
V-JEPA, AI Reasoning from a Non-Generative Architecture with Mido Assran - #677
25 Mar 2024
Contributed by Lukas
Today we’re joined by Mido Assran, a research scientist at Meta’s Fundamental AI Research (FAIR)...
Video as a Universal Interface for AI Reasoning with Sherry Yang - #676
18 Mar 2024
Contributed by Lukas
Today we’re joined by Sherry Yang, senior research scientist at Google DeepMind and a PhD student ...
Assessing the Risks of Open AI Models with Sayash Kapoor - #675
11 Mar 2024
Contributed by Lukas
Today we’re joined by Sayash Kapoor, a Ph.D. student in the Department of Computer Science at Prin...
OLMo: Everything You Need to Train an Open Source LLM with Akshita Bhagia - #674
04 Mar 2024
Contributed by Lukas
Today we’re joined by Akshita Bhagia, a senior research engineer at the Allen Institute for AI. Ak...
Training Data Locality and Chain-of-Thought Reasoning in LLMs with Ben Prystawski - #673
26 Feb 2024
Contributed by Lukas
Today we’re joined by Ben Prystawski, a PhD student in the Department of Psychology at Stanford Un...
Reasoning Over Complex Documents with DocLLM with Armineh Nourbakhsh - #672
19 Feb 2024
Contributed by Lukas
Today we're joined by Armineh Nourbakhsh of JP Morgan AI Research to discuss the development and cap...
Are Emergent Behaviors in LLMs an Illusion? with Sanmi Koyejo - #671
12 Feb 2024
Contributed by Lukas
Today we’re joined by Sanmi Koyejo, assistant professor at Stanford University, to continue our Ne...
AI Trends 2024: Reinforcement Learning in the Age of LLMs with Kamyar Azizzadenesheli - #670
05 Feb 2024
Contributed by Lukas
Today we’re joined by Kamyar Azizzadenesheli, a staff researcher at Nvidia, to continue our AI Tre...
Building and Deploying Real-World RAG Applications with Ram Sriharsha - #669
29 Jan 2024
Contributed by Lukas
Today we’re joined by Ram Sriharsha, VP of engineering at Pinecone. In our conversation, we dive i...
Nightshade: Data Poisoning to Fight Generative AI with Ben Zhao - #668
22 Jan 2024
Contributed by Lukas
Today we’re joined by Ben Zhao, a Neubauer professor of computer science at the University of Chic...
Learning Transformer Programs with Dan Friedman - #667
15 Jan 2024
Contributed by Lukas
Today, we continue our NeurIPS series with Dan Friedman, a PhD student in the Princeton NLP group. I...
AI Trends 2024: Machine Learning & Deep Learning with Thomas Dietterich - #666
08 Jan 2024
Contributed by Lukas
Today we continue our AI Trends 2024 series with a conversation with Thomas Dietterich, distinguishe...
AI Trends 2024: Computer Vision with Naila Murray - #665
02 Jan 2024
Contributed by Lukas
Today we kick off our AI Trends 2024 series with a conversation with Naila Murray, director of AI re...
Are Vector DBs the Future Data Platform for AI? with Ed Anuff - #664
28 Dec 2023
Contributed by Lukas
Today we’re joined by Ed Anuff, chief product officer at DataStax. In our conversation, we discuss...
Quantizing Transformers by Helping Attention Heads Do Nothing with Markus Nagel - #663
26 Dec 2023
Contributed by Lukas
Today we’re joined by Markus Nagel, research scientist at Qualcomm AI Research, who helps us kick ...
Responsible AI in the Generative Era with Michael Kearns - #662
22 Dec 2023
Contributed by Lukas
Today we’re joined by Michael Kearns, professor in the Department of Computer and Information Scie...
Edutainment for AI and AWS PartyRock with Mike Miller - #661
18 Dec 2023
Contributed by Lukas
Today we’re joined by Mike Miller, director of product at AWS responsible for the company’s “e...
Data, Systems and ML for Visual Understanding with Cody Coleman - #660
14 Dec 2023
Contributed by Lukas
Today we’re joined by Cody Coleman, co-founder and CEO of Coactive AI. In our conversation with Co...
Patterns and Middleware for LLM Applications with Kyle Roche - #659
11 Dec 2023
Contributed by Lukas
Today we’re joined by Kyle Roche, founder and CEO of Griptape to discuss patterns and middleware f...
AI Access and Inclusivity as a Technical Challenge with Prem Natarajan - #658
04 Dec 2023
Contributed by Lukas
Today we’re joined by Prem Natarajan, chief scientist and head of enterprise AI at Capital One. In...