The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
Episodes
Why Vision Language Models Ignore What They See with Munawar Hayat - #758
09 Dec 2025
Contributed by Lukas
In this episode, we’re joined by Munawar Hayat, researcher at Qualcomm AI Research, to discuss a series of papers presented at NeurIPS 2025 focusing...
Scaling Agentic Inference Across Heterogeneous Compute with Zain Asgar - #757
02 Dec 2025
Contributed by Lukas
In this episode, Zain Asgar, co-founder and CEO of Gimlet Labs, joins us to discuss heterogeneous AI inference across diverse hardware. Zain argue...
Proactive Agents for the Web with Devi Parikh - #756
19 Nov 2025
Contributed by Lukas
Today, we're joined by Devi Parikh, co-founder and co-CEO of Yutori, to discuss browser use models and a future where we interact with the web through...
AI Orchestration for Smart Cities and the Enterprise with Robin Braun and Luke Norris - #755
12 Nov 2025
Contributed by Lukas
Today, we're joined by Robin Braun, VP of AI business development for hybrid cloud at HPE, and Luke Norris, co-founder and CEO of Kamiwaza, to discuss...
Building an AI Mathematician with Carina Hong - #754
04 Nov 2025
Contributed by Lukas
In this episode, Carina Hong, founder and CEO of Axiom, joins us to discuss her work building an "AI Mathematician." Carina explains why this is a piv...
High-Efficiency Diffusion Models for On-Device Image Generation and Editing with Hung Bui - #753
28 Oct 2025
Contributed by Lukas
In this episode, Hung Bui, Technology Vice President at Qualcomm, joins us to explore the latest high-efficiency techniques for running generative AI,...
Vibe Coding's Uncanny Valley with Alexandre Pesant - #752
22 Oct 2025
Contributed by Lukas
Today, we're joined by Alexandre Pesant, AI lead at Lovable, to discuss the evolution and practice of vibe coding. Alex shares his take o...
Dataflow Computing for AI Inference with Kunle Olukotun - #751
14 Oct 2025
Contributed by Lukas
In this episode, we're joined by Kunle Olukotun, professor of electrical engineering and computer science at Stanford University and co-founder and ch...
Recurrence and Attention for Long-Context Transformers with Jacob Buckman - #750
07 Oct 2025
Contributed by Lukas
Today, we're joined by Jacob Buckman, co-founder and CEO of Manifest AI, to discuss achieving long context in transformers. We discuss the bottlenecks ...
The Decentralized Future of Private AI with Illia Polosukhin - #749
30 Sep 2025
Contributed by Lukas
In this episode, Illia Polosukhin, a co-author of the seminal "Attention Is All You Need" paper and co-founder of Near AI, joins us to discuss his vis...
Inside Nano Banana 🍌 and the Future of Vision-Language Models with Oliver Wang - #748
23 Sep 2025
Contributed by Lukas
Today, we’re joined by Oliver Wang, principal scientist at Google DeepMind and tech lead for Gemini 2.5 Flash Image—better known by its code name,...
Is It Time to Rethink LLM Pre-Training? with Aditi Raghunathan - #747
16 Sep 2025
Contributed by Lukas
Today, we're joined by Aditi Raghunathan, assistant professor at Carnegie Mellon University, to discuss the limitations of LLMs and how we can build m...
Building an Immune System for AI Generated Software with Animesh Koratana - #746
09 Sep 2025
Contributed by Lukas
Today, we're joined by Animesh Koratana, founder and CEO of PlayerZero, to discuss his team’s approach to making agentic and AI-assisted coding tools...
Autoformalization and Verifiable Superintelligence with Christian Szegedy - #745
02 Sep 2025
Contributed by Lukas
In this episode, Christian Szegedy, Chief Scientist at Morph Labs, joins us to discuss how the application of formal mathematics and reasoning enables...
Multimodal AI Models on Apple Silicon with MLX with Prince Canuma - #744
26 Aug 2025
Contributed by Lukas
Today, we're joined by Prince Canuma, an ML engineer and open-source developer focused on optimizing AI inference on Apple Silicon devices. Prince sha...
Genie 3: A New Frontier for World Models with Jack Parker-Holder and Shlomi Fruchter - #743
19 Aug 2025
Contributed by Lukas
Today, we're joined by Jack Parker-Holder and Shlomi Fruchter, researchers at Google DeepMind, to discuss the recent release of Genie 3, a model capab...
Closing the Loop Between AI Training and Inference with Lin Qiao - #742
12 Aug 2025
Contributed by Lukas
In this episode, we're joined by Lin Qiao, CEO and co-founder of Fireworks AI. Drawing on key lessons from her time building PyTorch, Lin shares her p...
Context Engineering for Productive AI Agents with Filip Kozera - #741
29 Jul 2025
Contributed by Lukas
In this episode, Filip Kozera, founder and CEO of Wordware, explains his approach to building agentic workflows where natural language serves as the n...
Infrastructure Scaling and Compound AI Systems with Jared Quincy Davis - #740
22 Jul 2025
Contributed by Lukas
In this episode, Jared Quincy Davis, founder and CEO at Foundry, introduces the concept of "compound AI systems," which allows users to create powerfu...
Building Voice AI Agents That Don’t Suck with Kwindla Kramer - #739
15 Jul 2025
Contributed by Lukas
In this episode, Kwindla Kramer, co-founder and CEO of Daily and creator of the open source Pipecat framework, joins us to discuss the architecture an...
Distilling Transformers and Diffusion Models for Robust Edge Use Cases with Fatih Porikli - #738
09 Jul 2025
Contributed by Lukas
Today, we're joined by Fatih Porikli, senior director of technology at Qualcomm AI Research, for an in-depth look at several of Qualcomm's accepted pap...
Building the Internet of Agents with Vijoy Pandey - #737
24 Jun 2025
Contributed by Lukas
Today, we're joined by Vijoy Pandey, SVP and general manager at Outshift by Cisco, to discuss a foundational challenge for the enterprise: how do we ma...
LLMs for Equities Feature Forecasting at Two Sigma with Ben Wellington - #736
17 Jun 2025
Contributed by Lukas
Today, we're joined by Ben Wellington, deputy head of feature forecasting at Two Sigma. We dig into the team’s end-to-end approach to leveraging AI ...
Zero-Shot Auto-Labeling: The End of Annotation for Computer Vision with Jason Corso - #735
10 Jun 2025
Contributed by Lukas
Today, we're joined by Jason Corso, co-founder of Voxel51 and professor at the University of Michigan, to explore automated labeling in computer visio...
Grokking, Generalization Collapse, and the Dynamics of Training Deep Neural Networks with Charles Martin - #734
05 Jun 2025
Contributed by Lukas
Today, we're joined by Charles Martin, founder of Calculation Consulting, to discuss WeightWatcher, an open-source tool for analyzing and improving D...
Google I/O 2025 Special Edition - #733
28 May 2025
Contributed by Lukas
Today, I’m excited to share a special crossover edition of the podcast recorded live from Google I/O 2025! In this episode, I join Shawn Wang aka Sw...
RAG Risks: Why Retrieval-Augmented LLMs are Not Safer with Sebastian Gehrmann - #732
21 May 2025
Contributed by Lukas
Today, we're joined by Sebastian Gehrmann, head of responsible AI in the Office of the CTO at Bloomberg, to discuss AI safety in retrieval-augmented g...
From Prompts to Policies: How RL Builds Better AI Agents with Mahesh Sathiamoorthy - #731
13 May 2025
Contributed by Lukas
Today, we're joined by Mahesh Sathiamoorthy, co-founder and CEO of Bespoke Labs, to discuss how reinforcement learning (RL) is reshaping the way we bu...
How OpenAI Builds AI Agents That Think and Act with Josh Tobin - #730
06 May 2025
Contributed by Lukas
Today, we're joined by Josh Tobin, member of technical staff at OpenAI, to discuss the company’s approach to building AI agents. We cover OpenAI's t...
CTIBench: Evaluating LLMs in Cyber Threat Intelligence with Nidhi Rastogi - #729
30 Apr 2025
Contributed by Lukas
Today, we're joined by Nidhi Rastogi, assistant professor at Rochester Institute of Technology, to discuss Cyber Threat Intelligence (CTI), focusing on...
Generative Benchmarking with Kelly Hong - #728
23 Apr 2025
Contributed by Lukas
In this episode, Kelly Hong, a researcher at Chroma, joins us to discuss "Generative Benchmarking," a novel approach to evaluating retrieval systems, ...
Exploring the Biology of LLMs with Circuit Tracing with Emmanuel Ameisen - #727
14 Apr 2025
Contributed by Lukas
In this episode, Emmanuel Ameisen, a research engineer at Anthropic, returns to discuss two recent papers: "Circuit Tracing: Revealing Language Model ...
Teaching LLMs to Self-Reflect with Reinforcement Learning with Maohao Shen - #726
08 Apr 2025
Contributed by Lukas
Today, we're joined by Maohao Shen, PhD student at MIT, to discuss his paper, “Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances L...
Waymo's Foundation Model for Autonomous Driving with Drago Anguelov - #725
31 Mar 2025
Contributed by Lukas
Today, we're joined by Drago Anguelov, head of AI foundations at Waymo, for a deep dive into the role of foundation models in autonomous driving. Drag...
Dynamic Token Merging for Efficient Byte-level Language Models with Julie Kallini - #724
24 Mar 2025
Contributed by Lukas
Today, we're joined by Julie Kallini, PhD student at Stanford University, to discuss her recent papers, “MrT5: Dynamic Token Merging for Efficient By...
Scaling Up Test-Time Compute with Latent Reasoning with Jonas Geiping - #723
17 Mar 2025
Contributed by Lukas
Today, we're joined by Jonas Geiping, research group leader at the ELLIS Institute and the Max Planck Institute for Intelligent Systems, to discuss his rec...
Imagine while Reasoning in Space: Multimodal Visualization-of-Thought with Chengzu Li - #722
10 Mar 2025
Contributed by Lukas
Today, we're joined by Chengzu Li, PhD student at the University of Cambridge, to discuss his recent paper, “Imagine while Reasoning in Space: Multim...
Inside s1: An o1-Style Reasoning Model That Cost Under $50 to Train with Niklas Muennighoff - #721
03 Mar 2025
Contributed by Lukas
Today, we're joined by Niklas Muennighoff, a PhD student at Stanford University, to discuss his paper, “S1: Simple Test-Time Scaling.” We explore ...
Accelerating AI Training and Inference with AWS Trainium2 with Ron Diamant - #720
24 Feb 2025
Contributed by Lukas
Today, we're joined by Ron Diamant, chief architect for Trainium at Amazon Web Services, to discuss hardware acceleration for generative AI and the de...
π0: A Foundation Model for Robotics with Sergey Levine - #719
18 Feb 2025
Contributed by Lukas
Today, we're joined by Sergey Levine, associate professor at UC Berkeley and co-founder of Physical Intelligence, to discuss π0 (pi-zero), a general-...
AI Trends 2025: AI Agents and Multi-Agent Systems with Victor Dibia - #718
10 Feb 2025
Contributed by Lukas
Today we’re joined by Victor Dibia, principal research software engineer at Microsoft Research, to explore the key trends and advancements in AI age...
Speculative Decoding and Efficient LLM Inference with Chris Lott - #717
04 Feb 2025
Contributed by Lukas
Today, we're joined by Chris Lott, senior director of engineering at Qualcomm AI Research, to discuss accelerating large language model inference. We e...
Ensuring Privacy for Any LLM with Patricia Thaine - #716
28 Jan 2025
Contributed by Lukas
Today, we're joined by Patricia Thaine, co-founder and CEO of Private AI, to discuss techniques for ensuring privacy, data minimization, and compliance...
AI Engineering Pitfalls with Chip Huyen - #715
21 Jan 2025
Contributed by Lukas
Today, we're joined by Chip Huyen, independent researcher and writer, to discuss her new book, “AI Engineering.” We dig into the definition of AI e...
Evolving MLOps Platforms for Generative AI and Agents with Abhijit Bose - #714
13 Jan 2025
Contributed by Lukas
Today, we're joined by Abhijit Bose, head of enterprise AI and ML platforms at Capital One, to discuss the evolution of the company’s approach and in...
Why Agents Are Stupid & What We Can Do About It with Dan Jeffries - #713
16 Dec 2024
Contributed by Lukas
Today, we're joined by Dan Jeffries, founder and CEO of Kentauros AI, to discuss the challenges currently faced by those developing advanced AI agents....
Automated Reasoning to Prevent LLM Hallucination with Byron Cook - #712
09 Dec 2024
Contributed by Lukas
Today, we're joined by Byron Cook, VP and distinguished scientist in the Automated Reasoning Group at AWS, to dig into the underlying technology behind...
AI at the Edge: Qualcomm AI Research at NeurIPS 2024 with Arash Behboodi - #711
03 Dec 2024
Contributed by Lukas
Today, we're joined by Arash Behboodi, director of engineering at Qualcomm AI Research, to discuss the papers and workshops Qualcomm will be presenting...
AI for Network Management with Shirley Wu - #710
19 Nov 2024
Contributed by Lukas
Today, we're joined by Shirley Wu, senior director of software engineering at Juniper Networks, to discuss how machine learning and artificial intellig...
Why Your RAG System Is Broken, and How to Fix It with Jason Liu - #709
11 Nov 2024
Contributed by Lukas
Today, we're joined by Jason Liu, freelance AI consultant, advisor, and creator of the Instructor library, to discuss all things retrieval-augmented ge...
An Agentic Mixture of Experts for DevOps with Sunil Mallya - #708
04 Nov 2024
Contributed by Lukas
Today we're joined by Sunil Mallya, CTO and co-founder of Flip AI. We discuss Flip’s incident debugging system for DevOps, which was built using a c...
Building AI Voice Agents with Scott Stephenson - #707
28 Oct 2024
Contributed by Lukas
Today, we're joined by Scott Stephenson, co-founder and CEO of Deepgram, to discuss voice AI agents. We explore the importance of perception, understan...
Is Artificial Superintelligence Imminent? with Tim Rocktäschel - #706
21 Oct 2024
Contributed by Lukas
Today, we're joined by Tim Rocktäschel, senior staff research scientist at Google DeepMind, professor of Artificial Intelligence at University Colleg...
ML Models for Safety-Critical Systems with Lucas García - #705
14 Oct 2024
Contributed by Lukas
Today, we're joined by Lucas García, principal product manager for deep learning at MathWorks, to discuss incorporating ML models into safety-critical...
AI Agents: Substance or Snake Oil with Arvind Narayanan - #704
07 Oct 2024
Contributed by Lukas
Today, we're joined by Arvind Narayanan, professor of Computer Science at Princeton University, to discuss his recent works, AI Agents That Matter and ...
AI Agents for Data Analysis with Shreya Shankar - #703
30 Sep 2024
Contributed by Lukas
Today, we're joined by Shreya Shankar, a PhD student at UC Berkeley, to discuss DocETL, a declarative system for building and optimizing LLM-powered da...
Stealing Part of a Production Language Model with Nicholas Carlini - #702
23 Sep 2024
Contributed by Lukas
Today, we're joined by Nicholas Carlini, research scientist at Google DeepMind, to discuss adversarial machine learning and model security, focusing on...
Supercharging Developer Productivity with ChatGPT and Claude with Simon Willison - #701
16 Sep 2024
Contributed by Lukas
Today, we're joined by Simon Willison, independent researcher and creator of Datasette, to discuss the many ways software developers and engineers can ...
Automated Design of Agentic Systems with Shengran Hu - #700
02 Sep 2024
Contributed by Lukas
Today, we're joined by Shengran Hu, a PhD student at the University of British Columbia, to discuss Automated Design of Agentic Systems (ADAS), an app...
The EU AI Act and Mitigating Bias in Automated Decisioning with Peter van der Putten - #699
27 Aug 2024
Contributed by Lukas
Today, we're joined by Peter van der Putten, director of the AI Lab at Pega and assistant professor of AI at Leiden University. We discuss the newly a...
The Building Blocks of Agentic Systems with Harrison Chase - #698
19 Aug 2024
Contributed by Lukas
Today, we're joined by Harrison Chase, co-founder and CEO of LangChain, to discuss LLM frameworks, agentic systems, RAG, evaluation, and more. We dig i...
Simplifying On-Device AI for Developers with Siddhika Nevrekar - #697
12 Aug 2024
Contributed by Lukas
Today, we're joined by Siddhika Nevrekar, AI Hub head at Qualcomm Technologies, to discuss on-device AI and how to make it easier for developers to ta...
Genie: Generative Interactive Environments with Ashley Edwards - #696
05 Aug 2024
Contributed by Lukas
Today, we're joined by Ashley Edwards, a member of technical staff at Runway, to discuss Genie: Generative Interactive Environments, a system for crea...
Bridging the Sim2real Gap in Robotics with Marius Memmel - #695
30 Jul 2024
Contributed by Lukas
Today, we're joined by Marius Memmel, a PhD student at the University of Washington, to discuss his research on sim-to-real transfer approaches for de...
Building Real-World LLM Products with Fine-Tuning and More with Hamel Husain - #694
23 Jul 2024
Contributed by Lukas
Today, we're joined by Hamel Husain, founder of Parlance Labs, to discuss the ins and outs of building real-world products using large language models...
Mamba, Mamba-2 and Post-Transformer Architectures for Generative AI with Albert Gu - #693
17 Jul 2024
Contributed by Lukas
Today, we're joined by Albert Gu, assistant professor at Carnegie Mellon University, to discuss his research on post-transformer architectures for mul...
Decoding Animal Behavior to Train Robots with EgoPet with Amir Bar - #692
09 Jul 2024
Contributed by Lukas
Today, we're joined by Amir Bar, a PhD candidate at Tel Aviv University and UC Berkeley, to discuss his research on visual-based learning, including hi...
How Microsoft Scales Testing and Safety for Generative AI with Sarah Bird - #691
01 Jul 2024
Contributed by Lukas
Today, we're joined by Sarah Bird, chief product officer of responsible AI at Microsoft. We discuss the testing and evaluation techniques Microsoft ap...
Long Context Language Models and their Biological Applications with Eric Nguyen - #690
25 Jun 2024
Contributed by Lukas
Today, we're joined by Eric Nguyen, PhD student at Stanford University. In our conversation, we explore his research on long context foundation models...
Accelerating Sustainability with AI with Andres Ravinet - #689
18 Jun 2024
Contributed by Lukas
Today, we're joined by Andres Ravinet, sustainability global black belt at Microsoft, to discuss the role of AI in sustainability. We explore real-wor...
Gen AI at the Edge: Qualcomm AI Research at CVPR 2024 with Fatih Porikli - #688
10 Jun 2024
Contributed by Lukas
Today we’re joined by Fatih Porikli, senior director of technology at Qualcomm AI Research. In our conversation, we covered several of the Qualcomm ...
Energy Star Ratings for AI Models with Sasha Luccioni - #687
03 Jun 2024
Contributed by Lukas
Today, we're joined by Sasha Luccioni, AI and Climate lead at Hugging Face, to discuss the environmental impact of AI models. We dig into her recent r...
Language Understanding and LLMs with Christopher Manning - #686
27 May 2024
Contributed by Lukas
Today, we're joined by Christopher Manning, the Thomas M. Siebel professor in Machine Learning at Stanford University and a recent recipient of the 20...
Chronos: Learning the Language of Time Series with Abdul Fatir Ansari - #685
20 May 2024
Contributed by Lukas
Today we're joined by Abdul Fatir Ansari, a machine learning scientist at AWS AI Labs in Berlin, to discuss his paper, "Chronos: Learning the Language...
Powering AI with the World's Largest Computer Chip with Joel Hestness - #684
13 May 2024
Contributed by Lukas
Today we're joined by Joel Hestness, principal research scientist and lead of the core machine learning team at Cerebras. We discuss Cerebras’ custo...
AI for Power & Energy with Laurent Boinot - #683
07 May 2024
Contributed by Lukas
Today we're joined by Laurent Boinot, power and utilities lead for the Americas at Microsoft, to discuss the intersection of AI and energy infrastruct...
Controlling Fusion Reactor Instability with Deep Reinforcement Learning with Aza Jalalvand - #682
29 Apr 2024
Contributed by Lukas
Today we're joined by Azarakhsh (Aza) Jalalvand, a research scholar at Princeton University, to discuss his work using deep reinforcement learning to ...
GraphRAG: Knowledge Graphs for AI Applications with Kirk Marple - #681
22 Apr 2024
Contributed by Lukas
Today we're joined by Kirk Marple, CEO and founder of Graphlit, to explore the emerging paradigm of "GraphRAG," or Graph Retrieval Augmented Generatio...
Teaching Large Language Models to Reason with Reinforcement Learning with Alex Havrilla - #680
16 Apr 2024
Contributed by Lukas
Today we're joined by Alex Havrilla, a PhD student at Georgia Tech, to discuss "Teaching Large Language Models to Reason with Reinforcement Learning."...
Localizing and Editing Knowledge in LLMs with Peter Hase - #679
08 Apr 2024
Contributed by Lukas
Today we're joined by Peter Hase, a fifth-year PhD student at the University of North Carolina NLP lab. We discuss "scalable oversight", and the impor...
Coercing LLMs to Do and Reveal (Almost) Anything with Jonas Geiping - #678
01 Apr 2024
Contributed by Lukas
Today we're joined by Jonas Geiping, a research group leader at the ELLIS Institute, to explore his paper: "Coercing LLMs to Do and Reveal (Almost) An...
V-JEPA, AI Reasoning from a Non-Generative Architecture with Mido Assran - #677
25 Mar 2024
Contributed by Lukas
Today we’re joined by Mido Assran, a research scientist at Meta’s Fundamental AI Research (FAIR). In this conversation, we discuss V-JEPA, a new m...
Video as a Universal Interface for AI Reasoning with Sherry Yang - #676
18 Mar 2024
Contributed by Lukas
Today we’re joined by Sherry Yang, senior research scientist at Google DeepMind and a PhD student at UC Berkeley. In this interview, we discuss her ...
Assessing the Risks of Open AI Models with Sayash Kapoor - #675
11 Mar 2024
Contributed by Lukas
Today we’re joined by Sayash Kapoor, a Ph.D. student in the Department of Computer Science at Princeton University. Sayash walks us through his pape...
OLMo: Everything You Need to Train an Open Source LLM with Akshita Bhagia - #674
04 Mar 2024
Contributed by Lukas
Today we’re joined by Akshita Bhagia, a senior research engineer at the Allen Institute for AI. Akshita joins us to discuss OLMo, a new open source ...
Training Data Locality and Chain-of-Thought Reasoning in LLMs with Ben Prystawski - #673
26 Feb 2024
Contributed by Lukas
Today we’re joined by Ben Prystawski, a PhD student in the Department of Psychology at Stanford University working at the intersection of cognitive ...
Reasoning Over Complex Documents with DocLLM with Armineh Nourbakhsh - #672
19 Feb 2024
Contributed by Lukas
Today we're joined by Armineh Nourbakhsh of JP Morgan AI Research to discuss the development and capabilities of DocLLM, a layout-aware large language...
Are Emergent Behaviors in LLMs an Illusion? with Sanmi Koyejo - #671
12 Feb 2024
Contributed by Lukas
Today we’re joined by Sanmi Koyejo, assistant professor at Stanford University, to continue our NeurIPS 2023 series. In our conversation, Sanmi disc...
AI Trends 2024: Reinforcement Learning in the Age of LLMs with Kamyar Azizzadenesheli - #670
05 Feb 2024
Contributed by Lukas
Today we’re joined by Kamyar Azizzadenesheli, a staff researcher at Nvidia, to continue our AI Trends 2024 series. In our conversation, Kamyar updat...
Building and Deploying Real-World RAG Applications with Ram Sriharsha - #669
29 Jan 2024
Contributed by Lukas
Today we’re joined by Ram Sriharsha, VP of engineering at Pinecone. In our conversation, we dive into the topic of vector databases and retrieval au...
Nightshade: Data Poisoning to Fight Generative AI with Ben Zhao - #668
22 Jan 2024
Contributed by Lukas
Today we’re joined by Ben Zhao, a Neubauer professor of computer science at the University of Chicago. In our conversation, we explore his research ...
Learning Transformer Programs with Dan Friedman - #667
15 Jan 2024
Contributed by Lukas
Today, we continue our NeurIPS series with Dan Friedman, a PhD student in the Princeton NLP group. In our conversation, we explore his research on mec...
AI Trends 2024: Machine Learning & Deep Learning with Thomas Dietterich - #666
08 Jan 2024
Contributed by Lukas
Today we continue our AI Trends 2024 series with a conversation with Thomas Dietterich, distinguished professor emeritus at Oregon State University. A...
AI Trends 2024: Computer Vision with Naila Murray - #665
02 Jan 2024
Contributed by Lukas
Today we kick off our AI Trends 2024 series with a conversation with Naila Murray, director of AI research at Meta. In our conversation with Naila, we...
Are Vector DBs the Future Data Platform for AI? with Ed Anuff - #664
28 Dec 2023
Contributed by Lukas
Today we’re joined by Ed Anuff, chief product officer at DataStax. In our conversation, we discuss Ed’s insights on RAG, vector databases, embeddi...
Quantizing Transformers by Helping Attention Heads Do Nothing with Markus Nagel - #663
26 Dec 2023
Contributed by Lukas
Today we’re joined by Markus Nagel, research scientist at Qualcomm AI Research, who helps us kick off our coverage of NeurIPS 2023. In our conversat...
Responsible AI in the Generative Era with Michael Kearns - #662
22 Dec 2023
Contributed by Lukas
Today we’re joined by Michael Kearns, professor in the Department of Computer and Information Science at the University of Pennsylvania and an Amazo...
Edutainment for AI and AWS PartyRock with Mike Miller - #661
18 Dec 2023
Contributed by Lukas
Today we’re joined by Mike Miller, director of product at AWS responsible for the company’s “edutainment” products. In our conversation with M...
Data, Systems and ML for Visual Understanding with Cody Coleman - #660
14 Dec 2023
Contributed by Lukas
Today we’re joined by Cody Coleman, co-founder and CEO of Coactive AI. In our conversation with Cody, we discuss how Coactive has leveraged modern d...
Patterns and Middleware for LLM Applications with Kyle Roche - #659
11 Dec 2023
Contributed by Lukas
Today we’re joined by Kyle Roche, founder and CEO of Griptape, to discuss patterns and middleware for LLM applications. We dive into the emerging pat...