Build Wiz AI Show
Episodes
Pi - and self-modifying AI Agents
22 May 2026
Contributed by Lukas
Imagine a world where your software isn't just a static tool, but a living system that can actually rewrite and improve itself as you work. This e...
Code with Claude - London 2026
22 May 2026
Contributed by Lukas
Remember the magic of your first successful program—now imagine that feeling applied to AI agents that can solve decades-old bugs and manage thousan...
Google I/O 2026 keynote
20 May 2026
Contributed by Lukas
Imagine a world where your AI doesn't just suggest code but actually builds and deploys your entire app for you. This episode explores a major shi...
The Langchain Agent Development Keynote 2026
20 May 2026
Contributed by Lukas
Stop guessing and start building AI agents that actually work in the real world. This episode explores the essential framework used by top teams to bu...
Building the Software Factory: From Code to Autonomy
19 May 2026
Contributed by Lukas
Stop writing every line of code and start managing a fleet of autonomous AI agents that do the heavy lifting for you. This episode breaks down how to ...
Spec-Driven Development and Agentic Workflows in 2026
15 May 2026
Contributed by Lukas
Stop wrestling with AI-generated code that misses the mark and start building with architectural precision. This episode explores the shift toward Spe...
Efficient Pre-Training with Token Superposition
14 May 2026
Contributed by Lukas
Imagine training powerful AI models in less than half the time without sacrificing an ounce of performance,,. This episode breaks down a clever new te...
Skills at Scale: Building and Scaling Agentic Workflows
06 May 2026
Contributed by Lukas
Stop wasting time repeating the same basic instructions to your AI every time you start a new conversation. This episode explores the power of "s...
Jensen Huang on the AI Revolution 2026
05 May 2026
Contributed by Lukas
Forget everything you know about how computers work because a fundamental shift from simple searching to AI that can think and act is currently reinve...
Robotics' End Game: The Great Parallel to AGI
04 May 2026
Contributed by Lukas
What if robots could learn to master human tasks simply by watching videos of us, just like AI learned to talk by reading the internet? Nvidia’s Jim...
Andrej Karpathy at Sequoia - AI Ascent 2026: From Vibe Coding to Agentic Engineering
02 May 2026
Contributed by Lukas
What does it mean when one of the world’s leading AI pioneers says he’s never felt more behind as a programmer? Andrej Karpathy explores the radic...
The Cognitive Revolution: Sequoia AI Ascent 2026 Keynote
01 May 2026
Contributed by Lukas
What if the 100-year projects of the past could now be completed in just 100 days by autonomous AI agents? This episode explores the dawn of the Cogni...
Demis Hassabis on the Roadmap to General Intelligence
29 Apr 2026
Contributed by Lukas
Ever wonder what the world looks like when the ultimate tool for scientific discovery finally arrives? In this episode, Nobel laureate Demis Hassabis ...
AHE: Observability-Driven Evolution of Coding-Agent Harnesses
29 Apr 2026
Contributed by Lukas
What if AI agents could engineer their own "survival gear" to solve complex coding tasks? This episode dives into Agentic Harness Engineerin...
Claude Mythos Preview
08 Apr 2026
Contributed by Lukas
Imagine an AI so capable that its own creators decided it was simply too powerful to be released to the general public. This episode dives into the sy...
Anthropic Econimic Index Report 03/2026
26 Mar 2026
Contributed by Lukas
Is there a secret to mastering AI, or does it just come with practice?,. This episode explores the latest Anthropic Economic Index, which reveals that...
How to Ship Complex Features 10x Faster with AI Agents
22 Mar 2026
Contributed by Lukas
Stop wrestling with erratic prompts and discover how to ship complex features 10x faster by shifting from simple coding to high-leverage production wi...
The Era of AI Psychosis and Agentic Leverage
22 Mar 2026
Contributed by Lukas
Have you reached a state of "AI psychosis" where your typing speed is the only thing holding back a massive jump in personal capability? In ...
Attention Residuals - from Kimi
17 Mar 2026
Contributed by Lukas
Is the very foundation of modern large language models causing them to lose focus as they get deeper?, This episode explores Attention Residuals (Attn...
Nvidia 2026 GTC keynote - recap
16 Mar 2026
Contributed by Lukas
What if the future of technology isn't just about smarter software, but the birth of physical AI? We’re breaking down the biggest announcements ...
Why long context make AI dumber
14 Mar 2026
Contributed by Lukas
Forget the needle in the haystack—can your AI actually sculpt an answer from a mountain of data? This episode explores the "Michelangelo" ...
How Coding Agents Are Reshaping Engineering, Product and Design
10 Mar 2026
Contributed by Lukas
What if the traditional waterfall process of PRDs, mocks, and manual coding is officially dead? This episode explores how coding agents are fundamenta...
Securing AI Agents and Execution Engine
08 Mar 2026
Contributed by Lukas
What happens when your autonomous AI assistant decides to go rogue or has its core mission hijacked by a single malicious prompt? Join us as we explor...
The Blueprint for Engineering reliable AI Agents
17 Feb 2026
Contributed by Lukas
Are AI agents really the next "smartphone app" revolution, or is the lack of specialized development tools holding them back?, This episode ...
The complete guide to build skills for AI Agents - from Anthropic
15 Feb 2026
Contributed by Lukas
Stop re-explaining your process in every new chat and start building "Skills," the modular instruction sets that transform Claude into a spe...
Something big is happening
14 Feb 2026
Contributed by Lukas
Remember the quiet weeks before the world changed in 2020? We are currently in that same "this seems overblown" phase with AI, as an intelli...
AI Cybersecurity Trends and Defense Strategies for 2026
11 Feb 2026
Contributed by Lukas
Think your business is safe from cyber threats just because you have the latest software? Think again—2026 is officially the year AI moves from bein...
Agent World Model
11 Feb 2026
Contributed by Lukas
Ever wonder why even the most advanced AI agents struggle to handle complex tasks in the real world? Today, we explore the Agent World Model (AWM), an...
Catching AI Sleeper Agent - LLM Backdoors
05 Feb 2026
Contributed by Lukas
Could your trusted AI model be a hidden "sleeper agent" just waiting for a secret command to turn malicious? We explore a new methodology th...
AI 2026: Scaling Laws, China, and the Race for AGI
02 Feb 2026
Contributed by Lukas
Is the global AI landscape shifting toward a "DeepSeek moment" where cheaper, open-weight models from China challenge the dominance of US fr...
The Hidden Cost of AI: Is Automation Killing Your Skills?
01 Feb 2026
Contributed by Lukas
Is your favorite AI assistant a productivity powerhouse or a "shortcut" that’s secretly stalling your professional growth? We dive into ne...
How AI changes software engineering
30 Jan 2026
Contributed by Lukas
This episode explores how Generative AI is shifting software development from simple tool-based experimentation to a holistic transformation of engine...
The creator of Clawd - ship code without reading it.
29 Jan 2026
Contributed by Lukas
In this episode, Peter Steinberger, the creator of Clawbot, discusses his radical transition to an AI-driven workflow where he merges hundreds of comm...
Kimi 2.5 and Data Agent Swarms
28 Jan 2026
Contributed by Lukas
In today’s episode, we explore the evolution of AI from solitary models to data agent swarms, where specialized autonomous agents collaborate like a...
Claude's constitution
24 Jan 2026
Contributed by Lukas
In this episode, we explore Claude’s constitution, the foundational document that serves as the final authority on Anthropic's vision for the AI...
The World after AGI - Dario Amodei and Demis Hasssabis
23 Jan 2026
Contributed by Lukas
This episode features a landmark debate between Google DeepMind’s Demis Hassabis and Anthropic’s Dario Amodei regarding the imminent arrival of Ar...
Future of AI & Global Economy - Nvidia CEO Jensen Huang and BlackRock's Larry Fink
22 Jan 2026
Contributed by Lukas
In this episode, Nvidia CEO Jensen Huang and BlackRock’s Larry Fink explore how AI represents a massive platform shift and the largest infrastructur...
Recursive LM - model solves context rot
19 Jan 2026
Contributed by Lukas
This episode explores Recursive Language Models (RLMs), a groundbreaking inference strategy that enables large language models to process prompts two ...
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models
10 Jan 2026
Contributed by Lukas
This episode explores Agentic Context Engineering (ACE), a breakthrough framework that transforms LLM contexts into evolving playbooks that accumulate...
DSPy: Programming and Optimizing LLM Workflows with Systems Mindsets
10 Jan 2026
Contributed by Lukas
This episode explores DSPy, a declarative framework that enables developers to build modular software by treating Large Language Models as first-class...
Based on Claude Agent SDK — Thariq Shihipar, Anthropic
05 Jan 2026
Contributed by Lukas
Drawing on the sources, this episode explores the Claude Agent SDK, an opinionated framework built on Claude Code that enables the creation of autonom...
Cybersecurity Trends in 2026: Shadow AI, Quantum & Deepfakes
04 Jan 2026
Contributed by Lukas
This episode explores the cybersecurity landscape of 2026, where autonomous AI agents and a 1,500% surge in deepfakes have significantly amplified org...
DeepSeek: Manifold-Constrained Hyper-Connections (mHC)
03 Jan 2026
Contributed by Lukas
This episode explores Manifold-Constrained Hyper-Connections (mHC), a framework designed to solve the training instability and memory overhead issues ...
AI agent trends 2026 - Google
30 Dec 2025
Contributed by Lukas
This episode explores the five critical shifts driven by AI agents that are set to redefine professional roles and business workflows by 2026. We dive...
Building reliable AI Agent with domain memory
29 Dec 2025
Contributed by Lukas
In this episode, we explore the shift from basic AI assistants to sophisticated agentic coding, moving beyond the "Dumb Zone" where oversize...
METR's Benchmarks vs Economics: The AI capability measurement gap
28 Dec 2025
Contributed by Lukas
In this episode, drawing on insights from the sources, METR researcher Joel Becker explores the widening gap between AI’s exponential progress on be...
Adaptation of Agentic AI
26 Dec 2025
Contributed by Lukas
This episode explores a unified framework for adapting agentic AI systems, detailing how foundation models are specialized to plan, reason, and master...
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
25 Dec 2025
Contributed by Lukas
In this episode, we explore Agent-R1, a modular framework designed to transform Large Language Models from static text generators into autonomous agen...
Career Advice in AI
22 Dec 2025
Contributed by Lukas
In this episode, Andrew Ng and Lawrence Moroni discuss why now is the "golden age" for building a career in AI, highlighting how the complex...
Leadership in AI Assisted Engineering
21 Dec 2025
Contributed by Lukas
This episode explores the critical July 2025 maturation point, where the initial generative AI hype gives way to the practical challenges of securing ...
AI Consulting in Practice
19 Dec 2025
Contributed by Lukas
This episode explores the rapid shift in enterprise AI adoption, highlighting how production-level agent deployment has surged as organizations move p...
Google - 5 days: Prototype to Production
19 Dec 2025
Contributed by Lukas
Join us as we tackle the "last mile" of AI Agents series, exploring the rigorous operational discipline required to transform fragile protot...
Google - 5 days: Agent Quality
18 Dec 2025
Contributed by Lukas
In this episode of our AI Agent series, we synthesize our previous discussions on evaluation frameworks and observability into a cohesive operational ...
Google - 5 days: Context Engineering: Sessions & Memory
17 Dec 2025
Contributed by Lukas
Moving beyond the temporary "workbench" of individual sessions, episode 3 of our AI Agents series unlocks the power of Memory—the mechanis...
The Gemini Interactions API
16 Dec 2025
Contributed by Lukas
The new Gemini interactions API unifies modern LLM requirements, moving past older stateless APIs to fully embrace agents and complex workflows. This ...
Google - 5 days: Agent Tools
16 Dec 2025
Contributed by Lukas
In this episode in series AI Agents, we discuss the transformation of foundation models from static prediction engines into "Agentic AI" cap...
Google 5 days: Introduction to Agent
15 Dec 2025
Contributed by Lukas
Join us for the premiere of our series on AI Agents, exploring the paradigm shift from passive predictive models to autonomous systems capable of reas...
The Adoption and Usage of AI Agents: Early Evidence from Perplexity
13 Dec 2025
Contributed by Lukas
Explore the emerging "year of agentic AI" through the first large-scale field study of Perplexity’s Comet browser, which analyzes millions...
Monetizing AI: Pricing Strategies and Experimentation
10 Dec 2025
Contributed by Lukas
Monetizing AI is uniquely challenging, driven by rapid cost changes and intense customer demand for demonstrable ROI. We discuss essential frameworks,...
The 2026 State of AI Agents in Production - report from Anthropic
10 Dec 2025
Contributed by Lukas
The era of production AI agents is here. Drawing from the "2026 State of AI Agents Report," we examine how enterprises are shifting from bas...
Agents to Skills: Building Expertise with Procedural Knowledge
10 Dec 2025
Contributed by Lukas
Join us as Anthropic experts reveal why they stopped building agents and started building specialized "skills" instead, arguing that while g...
The Renaissance Developer - Dr. Werner at AWS re:Invent 2025
05 Dec 2025
Contributed by Lukas
Dr. Werner Vogels delivered his final AWS re:Invent keynote in 2025, announcing his departure from the platform for new speakers. He assured developer...
The RPI workflow (Research, Plan, Implement) - for advanced AI Coding Agent
04 Dec 2025
Contributed by Lukas
Dive into the world of Advanced Context Engineering, tackling the challenges of AI-generated "slop" and high rework rates often found when u...
The complete IDE workflow for AI-driven development - the BMAD method
29 Nov 2025
Contributed by Lukas
Welcome to the ultimate BMAD Method masterclass, demonstrating the complete IDE workflow for AI-driven development using Claude Code. This episode wal...
Weaponizing AI: The Rise of Autonomous Cyber Attacks
19 Nov 2025
Contributed by Lukas
Autonomous AI agents execute hyper-personalized phishing and polymorphic malware attacks at unprecedented speed and scale. This acceleration creates a...
MAKER: Million-Step LLM Tasks with Zero Errors
19 Nov 2025
Contributed by Lukas
Large Language Models typically fail long-horizon tasks due to persistent error rates, often derailing after only a few hundred steps. This episode ex...
From Context Engineering to AI Agent Harnesses
14 Nov 2025
Contributed by Lukas
Lance Martin of Langchain will discuss the shift in AI from model training to orchestrating powerful LLMs and computing primitives via a new software ...
First AI-Orchestrated Cyber Espionage Campaign Disrupted
13 Nov 2025
Contributed by Lukas
State-sponsored group GTG-1002 executed the first reported cyber espionage campaign largely run by autonomous AI, fundamentally shifting the threat la...
Sam Altman on the future of AI and its massive impact on society
11 Nov 2025
Contributed by Lukas
Join us for a candid conversation with OpenAI CEO Sam Altman on the future of AI and its massive impact on society. Altman explains why AI is the most...
🧠 Supervised Reinforcement Learning for Step-wise Reasoning
11 Nov 2025
Contributed by Lukas
Large Language Models often struggle with complex, multi-step reasoning where traditional Supervised Fine-Tuning (SFT) and Reinforcement Learning (RLV...
Kimi K2: the current Leading Open-Weight Agentic Model
09 Nov 2025
Contributed by Lukas
Moonshot AI's Kimi K2 Thinking is changing the global LLM landscape, as this 1-trillion parameter open-weight model challenges the performance of ...
AI Vision of the Future: An Expert Panel Discussion
08 Nov 2025
Contributed by Lukas
Join AI pioneers and 2025 Queen Elizabeth Prize winners, including Jensen Huang, Geoffrey Hinton, and Yann LeCun, as they share the personal "aha...
Creating Claude Code: Agent Design and Product Philosophy
07 Nov 2025
Contributed by Lukas
Join the engineers who built Claude Code to explore their counterintuitive decision to ditch the IDE for a terminal-first experience. They reveal how ...
Context Engineering 2.0: The Context of Context Engineering
04 Nov 2025
Contributed by Lukas
Context Engineering (CE) is the systematic process designed to bridge the cognitive gap between human intent and machine understanding by optimizing c...
⚡ Agent Lightning: Reinforcement Learning for Any AI Agent
04 Nov 2025
Contributed by Lukas
Agent Lightning introduces a revolutionary approach to optimizing AI agents by fully decoupling Reinforcement Learning (RL) training from agent execut...
🛡️ Breaking Agent Backbones: Evaluating LLM Security in AI Agents
31 Oct 2025
Contributed by Lukas
Breaking Agent Backbones: AI agents are being deployed at scale, but their security is challenged by non-deterministic behavior and novel vulnerabilit...
🚀 OpenAI's Future: Research, Product, and Infrastructure Vision
30 Oct 2025
Contributed by Lukas
In this episode, OpenAI leaders share unprecedented transparency regarding their research goals, aiming for a fully automated AI researcher by March 2...
GitHub Universe 2025: Agent HQ, The Agent Workflow
30 Oct 2025
Contributed by Lukas
Welcome to the new era of coding collaboration: Agent HQ is here, establishing GitHub as the centralized home for developers and a fleet of AI coding ...
Jensen Huang - NVIDIA - Keynote 10/2025
29 Oct 2025
Contributed by Lukas
We delve into Jensen Huang's vision that Artificial Intelligence marks the New Industrial Revolution, positioning it as essential national infrast...
Perplexity at Work: A Guide to Getting More Done
29 Oct 2025
Contributed by Lukas
The modern workplace often buries professionals under context switching and scattered technology, hindering the productivity gains promised by AI. Thi...
Context Engineering for AI Agents - from LangChain vs Manus
28 Oct 2025
Contributed by Lukas
Join Lance from LangChain and Pete from Manus as they dive deep into the crucial discipline of Context Engineering for building effective AI agents. T...
💻 A Survey of Vibe Coding with LLMs
27 Oct 2025
Contributed by Lukas
Welcome to an essential discussion on Vibe Coding, the new paradigm where developers shift from writing code line-by-line to orchestrating and validat...
AI Adoption, Productivity, and System Thinking - from the interview with Huyen Chip
24 Oct 2025
Contributed by Lukas
Chip Huyen, author of AI Engineering and AI strategy expert from NVIDIA and Netflix, breaks down the technical basics of building successful AI produc...
The Hidden Dangers of Browsing AI Agents
23 Oct 2025
Contributed by Lukas
In the hype of ChatGPT Atlas, lets talk about the darkside of Browsing AI Agents
🤏 DeepSeek-OCR: Contexts Optical Compression
21 Oct 2025
Contributed by Lukas
Welcome to the show, where we discuss DeepSeek-OCR and its investigation into using optical 2D mapping for contexts compression, addressing the comput...
Claude Skills: Standard Operating Procedures for Agents
18 Oct 2025
Contributed by Lukas
This episode explores Anthropic's revolutionary 'Skills,' a new way to implement Standard Operating Procedures (SOPs) for LLM agents, ensu...
Self-Adapting Language Models (SEAL)
14 Oct 2025
Contributed by Lukas
**SEAL, the Self-Adapting Language Model framework, is revolutionizing how LLMs learn by enabling them to generate their own finetuning data and updat...
Training-Free Group Relative Policy Optimization for LLM Agents
13 Oct 2025
Contributed by Lukas
Are expensive Large Language Model (LLM) fine-tuning methods holding back your specialized agents, demanding massive computational resources and data?...
OpenAI's Vision: AGI, Sora, and Bottlenecks
10 Oct 2025
Contributed by Lukas
Join us for a deep dive with Greg Brockman on the future of AI, where he reveals the internal struggle ("pain and suffering") of managing co...
Agentic Context Engineering: Evolving Contexts for LLMs
10 Oct 2025
Contributed by Lukas
Tune in as we explore Agentic Context Engineering (ACE), a novel framework designed to overcome limitations like "brevity bias" and "co...
Less is More: Recursive Reasoning with Tiny Networks
08 Oct 2025
Contributed by Lukas
This episode explores the Tiny Recursive Model (TRM), a novel approach that leverages a single, tiny network (as small as 7M parameters) to tackle har...
Understanding the 4 Main Approaches to LLM Evaluation - from Sebastian Raschka
08 Oct 2025
Contributed by Lukas
Demystify Large Language Model (LLM) evaluation, breaking down the four main methods used to compare models: multiple-choice benchmarks, verifiers, le...
OpenAI DevDay 2025: Agents, Apps, and GPT-5 Pro
07 Oct 2025
Contributed by Lukas
OpenAI DevDay 2025 marked the start of the "agentic era" of software development, focusing on making it "easier to build with AI" ...
Self-Supervised Learning and the Future of AI - from a lecture given by Yann LeCun
07 Oct 2025
Contributed by Lukas
Join us as Turing Award recipient Yann LeCun, Chief Scientist at Meta, critiques the state of AI, arguing that current systems, including Large Langua...
Skill erosion, where relying on intelligent systems creates an "illusion of mastery" while core competence fades
05 Oct 2025
Contributed by Lukas
Are smart machines making us forget how to think? This episode dives into the quiet phenomenon of AI-induced skill erosion, where relying on intellige...
The Essential Startup Guide to Building AI Agents with Google
29 Sep 2025
Contributed by Lukas
AI agents represent a paradigm shift in software engineering, but moving a promising prototype to a production-ready system presents a new set of chal...
LIMI: Less Is More for Intelligent Agency
25 Sep 2025
Contributed by Lukas
In the race to build AI that can not just think, but work as an autonomous agent, the prevailing wisdom has been that more data is always better. This...
AI Adoption: Claude and ChatGPT Usage Patterns
24 Sep 2025
Contributed by Lukas
This episode delves into the unprecedented speed of AI adoption, which has outpaced historical technologies like the internet and personal computers. ...
Teaching LLMs to Plan: Logical Chain-of-Thought Instruction Tuning for Symbolic Planning
24 Sep 2025
Contributed by Lukas
While Large Language Models excel at creative tasks, they often struggle with the logical precision required for symbolic planning. This episode explo...
⚖️ Self-Consistency Improves Chain-of-Thought Reasoning in LMs
22 Sep 2025
Contributed by Lukas
In this episode, we explore self-consistency, a novel strategy that significantly improves how large language models perform complex reasoning. The me...