Build Wiz AI Show
Episodes
AI Consulting in Practice
19 Dec 2025
Contributed by Lukas
This episode explores the rapid shift in enterprise AI adoption, highlighting how production-level agent deployment has surged as organizations move p...
Google - 5 days: Prototype to Production
19 Dec 2025
Contributed by Lukas
Join us as we tackle the "last mile" of AI Agents series, exploring the rigorous operational discipline required to transform fragile protot...
Google - 5 days: Agent Quality
18 Dec 2025
Contributed by Lukas
In this episode of our AI Agent series, we synthesize our previous discussions on evaluation frameworks and observability into a cohesive operational ...
Google - 5 days: Context Engineering: Sessions & Memory
17 Dec 2025
Contributed by Lukas
Moving beyond the temporary "workbench" of individual sessions, episode 3 of our AI Agents series unlocks the power of Memory—the mechanis...
The Gemini Interactions API
16 Dec 2025
Contributed by Lukas
The new Gemini interactions API unifies modern LLM requirements, moving past older stateless APIs to fully embrace agents and complex workflows. This ...
Google - 5 days: Agent Tools
16 Dec 2025
Contributed by Lukas
In this episode in series AI Agents, we discuss the transformation of foundation models from static prediction engines into "Agentic AI" cap...
Google 5 days: Introduction to Agent
15 Dec 2025
Contributed by Lukas
Join us for the premiere of our series on AI Agents, exploring the paradigm shift from passive predictive models to autonomous systems capable of reas...
The Adoption and Usage of AI Agents: Early Evidence from Perplexity
13 Dec 2025
Contributed by Lukas
Explore the emerging "year of agentic AI" through the first large-scale field study of Perplexity’s Comet browser, which analyzes millions...
Monetizing AI: Pricing Strategies and Experimentation
10 Dec 2025
Contributed by Lukas
Monetizing AI is uniquely challenging, driven by rapid cost changes and intense customer demand for demonstrable ROI. We discuss essential frameworks,...
The 2026 State of AI Agents in Production - report from Anthropic
10 Dec 2025
Contributed by Lukas
The era of production AI agents is here. Drawing from the "2026 State of AI Agents Report," we examine how enterprises are shifting from bas...
Agents to Skills: Building Expertise with Procedural Knowledge
10 Dec 2025
Contributed by Lukas
Join us as Anthropic experts reveal why they stopped building agents and started building specialized "skills" instead, arguing that while g...
The Renaissance Developer - Dr. Werner at AWS re:Invent 2025
05 Dec 2025
Contributed by Lukas
Dr. Werner Vogels delivered his final AWS re:Invent keynote in 2025, announcing his departure from the platform for new speakers. He assured developer...
The RPI workflow (Research, Plan, Implement) - for advanced AI Coding Agent
04 Dec 2025
Contributed by Lukas
Dive into the world of Advanced Context Engineering, tackling the challenges of AI-generated "slop" and high rework rates often found when u...
The complete IDE workflow for AI-driven development - the BMAD method
29 Nov 2025
Contributed by Lukas
Welcome to the ultimate BMAD Method masterclass, demonstrating the complete IDE workflow for AI-driven development using Claude Code. This episode wal...
Weaponizing AI: The Rise of Autonomous Cyber Attacks
19 Nov 2025
Contributed by Lukas
Autonomous AI agents execute hyper-personalized phishing and polymorphic malware attacks at unprecedented speed and scale. This acceleration creates a...
MAKER: Million-Step LLM Tasks with Zero Errors
19 Nov 2025
Contributed by Lukas
Large Language Models typically fail long-horizon tasks due to persistent error rates, often derailing after only a few hundred steps. This episode ex...
From Context Engineering to AI Agent Harnesses
14 Nov 2025
Contributed by Lukas
Lance Martin of Langchain will discuss the shift in AI from model training to orchestrating powerful LLMs and computing primitives via a new software ...
First AI-Orchestrated Cyber Espionage Campaign Disrupted
13 Nov 2025
Contributed by Lukas
State-sponsored group GTG-1002 executed the first reported cyber espionage campaign largely run by autonomous AI, fundamentally shifting the threat la...
Sam Altman on the future of AI and its massive impact on society
11 Nov 2025
Contributed by Lukas
Join us for a candid conversation with OpenAI CEO Sam Altman on the future of AI and its massive impact on society. Altman explains why AI is the most...
🧠 Supervised Reinforcement Learning for Step-wise Reasoning
11 Nov 2025
Contributed by Lukas
Large Language Models often struggle with complex, multi-step reasoning where traditional Supervised Fine-Tuning (SFT) and Reinforcement Learning (RLV...
Kimi K2: the current Leading Open-Weight Agentic Model
09 Nov 2025
Contributed by Lukas
Moonshot AI's Kimi K2 Thinking is changing the global LLM landscape, as this 1-trillion parameter open-weight model challenges the performance of ...
AI Vision of the Future: An Expert Panel Discussion
08 Nov 2025
Contributed by Lukas
Join AI pioneers and 2025 Queen Elizabeth Prize winners, including Jensen Huang, Geoffrey Hinton, and Yann LeCun, as they share the personal "aha...
Creating Claude Code: Agent Design and Product Philosophy
07 Nov 2025
Contributed by Lukas
Join the engineers who built Claude Code to explore their counterintuitive decision to ditch the IDE for a terminal-first experience. They reveal how ...
Context Engineering 2.0: The Context of Context Engineering
04 Nov 2025
Contributed by Lukas
Context Engineering (CE) is the systematic process designed to bridge the cognitive gap between human intent and machine understanding by optimizing c...
⚡ Agent Lightning: Reinforcement Learning for Any AI Agent
04 Nov 2025
Contributed by Lukas
Agent Lightning introduces a revolutionary approach to optimizing AI agents by fully decoupling Reinforcement Learning (RL) training from agent execut...
🛡️ Breaking Agent Backbones: Evaluating LLM Security in AI Agents
31 Oct 2025
Contributed by Lukas
Breaking Agent Backbones: AI agents are being deployed at scale, but their security is challenged by non-deterministic behavior and novel vulnerabilit...
🚀 OpenAI's Future: Research, Product, and Infrastructure Vision
30 Oct 2025
Contributed by Lukas
In this episode, OpenAI leaders share unprecedented transparency regarding their research goals, aiming for a fully automated AI researcher by March 2...
GitHub Universe 2025: Agent HQ, The Agent Workflow
30 Oct 2025
Contributed by Lukas
Welcome to the new era of coding collaboration: Agent HQ is here, establishing GitHub as the centralized home for developers and a fleet of AI coding ...
Jensen Huang - NVIDIA - Keynote 10/2025
29 Oct 2025
Contributed by Lukas
We delve into Jensen Huang's vision that Artificial Intelligence marks the New Industrial Revolution, positioning it as essential national infrast...
Perplexity at Work: A Guide to Getting More Done
29 Oct 2025
Contributed by Lukas
The modern workplace often buries professionals under context switching and scattered technology, hindering the productivity gains promised by AI. Thi...
Context Engineering for AI Agents - from LangChain vs Manus
28 Oct 2025
Contributed by Lukas
Join Lance from LangChain and Pete from Manus as they dive deep into the crucial discipline of Context Engineering for building effective AI agents. T...
💻 A Survey of Vibe Coding with LLMs
27 Oct 2025
Contributed by Lukas
Welcome to an essential discussion on Vibe Coding, the new paradigm where developers shift from writing code line-by-line to orchestrating and validat...
AI Adoption, Productivity, and System Thinking - from the interview with Huyen Chip
24 Oct 2025
Contributed by Lukas
Chip Huyen, author of AI Engineering and AI strategy expert from NVIDIA and Netflix, breaks down the technical basics of building successful AI produc...
The Hidden Dangers of Browsing AI Agents
23 Oct 2025
Contributed by Lukas
In the hype of ChatGPT Atlas, lets talk about the darkside of Browsing AI Agents
🤏 DeepSeek-OCR: Contexts Optical Compression
21 Oct 2025
Contributed by Lukas
Welcome to the show, where we discuss DeepSeek-OCR and its investigation into using optical 2D mapping for contexts compression, addressing the comput...
Claude Skills: Standard Operating Procedures for Agents
18 Oct 2025
Contributed by Lukas
This episode explores Anthropic's revolutionary 'Skills,' a new way to implement Standard Operating Procedures (SOPs) for LLM agents, ensu...
Self-Adapting Language Models (SEAL)
14 Oct 2025
Contributed by Lukas
**SEAL, the Self-Adapting Language Model framework, is revolutionizing how LLMs learn by enabling them to generate their own finetuning data and updat...
Training-Free Group Relative Policy Optimization for LLM Agents
13 Oct 2025
Contributed by Lukas
Are expensive Large Language Model (LLM) fine-tuning methods holding back your specialized agents, demanding massive computational resources and data?...
OpenAI's Vision: AGI, Sora, and Bottlenecks
10 Oct 2025
Contributed by Lukas
Join us for a deep dive with Greg Brockman on the future of AI, where he reveals the internal struggle ("pain and suffering") of managing co...
Agentic Context Engineering: Evolving Contexts for LLMs
10 Oct 2025
Contributed by Lukas
Tune in as we explore Agentic Context Engineering (ACE), a novel framework designed to overcome limitations like "brevity bias" and "co...
Less is More: Recursive Reasoning with Tiny Networks
08 Oct 2025
Contributed by Lukas
This episode explores the Tiny Recursive Model (TRM), a novel approach that leverages a single, tiny network (as small as 7M parameters) to tackle har...
Understanding the 4 Main Approaches to LLM Evaluation - from Sebastian Raschka
08 Oct 2025
Contributed by Lukas
Demystify Large Language Model (LLM) evaluation, breaking down the four main methods used to compare models: multiple-choice benchmarks, verifiers, le...
OpenAI DevDay 2025: Agents, Apps, and GPT-5 Pro
07 Oct 2025
Contributed by Lukas
OpenAI DevDay 2025 marked the start of the "agentic era" of software development, focusing on making it "easier to build with AI" ...
Self-Supervised Learning and the Future of AI - from a lecture given by Yann LeCun
07 Oct 2025
Contributed by Lukas
Join us as Turing Award recipient Yann LeCun, Chief Scientist at Meta, critiques the state of AI, arguing that current systems, including Large Langua...
Skill erosion, where relying on intelligent systems creates an "illusion of mastery" while core competence fades
05 Oct 2025
Contributed by Lukas
Are smart machines making us forget how to think? This episode dives into the quiet phenomenon of AI-induced skill erosion, where relying on intellige...
The Essential Startup Guide to Building AI Agents with Google
29 Sep 2025
Contributed by Lukas
AI agents represent a paradigm shift in software engineering, but moving a promising prototype to a production-ready system presents a new set of chal...
LIMI: Less Is More for Intelligent Agency
25 Sep 2025
Contributed by Lukas
In the race to build AI that can not just think, but work as an autonomous agent, the prevailing wisdom has been that more data is always better. This...
AI Adoption: Claude and ChatGPT Usage Patterns
24 Sep 2025
Contributed by Lukas
This episode delves into the unprecedented speed of AI adoption, which has outpaced historical technologies like the internet and personal computers. ...
Teaching LLMs to Plan: Logical Chain-of-Thought Instruction Tuning for Symbolic Planning
24 Sep 2025
Contributed by Lukas
While Large Language Models excel at creative tasks, they often struggle with the logical precision required for symbolic planning. This episode explo...
⚖️ Self-Consistency Improves Chain-of-Thought Reasoning in LMs
22 Sep 2025
Contributed by Lukas
In this episode, we explore self-consistency, a novel strategy that significantly improves how large language models perform complex reasoning. The me...
Economic Index Report by Anthropic - 09/2025: Uneven Global and Enterprise AI Adoption
17 Sep 2025
Contributed by Lukas
AI is being adopted at a record-breaking pace, far exceeding previous technologies like the internet. However, this new report reveals that the AI rev...
🤔 How People Use ChatGPT - from OpenAI Report
17 Sep 2025
Contributed by Lukas
This episode unpacks groundbreaking research into how hundreds of millions of people are actually using ChatGPT. Contrary to popular belief, non-work-...
LLM Interview Questions: A Comprehensive Guide
15 Sep 2025
Contributed by Lukas
Welcome to "AI Unpacked," your guide to the fascinating world of Large Language Models! In this episode, we'll break down the core conc...
Sam Altman & Khosla Ventures - AI: Evolution, Disruption, and the Future of Work
11 Sep 2025
Contributed by Lukas
In this insightful episode, Sam Altman and Vinod Khosla delve into the world beyond 2035, discussing the astonishing rate of technological change driv...
Distilling Step-by-Step: Outperforming LLMs with Less Data
08 Sep 2025
Contributed by Lukas
Join us as we explore LLM knowledge distillation, a groundbreaking technique that compresses powerful language models into efficient, task-specific ve...
😵💫 Why Language Models Hallucinate
07 Sep 2025
Contributed by Lukas
In this episode, we delve into why language models "hallucinate," generating plausible yet incorrect information instead of admitting uncert...
Attention Is All You Need
05 Sep 2025
Contributed by Lukas
Join us as we unpack "Attention Is All You Need," a pivotal paper introducing the Transformer, a novel neural network architecture. This gro...
LoRA: Low-Rank Adaptation of Large Language Models
04 Sep 2025
Contributed by Lukas
In this episode, we dive into LoRA, a groundbreaking technique that makes fine-tuning massive language models like GPT-3 more accessible and efficient...
The Ultimate Guide to Fine-Tuning LLMs
02 Sep 2025
Contributed by Lukas
Welcome to a deep dive into Large Language Model fine-tuning, covering everything from foundational concepts to cutting-edge advancements. This episod...
Compressing Large Language Models
01 Sep 2025
Contributed by Lukas
Large Language Models offer incredible power, but their immense scale creates significant deployment challenges in resource-constrained environments. ...
The Enterprise AI Divide: Adoption, Failure, and Future Trends
27 Aug 2025
Contributed by Lukas
Join us as we dissect the viral MIT NANDA 'GenAI Divide' report, which controversially claims 95% of enterprise generative AI pilots deliver n...
AI's Rapid Ascent: MacroHard, Meta's Midjourney, and Sentient Concerns
26 Aug 2025
Contributed by Lukas
Welcome to our latest episode, where we dive into Elon Musk's XAI project, MacroHard, an ambitious end-to-end neural network operating system, and...
Task-in-Prompt (TIP) adversarial attacks
25 Aug 2025
Contributed by Lukas
Tune into our latest episode where we dive deep into Task-in-Prompt (TIP) adversarial attacks, a novel class of jailbreaks that cleverly embed sequenc...
Prompt Engineering: Still Essential – The Comprehensive Guide to AI Mastery
18 Aug 2025
Contributed by Lukas
In this episode, we'll explore why prompt engineering is a critical and essential skill for anyone looking to harness the power of AI. Discover ho...
Andrew NG: Building Faster Startups with AI
17 Aug 2025
Contributed by Lukas
Join us for an insightful episode where Andrew Ng shares lessons from AI Fund on building startups faster with AI. Discover how new AI technologies en...
LightRAG: Graph-Enhanced Retrieval-Augmented Generation for LLMs
16 Aug 2025
Contributed by Lukas
Tune in to explore LightRAG, an innovative system designed to overcome the limitations of traditional Retrieval-Augmented Generation. Discover how it ...
Large Language Models (LLMs) in Cybersecurity
10 Aug 2025
Contributed by Lukas
Join us as we explore the dual-edged sword of Large Language Models (LLMs) in cybersecurity. This episode delves into how LLMs are revolutionizing thr...
Fine-Tuning Large Language Models
09 Aug 2025
Contributed by Lukas
Tune into our episode on Large Language Models, where we explore the intricate world of fine-tuning techniques like QLoRA, RAFT, and RLHF for speciali...
Foundations of Large Language Models
08 Aug 2025
Contributed by Lukas
Join us as we explore the foundational concepts of Large Language Models (LLMs), a revolutionary advancement in artificial intelligence. Discover how ...
GPT-5: The Future of AI
08 Aug 2025
Contributed by Lukas
Welcome to this special episode where we unpack OpenAI's latest leap, GPT-5, hailed as an AI that feels like conversing with a PhD-level expert on...
Small Language Models are the Future of Agentic AI
05 Aug 2025
Contributed by Lukas
Join us as we explore the booming world of agentic AI and challenge the status quo of Large Language Models (LLMs). We'll discuss why Small Langua...
Deep Agents: Architectures for Advanced AI Performance
04 Aug 2025
Contributed by Lukas
Welcome to a new episode where we dive into the fascinating world of "deep agents," an advanced form of LLM-based agents capable of planning...
The Relentless Vision of Dario Amodei and Anthropic
04 Aug 2025
Contributed by Lukas
Join us as we explore the captivating journey of Dario Amodei, the outspoken CEO of Anthropic, as he navigates the high-stakes world of artificial int...
Perplexity CEO: AI's Impact on Search, Browsers, and Jobs
18 Jul 2025
Contributed by Lukas
Join us as we explore Perplexity's bold move to launch the Comet browser, a strategic decision driven by the ambition to own the AI agent workflow...
Context Engineering with the PRP Framework
17 Jul 2025
Contributed by Lukas
Welcome to an essential episode on Context Engineering, the strategy separating real AI coding results from "vibe coding". We'll introdu...
How to build an AI Agent
16 Jul 2025
Contributed by Lukas
Welcome to "The Agent Builder's Blueprint"! In this episode, we'll demystify the process of building AI agents, guiding you from a mere idea to real-w...
Navigating the Superintelligence Race
08 Jul 2025
Contributed by Lukas
In this episode, Dylan Patel, a leading AI expert, dissects the current state of artificial intelligence among tech giants. We uncover the organizatio...
Founding Groq and the Future of AI
07 Jul 2025
Contributed by Lukas
Join us for an insightful conversation with Jonathan Ross, founder and CEO of Groq, as he shares his journey from inventing Google's TPU to creati...
AI's Watershed: June 2025 Breakthroughs
01 Jul 2025
Contributed by Lukas
Welcome to a special episode exploring June 2025, a true watershed moment in AI development. We'll dive into groundbreaking announcements like the...
Context Engineering
30 Jun 2025
Contributed by Lukas
In this episode, we unpack Context Engineering, the cutting-edge approach that's being hailed as 10x better than prompt engineering and 100x bette...
Becoming an AI-First Company: A Strategic Guide - BOX White paper
26 Jun 2025
Contributed by Lukas
Welcome to the show! In this episode, we explore the transformative journey of becoming an AI-first company, moving beyond mere automation to a comple...
Software's Evolution: From Code to AI Operating Systems - Andrej Karpathy
19 Jun 2025
Contributed by Lukas
Andrej Karpathy's presentation explores the evolving landscape of software development, introducing the concepts of Software 1.0 (traditional co...
The Agent Development Life Cycle
19 Jun 2025
Contributed by Lukas
Tune into this episode as we explore Sierra's innovative Agent Development Life Cycle (ADLC), a robust process for building and continuously impro...
Small vs. Large AI Models: Trade-offs and Use Cases
16 Jun 2025
Contributed by Lukas
Join us as we explore the fascinating world of Large Language Models (LLMs), delving into the significant challenges of efficient inference driven by ...
GenAI: Skills, Markets, and Models
12 Jun 2025
Contributed by Lukas
Step into the fast-evolving world of AI with our latest episode, where we explore the new breed of GenAI Application Engineers and the groundbreaking ...
Apple WWDC 2025: Intelligence, Design, and Evolution
09 Jun 2025
Contributed by Lukas
Welcome to our special episode covering Apple's WWDC 2025! We dive deep into the groundbreaking Apple Intelligence and the stunning new Liquid Gla...
The Prompt Engineering Handbook
04 Jun 2025
Contributed by Lukas
In this episode, we break down Prompt Engineering, revealing awesome strategies to get the most out of large language models by designing high-quality...
Darwin Gödel Machine: Open-Ended AI Evolution
01 Jun 2025
Contributed by Lukas
Tune into our latest episode to learn about the Darwin Gödel Machine (DGM), a novel AI system designed to autonomously and continuously improve itsel...
Master Claude Code - Practical Tips and Tricks
01 Jun 2025
Contributed by Lukas
This episode delves into Claude Code, an agentic AI assistant designed to help engineers build features, write entire files, and fix bugs by working s...
Andrew Ng: State of AI Agents
01 Jun 2025
Contributed by Lukas
Join us for a deep dive into the state of AI agents, shifting from debates about what is an agent to understanding degrees of "agenticness"....
Sergey Brin on the Future of AI and Gemini
24 May 2025
Contributed by Lukas
Join us for a special episode featuring Sergey Brin discussing Google's latest AI advancements unveiled at Google I/O. He shares insights into the...
Google I/O '25 Developer Keynote
22 May 2025
Contributed by Lukas
This episode explores the latest developer tools and AI advancements from Google I/O '25, highlighting how developers can build with Gemini across...
Code with Claude Opening Keynote
22 May 2025
Contributed by Lukas
Join us as we break down the announcements from Anthropic's first-ever Code with Claude developer conference. We explore the launch of the powerfu...
Google I/O 2025 AI Stage: Day 1 Highlights
21 May 2025
Contributed by Lukas
Join Google DeepMind CEO Demis Hassabis and Google co-founder Sergey Brin as they discuss the frontiers of AI, the path to AGI, and the importance of ...
Microsoft Build 25 Keynote
20 May 2025
Contributed by Lukas
In this episode, we break down the key announcements from Microsoft Build 2025, focusing on the major platform shift towards the open agentic web. We ...
Google IO 25
20 May 2025
Contributed by Lukas
Welcome to a special episode covering Google I/O '25, where major AI advancements were unveiled. We discuss powerful new Gemini models, AI integra...
Log Anomaly Detection with LogLLaMA and RL
16 May 2025
Contributed by Lukas
Welcome to the podcast! Today, we dive into the crucial world of system logs and how to spot trouble. We'll explore LogLLaMA, a new framework buil...
CySecBERT: Domain-Adapted Language Model for Cybersecurity
15 May 2025
Contributed by Lukas
Join us to explore CySecBERT, a language model specifically designed for the cybersecurity domain. Learn how it overcomes limitations of general model...
GitHub Engineering Success Playbook
13 May 2025
Contributed by Lukas
Step into the world of driving measurable engineering improvements with the GitHub Engineering System Success Playbook (ESSP). This episode explores h...
Building AI Agents with a 7-Node Blueprint
12 May 2025
Contributed by Lukas
Unlock the secrets to building robust AI agents with the seven node blueprint, a powerful mental model that breaks down complexity. Discover how think...