Build Wiz AI Show
Episodes
LightRAG: Graph-Enhanced Retrieval-Augmented Generation for LLMs
16 Aug 2025
Contributed by Lukas
Tune in to explore LightRAG, an innovative system designed to overcome the limitations of traditional Retrieval-Augmented Generation. Discover how it ...
Large Language Models (LLMs) in Cybersecurity
10 Aug 2025
Contributed by Lukas
Join us as we explore the dual-edged sword of Large Language Models (LLMs) in cybersecurity. This episode delves into how LLMs are revolutionizing thr...
Fine-Tuning Large Language Models
09 Aug 2025
Contributed by Lukas
Tune into our episode on Large Language Models, where we explore the intricate world of fine-tuning techniques like QLoRA, RAFT, and RLHF for speciali...
Foundations of Large Language Models
08 Aug 2025
Contributed by Lukas
Join us as we explore the foundational concepts of Large Language Models (LLMs), a revolutionary advancement in artificial intelligence. Discover how ...
GPT-5: The Future of AI
08 Aug 2025
Contributed by Lukas
Welcome to this special episode where we unpack OpenAI's latest leap, GPT-5, hailed as an AI that feels like conversing with a PhD-level expert on...
Small Language Models are the Future of Agentic AI
05 Aug 2025
Contributed by Lukas
Join us as we explore the booming world of agentic AI and challenge the status quo of Large Language Models (LLMs). We'll discuss why Small Langua...
Deep Agents: Architectures for Advanced AI Performance
04 Aug 2025
Contributed by Lukas
Welcome to a new episode where we dive into the fascinating world of "deep agents," an advanced form of LLM-based agents capable of planning...
The Relentless Vision of Dario Amodei and Anthropic
04 Aug 2025
Contributed by Lukas
Join us as we explore the captivating journey of Dario Amodei, the outspoken CEO of Anthropic, as he navigates the high-stakes world of artificial int...
Perplexity CEO: AI's Impact on Search, Browsers, and Jobs
18 Jul 2025
Contributed by Lukas
Join us as we explore Perplexity's bold move to launch the Comet browser, a strategic decision driven by the ambition to own the AI agent workflow...
Context Engineering with the PRP Framework
17 Jul 2025
Contributed by Lukas
Welcome to an essential episode on Context Engineering, the strategy separating real AI coding results from "vibe coding". We'll introdu...
How to build an AI Agent
16 Jul 2025
Contributed by Lukas
Welcome to "The Agent Builder's Blueprint"! In this episode, we'll demystify the process of building AI agents, guiding you from a mere idea to real-w...
Navigating the Superintelligence Race
08 Jul 2025
Contributed by Lukas
In this episode, Dylan Patel, a leading AI expert, dissects the current state of artificial intelligence among tech giants. We uncover the organizatio...
Founding Groq and the Future of AI
07 Jul 2025
Contributed by Lukas
Join us for an insightful conversation with Jonathan Ross, founder and CEO of Groq, as he shares his journey from inventing Google's TPU to creati...
AI's Watershed: June 2025 Breakthroughs
01 Jul 2025
Contributed by Lukas
Welcome to a special episode exploring June 2025, a true watershed moment in AI development. We'll dive into groundbreaking announcements like the...
Context Engineering
30 Jun 2025
Contributed by Lukas
In this episode, we unpack Context Engineering, the cutting-edge approach that's being hailed as 10x better than prompt engineering and 100x bette...
Becoming an AI-First Company: A Strategic Guide - BOX White paper
26 Jun 2025
Contributed by Lukas
Welcome to the show! In this episode, we explore the transformative journey of becoming an AI-first company, moving beyond mere automation to a comple...
Software's Evolution: From Code to AI Operating Systems - Andrej Karpathy
19 Jun 2025
Contributed by Lukas
Andrej Karpathy's presentation explores the evolving landscape of software development, introducing the concepts of Software 1.0 (traditional co...
The Agent Development Life Cycle
19 Jun 2025
Contributed by Lukas
Tune into this episode as we explore Sierra's innovative Agent Development Life Cycle (ADLC), a robust process for building and continuously impro...
Small vs. Large AI Models: Trade-offs and Use Cases
16 Jun 2025
Contributed by Lukas
Join us as we explore the fascinating world of Large Language Models (LLMs), delving into the significant challenges of efficient inference driven by ...
GenAI: Skills, Markets, and Models
12 Jun 2025
Contributed by Lukas
Step into the fast-evolving world of AI with our latest episode, where we explore the new breed of GenAI Application Engineers and the groundbreaking ...
Apple WWDC 2025: Intelligence, Design, and Evolution
09 Jun 2025
Contributed by Lukas
Welcome to our special episode covering Apple's WWDC 2025! We dive deep into the groundbreaking Apple Intelligence and the stunning new Liquid Gla...
The Prompt Engineering Handbook
04 Jun 2025
Contributed by Lukas
In this episode, we break down Prompt Engineering, revealing awesome strategies to get the most out of large language models by designing high-quality...
Darwin Gödel Machine: Open-Ended AI Evolution
01 Jun 2025
Contributed by Lukas
Tune into our latest episode to learn about the Darwin Gödel Machine (DGM), a novel AI system designed to autonomously and continuously improve itsel...
Master Claude Code - Practical Tips and Tricks
01 Jun 2025
Contributed by Lukas
This episode delves into Claude Code, an agentic AI assistant designed to help engineers build features, write entire files, and fix bugs by working s...
Andrew Ng: State of AI Agents
01 Jun 2025
Contributed by Lukas
Join us for a deep dive into the state of AI agents, shifting from debates about what is an agent to understanding degrees of "agenticness"....
Sergey Brin on the Future of AI and Gemini
24 May 2025
Contributed by Lukas
Join us for a special episode featuring Sergey Brin discussing Google's latest AI advancements unveiled at Google I/O. He shares insights into the...
Google I/O '25 Developer Keynote
22 May 2025
Contributed by Lukas
This episode explores the latest developer tools and AI advancements from Google I/O '25, highlighting how developers can build with Gemini across...
Code with Claude Opening Keynote
22 May 2025
Contributed by Lukas
Join us as we break down the announcements from Anthropic's first-ever Code with Claude developer conference. We explore the launch of the powerfu...
Google I/O 2025 AI Stage: Day 1 Highlights
21 May 2025
Contributed by Lukas
Join Google DeepMind CEO Demis Hassabis and Google co-founder Sergey Brin as they discuss the frontiers of AI, the path to AGI, and the importance of ...
Microsoft Build 25 Keynote
20 May 2025
Contributed by Lukas
In this episode, we break down the key announcements from Microsoft Build 2025, focusing on the major platform shift towards the open agentic web. We ...
Google I/O '25
20 May 2025
Contributed by Lukas
Welcome to a special episode covering Google I/O '25, where major AI advancements were unveiled. We discuss powerful new Gemini models, AI integra...
Log Anomaly Detection with LogLLaMA and RL
16 May 2025
Contributed by Lukas
Welcome to the podcast! Today, we dive into the crucial world of system logs and how to spot trouble. We'll explore LogLLaMA, a new framework buil...
CySecBERT: Domain-Adapted Language Model for Cybersecurity
15 May 2025
Contributed by Lukas
Join us to explore CySecBERT, a language model specifically designed for the cybersecurity domain. Learn how it overcomes limitations of general model...
GitHub Engineering Success Playbook
13 May 2025
Contributed by Lukas
Step into the world of driving measurable engineering improvements with the GitHub Engineering System Success Playbook (ESSP). This episode explores h...
Building AI Agents with a 7-Node Blueprint
12 May 2025
Contributed by Lukas
Unlock the secrets to building robust AI agents with the seven node blueprint, a powerful mental model that breaks down complexity. Discover how think...
How AI Is Reinventing Software Business Models
12 May 2025
Contributed by Lukas
Join us as we explore how AI is fundamentally changing the landscape of software business models with Bret Taylor, co-founder of Sierra. We discuss th...
Sequoia AI Ascent 2025: The Trillion-Dollar Opportunity
11 May 2025
Contributed by Lukas
Join us for insights from the Sequoia AI Ascent 2025 Keynote, where Sequoia partners share their perspectives on the world of AI. They discuss the mar...
LSAST: LLM-Supported Static Application Security Testing
09 May 2025
Contributed by Lukas
Tired of traditional tools missing complex threats and AI facing privacy hurdles? This episode dives into LSAST, a novel approach integrating traditio...
MCP versus API: AI Agent Integration
07 May 2025
Contributed by Lukas
This episode explores how large language models extend their capabilities by interacting with external tools and data via APIs. We introduce the Model...
A Survey of AI Agent Protocols
04 May 2025
Contributed by Lukas
The rise of AI agents is changing industries, but they face a critical challenge: the lack of standardized communication protocols. This fragmentation...
Mem0: Scalable Long-Term Memory for AI Agents
01 May 2025
Contributed by Lukas
Tune in as we discuss Mem0 and Mem0g, innovative memory architectures designed to tackle the fundamental challenge of Large Language Models forgetting...
The Art and Science of Vibe Coding
26 Apr 2025
Contributed by Lukas
Explore how VibeCoding acknowledges programming as a deeply human activity influenced by psychology, environment, and emotional state. Learn how vibe ...
12 factor agents
25 Apr 2025
Contributed by Lukas
Welcome to the '12 Factor Agents' podcast! We're exploring how to build robust, production-ready AI agents by rethinking them as software ...
The AI Scientist: Automated Scientific Discovery
24 Apr 2025
Contributed by Lukas
Welcome to the show! Today, we dive into The AI Scientist, an automated framework designed for open-ended scientific discovery. Discover how it genera...
Thinking About Agent Frameworks: A Comprehensive Guide
23 Apr 2025
Contributed by Lukas
Welcome to our podcast episode! Building reliable agentic systems is a significant challenge, largely revolving around ensuring the LLM has the approp...
Google A2A: Protocol for Interoperable AI Agents
22 Apr 2025
Contributed by Lukas
Welcome to this episode where we delve into Google's revolutionary Agent-to-Agent (A2A) protocol, a new standard designed for AI agents to communi...
Scaling AI Use Cases: An Adoption Guide
20 Apr 2025
Contributed by Lukas
Welcome to our new episode, where we delve into the essential strategies for businesses looking to leverage artificial intelligence effectively. We...
AI in the Enterprise: Seven Lessons from Frontier Companies
19 Apr 2025
Contributed by Lukas
Welcome to our podcast on AI in the Enterprise! This episode dives into key lessons from leading companies on how to successfully adopt and leverage a...
Practical Guide to Building AI Agents
18 Apr 2025
Contributed by Lukas
Welcome to the podcast exploring the cutting-edge world of AI agents, intelligent systems capable of independently handling complex, multi-step tasks....
Lightweight KG Reasoning with Language Model Prompts
17 Apr 2025
Contributed by Lukas
Welcome to this episode where we delve into LightPROF, a novel framework designed to boost the reasoning abilities of Large Language Models (LLMs) usi...
Agentic Knowledgeable Self-awareness for Language Model Agents
16 Apr 2025
Contributed by Lukas
Welcome to this episode where we delve into the groundbreaking concept of agentic knowledgeable self-awareness for large language models, a paradigm s...
AI 2027: Preparing for Superintelligence
13 Apr 2025
Contributed by Lukas
Welcome to "AI Futures Now," where we dive deep into the next decade of artificial intelligence based on the AI 2027 report. This episode ex...
Chapter 5&6: All-In on AI - How Smart Companies Win Big with Artificial Intelligence
13 Apr 2025
Contributed by Lukas
Welcome back to the podcast! In this episode, we're diving deep into the essential AI capabilities that organizations need to build for success, e...
Chapter 3 and 4: All-In on AI - How Smart Companies Win Big with Artificial Intelligence
12 Apr 2025
Contributed by Lukas
Welcome to the podcast where we explore how to win big with artificial intelligence! Today, we dive into how AI can shape and transform business strat...
All-In on AI: How Smart Companies Win Big with Artificial Intelligence (chapter 1 & 2)
11 Apr 2025
Contributed by Lukas
Welcome to our podcast exploring the journey to becoming AI fueled! In our first episode, we delve into what it truly means for a company to be all-in...
Google Cloud Next 2025: The New Way to Cloud
10 Apr 2025
Contributed by Lukas
Welcome to the podcast where we dive deep into the revolutionary "new way to cloud" unveiled at Google Cloud Next. We'll explore how Goo...
AI's Growing Role in Software Development and the Future of Work
09 Apr 2025
Contributed by Lukas
Welcome to this episode where we delve into the rapidly evolving world of software development and the profound impact of Artificial Intelligence. We'...
AI Model and Security Developments: Amazon, Google, Meta, Microsoft
08 Apr 2025
Contributed by Lukas
Welcome to the AI Unpacked podcast! In this episode, we delve into the latest wave of artificial intelligence breakthroughs, from Amazon's new Nov...
AI Agents: Use Cases, Integration, and Business Impact
07 Apr 2025
Contributed by Lukas
Welcome to this episode where we'll dive into the revolutionary impact of AI agents across industries, exploring how they are automating tasks, en...
Decoding AI Agents: Infrastructure, Frameworks, and Market Trends
06 Apr 2025
Contributed by Lukas
Welcome to the show where we delve into the world of AI agents, exploring how these digital tools are streamlining operations and enhancing customer e...
LLM Security: Threats, Detection, and Mitigation Strategies
04 Apr 2025
Contributed by Lukas
Welcome to this week's episode where we dive into the fascinating and critical intersection of Large Language Models (LLMs) and cybersecurity, exp...
Operationalizing Generative AI with MLOps on Vertex AI
04 Apr 2025
Contributed by Lukas
Welcome to the show where we delve into the exciting world of operationalizing Generative AI on Vertex AI using MLOps, exploring how traditional machi...
LLMs for Domain-Specific Problem Solving
02 Apr 2025
Contributed by Lukas
Welcome to this episode where we delve into the fascinating world of domain-specific Large Language Models, exploring how they're revolutionizing ...
Vibe Coding: Setup, Advanced Tips, and Tricks
02 Apr 2025
Contributed by Lukas
Get ready to explore the world of vibe coding, a new approach to software development where AI takes the lead! This episode dives deep into the tools ...
Agents Companion: Building and Evaluating Generative AI Agents
01 Apr 2025
Contributed by Lukas
Welcome to this episode where we delve into the transformative world of generative AI agents, exploring their architecture, evaluation, and the shift ...
Generative AI Agents: Architecture, Tools, and Implementation
01 Apr 2025
Contributed by Lukas
In our latest podcast episode, we explore the world of Generative AI Agents, intelligent applications that go beyond standard models by using reasonin...
Embeddings and Vector Stores: A Comprehensive Guide
01 Apr 2025
Contributed by Lukas
This whitepaper explores embeddings, which are numerical representations of various data types like text and images, and vector stores, which are spec...
The Art and Science of Prompt Engineering
31 Mar 2025
Contributed by Lukas
"Prompt Engineering," authored by Lee Boonstra in September 2024, offers a comprehensive guide to crafting effective prompts for large lang...
Foundational Large Language Models and Text Generation
31 Mar 2025
Contributed by Lukas
This whitepaper provides a comprehensive overview of foundational large language models (LLMs) and text generation. It traces the evolution of trans...
Claude 3.7 Sonnet: Usage Patterns and Economic Insights
31 Mar 2025
Contributed by Lukas
Anthropic's report, the second from their Anthropic Economic Index, analyzes usage patterns of their updated Claude 3.7 Sonnet AI model following its ...
Qwen2.5-Omni: An End-to-End Multimodal Model
30 Mar 2025
Contributed by Lukas
Qwen2.5-Omni is a unified end-to-end multimodal model capable of perceiving text, images, audio, and video, while simultaneously generating text and n...
Knowledge Graph Enhanced Software Repair
29 Mar 2025
Contributed by Lukas
KGCompass is a novel approach for enhancing repository-level software repair by utilizing a repository-aware knowledge graph. This knowledge graph eff...
GitHub Copilot: Enhanced AI with Custom Instructions
28 Mar 2025
Contributed by Lukas
This episode covers how developers can create Markdown files, such as .github/copilot-instructions.md, to provide Copilot with specific context about their projects, cod...
Knowledge Workers and Large Language Models: Current and Future Use
27 Mar 2025
Contributed by Lukas
Welcome to our podcast on the groundbreaking impact of Large Language Models (LLMs) on knowledge work! We delve into a recent study surveying knowledg...
Chain-of-Tools: Reasoning with Massive Unseen Tools
26 Mar 2025
Contributed by Lukas
We explore Chain-of-Tools (CoTools), a groundbreaking tool learning method for frozen Large Language Models (LLMs). CoTools enables these models to ef...
Claude 3.5 Sonnet Achieves New SWE-bench Verified State-of-the-Art
25 Mar 2025
Contributed by Lukas
While newer models like Claude 3.7 Sonnet are already available, our latest podcast episode delves into the still-valuable insights from Claude 3.5 Son...
Transformers Without Normalization: Dynamic Tanh Achieves Strong Performance
24 Mar 2025
Contributed by Lukas
This podcast episode delves into the "Transformers without Normalization" paper, which introduces Dynamic Tanh (DyT) as a potential replacem...
Fin-R1: Financial Reasoning with a Lightweight Language Model
23 Mar 2025
Contributed by Lukas
In this podcast episode, we delve into Fin-R1, a groundbreaking large language model tailored for financial reasoning. Discover how this efficient 7 b...
Claude's "Think" Tool: Enhanced Complex Problem Solving
22 Mar 2025
Contributed by Lukas
The "think" tool provides Claude with a dedicated space for structured thinking during complex tasks, allowing it to pause and reflect, espe...
The Past, Present, and Future of AI for Developers
21 Mar 2025
Contributed by Lukas
This podcast episode dives into AI for application developers, inspired by Steve Sanderson's keynote. It journeys from early concepts like the Tu...
LLM Concepts Explained: Sampling, Fine-tuning, Sharding, LoRA
20 Mar 2025
Contributed by Lukas
This episode covers several key concepts and techniques essential for working with large language models (LLMs). It begins by explaining sampling, the probabilistic meth...
NVIDIA GTC 2025 Keynote: AI Factories and Accelerated Computing
19 Mar 2025
Contributed by Lukas
Jensen Huang's GTC March 2025 keynote showcases NVIDIA's advancements in AI and accelerated computing. Huang introduces their next-generation...
Agentic RAG: Intelligent Retrieval Augmented Generation
19 Mar 2025
Contributed by Lukas
This video from IBM Technology explains Agentic Retrieval Augmented Generation (RAG) as an advancement of the standard RAG pipeline. Traditional RA...
🎣 Phishing: Attacks and Top Cybersecurity Defense Strategies
18 Mar 2025
Contributed by Lukas
The provided YouTube transcript from IBM Technology's channel, "Phishing Defenses: Top Cybersecurity Strategies to Protect Your Data," ...
RAG vs. CAG: Augmenting AI Model Knowledge
18 Mar 2025
Contributed by Lukas
The YouTube video from IBM Technology explains two primary methods for augmenting the knowledge of large language models: Retrieval Augmented Generati...
LONGREPS: Reasoning Path Supervision for Long-Context Language Models
17 Mar 2025
Contributed by Lukas
The provided paper, "Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision," investigates the e...
Prompt Engineering for AI
16 Mar 2025
Contributed by Lukas
The provided article from DEV Community explains prompt engineering for artificial intelligence, emphasizing its importance in achieving better resul...
GraphFC: Graph-based Fact-Checking with Claim Decomposition
15 Mar 2025
Contributed by Lukas
The provided text introduces GraphFC, a new framework for fact-checking that converts claims into graph structures composed of subject-relation-obj...
LLM Agents: A Survey of Planning Approaches
14 Mar 2025
Contributed by Lukas
This survey examines the burgeoning field of large language models (LLMs) as planning modules for autonomous agents, offering the first systematic ov...
Anthropic's Model Context Protocol (MCP): Origins, Functionality, and Impact
13 Mar 2025
Contributed by Lukas
Anthropic's Model Context Protocol (MCP), introduced in late 2024, is presented as an open standard aiming to revolutionize how AI models intera...
🐳 Dockerizing AI: Model Context Protocol with Claude Desktop
12 Mar 2025
Contributed by Lukas
Docker is highlighted as an ideal solution for packaging and distributing Model Context Protocol (MCP) servers, which face challenges like environment...
Model Context Protocol: A QA Guide for AI Testing
12 Mar 2025
Contributed by Lukas
The primary text introduces the Model Context Protocol (MCP), a standardized approach for connecting AI models to external data sources, simplifying i...
Graph RAG: A Query-Focused Summarization Approach
12 Mar 2025
Contributed by Lukas
This research introduces Graph RAG, a novel approach to enhance question answering over large text collections by combining knowledge graphs and retri...
AI Agents: Tools, Planning, and Failure Modes - Chip Huyen
11 Mar 2025
Contributed by Lukas
AI agents, driven by foundation models, are emerging as intelligent assistants capable of perceiving and acting within their environments to complete ...
AI Agents Research Papers: Best of 2024
10 Mar 2025
Contributed by Lukas
Analytics Vidhya highlights the top AI Agents research papers of 2024, emphasizing their role in fields from NLP to autonomous systems. The article c...
Fine-Tuning LLMs: A Deep Dive into Alternatives
09 Mar 2025
Contributed by Lukas
Large language model (LLM) fine-tuning is a key technique for adapting pre-trained AI models to specific tasks or domains. Fine-tuning involves trai...
Advanced Prompt Engineering Techniques
08 Mar 2025
Contributed by Lukas
The provided texts explore the field of advanced prompt engineering, which focuses on refining inputs to AI models for optimal output. They highlight...
ChatGPT Prompts for Software Engineers
08 Mar 2025
Contributed by Lukas
The provided article explores the effective use of ChatGPT for software engineering tasks. It emphasizes the importance of clear and detailed prompts...
The Art of AI Prompt Crafting
08 Mar 2025
Contributed by Lukas
The "Art of AI Prompt Crafting" guide, found on the OpenAI Developer Forum, serves as a comprehensive resource for mastering the creation of...
Generative AI Agents: A Comprehensive Guide
07 Mar 2025
Contributed by Lukas
The "Google AI Agents" whitepaper introduces the concept of AI agents, which enhance generative AI models with reasoning, logic, and access to externa...