Artificial Intelligence : Papers & Concepts
Feed Update Issues
We're having trouble fetching new episodes from this podcast's RSS feed. Last successful update was 2026-05-01 00:02:55.702315.
Episodes
Vision Banana: Rethinking How AI Models See and Generalize
23 Apr 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Vision Banana, a concept that challenges how vision models learn and gener...
Position Encoding: How Transformers Understand Order in Data
22 Apr 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Position Encoding, a fundamental concept that enables transformer models t...
V-JEPA 2.1: Learning Video Understanding Without Labels
21 Apr 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore V-JEPA 2.1, a next-generation video learning model that shifts away from t...
Agentic AI Cost: The Hidden Economics of Autonomous Systems
20 Apr 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Agentic AI Cost, a deep dive into the often-overlooked economics of autono...
ChopGrad: Making Training More Efficient by Cutting Gradient Complexity
17 Apr 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore ChopGrad, a novel technique aimed at improving the efficiency of training ...
Qwen Image Edit: Bringing Precision and Control to AI-Powered Image Editing
16 Apr 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Qwen Image Edit, a multimodal system designed to make image editing more p...
Ouro: Building Self-Improving AI Through Iterative Learning Loops
15 Apr 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Ouro, a new approach to AI that focuses on self-improvement through iterat...
Mythos: Teaching AI to Understand Stories, Not Just Text
14 Apr 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Mythos, a new approach focused on helping AI systems understand narratives...
DRCT: Rethinking Image Restoration With Diffusion-Based Reconstruction
13 Apr 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore DRCT, a diffusion-based approach to image restoration that focuses on reco...
LongCat: Scaling Image Editing With Long-Context Understanding
11 Apr 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore LongCat, a new approach to AI-powered image editing that focuses on handli...
BLIP-2: Bridging Vision and Language Without Full Retraining
10 Apr 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore BLIP-2, a powerful vision–language model that connects pretrained image ...
Ultralytics Platform: Simplifying End-to-End Computer Vision Development
09 Apr 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore the Ultralytics Platform, a unified ecosystem designed to make building, t...
OpenSeeker: Rethinking Search With AI-Native Reasoning
06 Apr 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore OpenSeeker, an emerging approach to building AI-native search systems that...
Apple MPS: Unlocking GPU Acceleration for AI on Apple Devices
06 Apr 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Apple MPS (Metal Performance Shaders), Apple's framework for accelerating ...
LeWorldModel: Teaching AI to Simulate and Understand the World
03 Apr 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore LeWorldModel, a new approach to building AI systems that can model and sim...
V-JEPA 2.1: Learning to Understand Video Without Labels
02 Apr 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore V-JEPA 2.1, an advanced video learning model that moves beyond traditional...
NeRFify: Turning Images Into Immersive 3D Worlds With AI
01 Apr 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore NeRFify, a cutting-edge approach that uses neural radiance fields (NeRFs) ...
Molmo Point: Teaching AI to Ground Language in Precise Visual Locations
31 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Molmo Point, an extension of multimodal AI that focuses on precise visual ...
Think, Then Lie: When AI Reasoning Doesn't Guarantee Truth
30 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore "Think, Then Lie," a concept that challenges a key assumption in modern AI...
ReCoSplat: Reconstructing 3D Worlds From Sparse Visual Data
27 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore ReCoSplat, a novel approach to 3D scene reconstruction that leverages spar...
Video Understanding: Teaching AI to Make Sense of Motion and Time
26 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Video Understanding, a rapidly evolving area of AI focused on helping mode...
Penguin-VL: Advancing Vision–Language Models With Stronger Reasoning
25 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Penguin-VL, a new vision–language model designed to improve how AI syste...
cuVSLAM: Accelerating Real-Time Visual SLAM With GPU Power
24 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore cuVSLAM, NVIDIA's GPU-accelerated solution for visual simultaneous localiz...
MM-Zero: Learning Multimodal Intelligence From Scratch
23 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore MM-Zero, a new approach to building multimodal AI systems that learn from ...
Helios: Rethinking How AI Models Scale Across Compute and Data
20 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Helios, a new approach focused on optimizing how large AI models scale acr...
BitNet: Rethinking Neural Networks With 1-Bit Precision
19 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore BitNet, a radically efficient approach to building neural networks using e...
Agents of Chaos: When Multiple AI Systems Interact in Unpredictable Ways
18 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Chaos Agents, a concept that examines what happens when multiple AI agents...
OC-SORT: Improving Object Tracking by Fixing Motion, Not Just Detection
17 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore OC-SORT (Observation-Centric SORT), an evolution of traditional tracking a...
Attention Residuals: Understanding the Hidden Signals Inside Transformer Models
16 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Attention Residuals, a concept that reveals how transformer models preserv...
SORT: A Simple and Efficient Approach to Real-Time Object Tracking
16 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore SORT (Simple Online and Realtime Tracking), a lightweight yet powerful alg...
SigLIP 2: Advancing Vision-Language Understanding Without Contrastive Bottlenecks
13 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore SigLIP 2, the next evolution of Google's vision–language model designed ...
Nemotron-3 Super: Pushing the Limits of Reasoning in Large Language Models
12 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Nemotron-3 Super, an advanced large language model designed to improve rea...
AI Hallucinations: Why Language Models Sometimes Make Things Up
11 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore the phenomenon of AI hallucinations-the moments when language models gener...
ByteTrack: A Smarter Way for AI to Track Objects in Real Time
10 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore ByteTrack, a breakthrough approach in multi-object tracking that significa...
AI and Copyright: Who Owns Content Created by Machines?
04 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore the growing debate around AI and copyright-one of the most important legal...
Qwen 3.5 - Advancing Open Multilingual Intelligence at Scale
27 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Qwen 3.5, the latest generation of large language models designed to push ...
Qwen 3: Advancing Open Multilingual Intelligence at Scale
26 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Qwen 3, the latest generation of large language models designed to push mu...
Unified Latents: Bringing Images, Video, and Language Into One Shared AI Space
25 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Unified Latents, a new approach that aims to merge different types of data...
DeepSeek-V3: Scaling Open Reasoning Models With Efficiency and Precision
23 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore DeepSeek-V3, a next-generation large language model designed to push the b...
Repeat-Repeat: Why Simply Repeating a Prompt Can Make LLMs Smarter
19 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore the surprisingly simple idea behind "Prompt Repetition Improves Non-Reason...
Seedance 2.0: Moving From AI Video Generation to Cinematic Intelligence
18 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Seedance 2.0, the next evolution of ByteDance's video foundation model des...
Molmo: Building Open Multimodal AI That Can Truly See and Understand
17 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we break down Molmo, an open multimodal model designed to understand images and langu...
Seedance 1.0: The Next Leap in AI Video Generation
16 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Seedance 1.0, a new foundation model from ByteDance that is pushing the bo...
LoRA: Teaching Massive AI Models New Skills Without Retraining Everything
13 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we break down LoRA (Low-Rank Adaptation) - a breakthrough technique that makes fine-t...
Wembley Goal: How Computer Vision Settled Football's Most Controversial Moment
12 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we revisit the legendary 1966 World Cup Final and the infamous "Wembley Goal" - a mom...
I-JEPA: Teaching AI to Understand Images Without Labels
11 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we break down I-JEPA, a self-supervised vision architecture that moves beyond pixel-l...
EchoJEPA: Teaching AI to Truly Understand the Beating Heart
10 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we break down EchoJEPA, a large-scale foundation model trained on millions of real-wo...
PaperBanana: From Raw Text to Publication-Ready Diagrams
09 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we dive into PaperBanana, an agentic framework from Peking University and Google Clou...
SleepFM: Predicting Future Disease from a Single Night of Sleep
06 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we break down SleepFM, a large-scale multimodal foundation model that learns directly...
RF-DETR: Neural Architecture Search for Real-Time Detection Transformers
04 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we break down RF-DETR, a new direction in object detection that challenges the idea o...
YOLO26: Rethinking Real-Time Vision for the Edge
03 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we break down YOLO26, a major shift in real-time object detection. Instead of chasin...
DeepSeek mHC
05 Jan 2026
Contributed by Lukas
Why do some large AI models suddenly collapse during training—and how can geometry prevent it? In this episode of Artificial Intelligence: Papers an...
Chinchilla Scaling Law
18 Dec 2025
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, curated by Dr. Satya Mallick, we break down DeepMind's 2022 paper "Training Compute-O...
Gradient-Based Planning
13 Dec 2025
Contributed by Lukas
How should an AI or robot decide what to do next? In this episode, we explore a new approach to planning that rethinks how world models are trained. T...
SAM3D: The Next Leap in 3D Understanding
10 Dec 2025
Contributed by Lukas
Forget flat photos—SAM3D is rewriting how machines understand the world. In this episode, we break down the groundbreaking new model that takes the ...
DINOv3 : A new Self-Supervised Learning (SSL) Vision Language Model (VLM)
29 Oct 2025
Contributed by Lukas
In this episode, we explore DINOv3, a new self-supervised learning (SSL) vision foundation model from Meta AI Research, emphasizing its ability to sca...
dots.ocr SOTA Document Parsing in a Compact VLM
28 Oct 2025
Contributed by Lukas
dots.ocr is a powerful, multilingual document parsing model from rednote-hilab that achieves state-of-the-art performance by unifying layout detect...
DeepSeek-OCR : A Revolutionary Idea
23 Oct 2025
Contributed by Lukas
In this episode, we dive deep into DeepSeek-OCR, a cutting-edge open-source Optical Character Recognition (OCR) / Text Recognition model that's redefi...
nanochat by Karpathy - How to build your own ChatGPT for $100
21 Oct 2025
Contributed by Lukas
"The best ChatGPT that $100 can buy." That's Andrej Karpathy's positioning for nanochat—a compact, end‑to‑end stack that goes from tokenizer tr...
SmolVLM: Small Yet Mighty Vision Language Model
01 Oct 2025
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore SmolVLM, a family of compact yet powerful vision language models (VLMs) de...
Common Pitfalls in Computer Vision & AI Projects (and How to Avoid Them)
01 Oct 2025
Contributed by Lukas
In this episode, we dig deep into the unglamorous side of AI and computer vision projects — the mistakes, misfires, and blind spots that too often d...