Artificial Intelligence : Papers & Concepts
Episodes
Helios: Rethinking How AI Models Scale Across Compute and Data
20 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Helios, a new approach focused on optimizing how large AI models scale acr...
BitNet: Rethinking Neural Networks With 1-Bit Precision
19 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore BitNet, a radically efficient approach to building neural networks using e...
Agents of Chaos: When Multiple AI Systems Interact in Unpredictable Ways
18 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Chaos Agents, a concept that examines what happens when multiple AI agents...
OC-SORT: Improving Object Tracking by Fixing Motion, Not Just Detection
17 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore OC-SORT (Observation-Centric SORT), an evolution of traditional tracking a...
Attention Residuals: Understanding the Hidden Signals Inside Transformer Models
16 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Attention Residuals, a concept that reveals how transformer models preserv...
SORT: A Simple and Efficient Approach to Real-Time Object Tracking
16 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore SORT (Simple Online and Realtime Tracking), a lightweight yet powerful alg...
SigLIP 2: Advancing Vision-Language Understanding Without Contrastive Bottlenecks
13 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore SigLIP 2, the next evolution of Google's vision–language model designed ...
Nemotron-3 Super: Pushing the Limits of Reasoning in Large Language Models
12 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Nemotron-3 Super, an advanced large language model designed to improve rea...
AI Hallucinations: Why Language Models Sometimes Make Things Up
11 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore the phenomenon of AI hallucinations-the moments when language models gener...
ByteTrack: A Smarter Way for AI to Track Objects in Real Time
10 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore ByteTrack, a breakthrough approach in multi-object tracking that significa...
AI and Copyright: Who Owns Content Created by Machines?
04 Mar 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore the growing debate around AI and copyright-one of the most important legal...
Qwen 3.5 - Advancing Open Multilingual Intelligence at Scale
27 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Qwen 3.5, the latest generation of large language models designed to push ...
Qwen 3: Advancing Open Multilingual Intelligence at Scale
26 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Qwen 3, the latest generation of large language models designed to push mu...
Unified Latents: Bringing Images, Video, and Language Into One Shared AI Space
25 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Unified Latents, a new approach that aims to merge different types of data...
DeepSeek-V3: Scaling Open Reasoning Models With Efficiency and Precision
23 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore DeepSeek-V3, a next-generation large language model designed to push the b...
Repeat-Repeat: Why Simply Repeating a Prompt Can Make LLMs Smarter
19 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore the surprisingly simple idea behind "Prompt Repetition Improves Non-Reason...
Seedance 2.0: Moving From AI Video Generation to Cinematic Intelligence
18 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Seedance 2.0, the next evolution of ByteDance's video foundation model des...
Molmo: Building Open Multimodal AI That Can Truly See and Understand
17 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we break down Molmo, an open multimodal model designed to understand images and langu...
Seedance 1.0: The Next Leap in AI Video Generation
16 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore Seedance 1.0, a new foundation model from ByteDance that is pushing the bo...
LoRA: Teaching Massive AI Models New Skills Without Retraining Everything
13 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we break down LoRA (Low-Rank Adaptation) - a breakthrough technique that makes fine-t...
Wembley Goal: How Computer Vision Settled Football's Most Controversial Moment
12 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we revisit the legendary 1966 World Cup Final and the infamous "Wembley Goal" - a mom...
I-JEPA: Teaching AI to Understand Images Without Labels
11 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we break down I-JEPA, a self-supervised vision architecture that moves beyond pixel-l...
EchoJEPA: Teaching AI to Truly Understand the Beating Heart
10 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we break down EchoJEPA, a large-scale foundation model trained on millions of real-wo...
PaperBanana: From Raw Text to Publication-Ready Diagrams
09 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we dive into PaperBanana, an agentic framework from Peking University and Google Clou...
SleepFM: Predicting Future Disease from a Single Night of Sleep
06 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we break down SleepFM, a large-scale multimodal foundation model that learns directly...
RF-DETR: Neural Architecture Search for Real-Time Detection Transformers
04 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we break down RF-DETR, a new direction in object detection that challenges the idea o...
YOLO26: Rethinking Real-Time Vision for the Edge
03 Feb 2026
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we break down YOLO26, a major shift in real-time object detection. Instead of chasin...
DeepSeek mHC
05 Jan 2026
Contributed by Lukas
Why do some large AI models suddenly collapse during training—and how can geometry prevent it? In this episode of Artificial Intelligence: Papers an...
Chinchilla Scaling Law
18 Dec 2025
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, curated by Dr. Satya Mallick, we break down DeepMind's 2022 paper "Training Compute-O...
Gradient-Based Planning
13 Dec 2025
Contributed by Lukas
How should an AI or robot decide what to do next? In this episode, we explore a new approach to planning that rethinks how world models are trained. T...
SAM3D: The Next Leap in 3D Understanding
10 Dec 2025
Contributed by Lukas
Forget flat photos—SAM3D is rewriting how machines understand the world. In this episode, we break down the groundbreaking new model that takes the ...
DINOv3 : A new Self-Supervised Learning (SSL) Vision Language Model (VLM)
29 Oct 2025
Contributed by Lukas
In this episode, we explore DINOv3, a new self-supervised learning (SSL) vision foundation model from Meta AI Research, emphasizing its ability to sca...
dots.ocr SOTA Document Parsing in a Compact VLM
28 Oct 2025
Contributed by Lukas
dots.ocr is a powerful, multilingual document parsing model from rednote-hilab that achieves state-of-the-art performance by unifying layout detect...
DeepSeek-OCR : A Revolutionary Idea
23 Oct 2025
Contributed by Lukas
In this episode, we dive deep into DeepSeek-OCR, a cutting-edge open-source Optical Character Recognition (OCR) / Text Recognition model that's redefi...
nanochat by Karpathy - How to build your own ChatGPT for $100
21 Oct 2025
Contributed by Lukas
"The best ChatGPT that $100 can buy." That's Andrej Karpathy's positioning for nanochat—a compact, end‑to‑end stack that goes from tokenizer tr...
SmolVLM: Small Yet Mighty Vision Language Model
01 Oct 2025
Contributed by Lukas
In this episode of Artificial Intelligence: Papers and Concepts, we explore SmolVLM, a family of compact yet powerful vision language models (VLMs) de...
Common Pitfalls in Computer Vision & AI Projects (and How to Avoid Them)
01 Oct 2025
Contributed by Lukas
In this episode, we dig deep into the unglamorous side of AI and computer vision projects — the mistakes, misfires, and blind spots that too often d...