Artificial Intelligence: Papers & Concepts

Technology Education

Episodes

Helios: Rethinking How AI Models Scale Across Compute and Data

20 Mar 2026

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we explore Helios, a new approach focused on optimizing how large AI models scale acr...

BitNet: Rethinking Neural Networks With 1-Bit Precision

19 Mar 2026

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we explore BitNet, a radically efficient approach to building neural networks using e...

Agents of Chaos: When Multiple AI Systems Interact in Unpredictable Ways

18 Mar 2026

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we explore Chaos Agents, a concept that examines what happens when multiple AI agents...

OC-SORT: Improving Object Tracking by Fixing Motion, Not Just Detection

17 Mar 2026

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we explore OC-SORT (Observation-Centric SORT), an evolution of traditional tracking a...

Attention Residuals: Understanding the Hidden Signals Inside Transformer Models

16 Mar 2026

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we explore Attention Residuals, a concept that reveals how transformer models preserv...

SORT: A Simple and Efficient Approach to Real-Time Object Tracking

16 Mar 2026

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we explore SORT (Simple Online and Realtime Tracking), a lightweight yet powerful alg...

SigLIP 2: Advancing Vision-Language Understanding Without Contrastive Bottlenecks

13 Mar 2026

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we explore SigLIP 2, the next evolution of Google's vision–language model designed ...

Nemotron-3 Super: Pushing the Limits of Reasoning in Large Language Models

12 Mar 2026

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we explore Nemotron-3 Super, an advanced large language model designed to improve rea...

AI Hallucinations: Why Language Models Sometimes Make Things Up

11 Mar 2026

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we explore the phenomenon of AI hallucinations, the moments when language models gener...

ByteTrack: A Smarter Way for AI to Track Objects in Real Time

10 Mar 2026

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we explore ByteTrack, a breakthrough approach in multi-object tracking that significa...

AI and Copyright: Who Owns Content Created by Machines?

04 Mar 2026

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we explore the growing debate around AI and copyright, one of the most important legal...

Qwen 3.5 - Advancing Open Multilingual Intelligence at Scale

27 Feb 2026

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we explore Qwen 3.5, the latest generation of large language models designed to push ...

Qwen 3: Advancing Open Multilingual Intelligence at Scale

26 Feb 2026

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we explore Qwen 3, the latest generation of large language models designed to push mu...

Unified Latents: Bringing Images, Video, and Language Into One Shared AI Space

25 Feb 2026

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we explore Unified Latents, a new approach that aims to merge different types of data...

DeepSeek-V3: Scaling Open Reasoning Models With Efficiency and Precision

23 Feb 2026

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we explore DeepSeek-V3, a next-generation large language model designed to push the b...

Repeat-Repeat: Why Simply Repeating a Prompt Can Make LLMs Smarter

19 Feb 2026

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we explore the surprisingly simple idea behind "Prompt Repetition Improves Non-Reason...

Seedance 2.0: Moving From AI Video Generation to Cinematic Intelligence

18 Feb 2026

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we explore Seedance 2.0, the next evolution of ByteDance's video foundation model des...

Molmo: Building Open Multimodal AI That Can Truly See and Understand

17 Feb 2026

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we break down Molmo, an open multimodal model designed to understand images and langu...

Seedance 1.0: The Next Leap in AI Video Generation

16 Feb 2026

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we explore Seedance 1.0, a new foundation model from ByteDance that is pushing the bo...

LoRA: Teaching Massive AI Models New Skills Without Retraining Everything

13 Feb 2026

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we break down LoRA (Low-Rank Adaptation) - a breakthrough technique that makes fine-t...

Wembley Goal: How Computer Vision Settled Football's Most Controversial Moment

12 Feb 2026

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we revisit the legendary 1966 World Cup Final and the infamous "Wembley Goal" - a mom...

I-JEPA: Teaching AI to Understand Images Without Labels

11 Feb 2026

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we break down I-JEPA, a self-supervised vision architecture that moves beyond pixel-l...

EchoJEPA: Teaching AI to Truly Understand the Beating Heart

10 Feb 2026

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we break down EchoJEPA, a large-scale foundation model trained on millions of real-wo...

PaperBanana: From Raw Text to Publication-Ready Diagrams

09 Feb 2026

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we dive into PaperBanana, an agentic framework from Peking University and Google Clou...

SleepFM: Predicting Future Disease from a Single Night of Sleep

06 Feb 2026

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we break down SleepFM, a large-scale multimodal foundation model that learns directly...

RF-DETR: Neural Architecture Search for Real-Time Detection Transformers

04 Feb 2026

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we break down RF-DETR, a new direction in object detection that challenges the idea o...

YOLO26: Rethinking Real-Time Vision for the Edge

03 Feb 2026

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we break down YOLO26, a major shift in real-time object detection. Instead of chasin...

DeepSeek mHC

05 Jan 2026

Contributed by Lukas

Why do some large AI models suddenly collapse during training—and how can geometry prevent it? In this episode of Artificial Intelligence: Papers an...

Chinchilla Scaling Law

18 Dec 2025

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, curated by Dr. Satya Mallick, we break down DeepMind's 2022 paper "Training Compute-O...

Gradient-Based Planning

13 Dec 2025

Contributed by Lukas

How should an AI or robot decide what to do next? In this episode, we explore a new approach to planning that rethinks how world models are trained. T...

SAM3D: The Next Leap in 3D Understanding

10 Dec 2025

Contributed by Lukas

Forget flat photos—SAM3D is rewriting how machines understand the world. In this episode, we break down the groundbreaking new model that takes the ...

DINOv3 : A new Self-Supervised Learning (SSL) Vision Language Model (VLM)

29 Oct 2025

Contributed by Lukas

In this episode, we explore DINOv3, a new self-supervised learning (SSL) vision foundation model from Meta AI Research, emphasizing its ability to sca...

dots.ocr SOTA Document Parsing in a Compact VLM

28 Oct 2025

Contributed by Lukas

dots.ocr is a powerful, multilingual document parsing model from rednote-hilab that achieves state-of-the-art performance by unifying layout detect...

DeepSeek-OCR : A Revolutionary Idea

23 Oct 2025

Contributed by Lukas

In this episode, we dive deep into DeepSeek-OCR, a cutting-edge open-source Optical Character Recognition (OCR) / Text Recognition model that's redefi...

nanochat by Karpathy - How to build your own ChatGPT for $100

21 Oct 2025

Contributed by Lukas

"The best ChatGPT that $100 can buy." That's Andrej Karpathy's positioning for nanochat—a compact, end‑to‑end stack that goes from tokenizer tr...

SmolVLM: Small Yet Mighty Vision Language Model

01 Oct 2025

Contributed by Lukas

In this episode of Artificial Intelligence: Papers and Concepts, we explore SmolVLM, a family of compact yet powerful vision language models (VLMs) de...

Common Pitfalls in Computer Vision & AI Projects (and How to Avoid Them)

01 Oct 2025

Contributed by Lukas

In this episode, we dig deep into the unglamorous side of AI and computer vision projects — the mistakes, misfires, and blind spots that too often d...