HuggingFace 每日AI论文速递

2026.02.26 | 分子图生成首破99%化学有效性；DreamID-Omni把多人脸音色混剪错配率砍到8%

26 Feb 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.02.25 | 数据工程赋能小模型；轻量重排刷新长文本SOTA

25 Feb 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.02.24 | VBVR百万视频补推理教材；VLANeXt十二配方炼成VLA

24 Feb 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.02.23 | VESPO防抖离线RL；推理模型学会“点到为止”

23 Feb 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

【周末特辑】2月第4周最火AI论文 | 少即是够；FAC靶向补特征；噪声基准SQuTR

22 Feb 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.02.20 | 砍95%注意力画质反升；边压缩边生成FID 1.4

20 Feb 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.02.19 | 可学习路由+量化加速视频扩散；残差追踪让人形90%抓取

19 Feb 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.02.18 | GLM-5智能体工程登顶50分；SAE可解释性遭随机基线打脸

18 Feb 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.02.17 | 查询锚定用户画像；量子原生数据库

17 Feb 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.02.16 | 特征激活补数据；区域蒸馏藏放大

16 Feb 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

【周末特辑】2月第3周最火AI论文 | OPUS精准选数据；弱模型反向助攻强模型

14 Feb 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.02.13 | 自演化AI难守安全；音频大模型统一token

13 Feb 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.02.12 | 稀疏MoE比肩GPT-5；GENIUS测流体智能

12 Feb 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.02.11 | OPUS对齐更新选数据；Code2World代码预演GUI

11 Feb 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.02.10 | ReAlign零训弥合图文隙；MOVA同步生成视音频

10 Feb 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.02.09 | AI问诊如住院医；互动悟规则才是真智能

09 Feb 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

【周末特辑】2月第2周最火AI论文 | 分阶段统一动作空间；ERNIE 5.0大一统多模态

08 Feb 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.02.06 | RLVR去长度偏见；长镜头不换记忆

06 Feb 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.02.05 | ERNIE 5.0统一模态；FASA稀疏注意力省内存

05 Feb 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.02.04 | 看图写代码省token；临时组队降成本

04 Feb 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.02.03 | 分阶段训练统一动作空间；MoE+视觉编码器并行智能体

04 Feb 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.02.03 | 分阶段训练统一动作空间；MoE+视觉编码器并行智能体

03 Feb 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.02.02 | ASTRA合成轨迹炼工具；THINKSAFE自对齐保安全

02 Feb 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

【月末特辑】1月最火AI论文 | mHC稳梯度；GDPO解多奖励

02 Feb 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

【周末特辑】2月第1周最火AI论文 | LLM当管家，数据变净菜；LongCat训特工，上网打副本

01 Feb 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.01.30 | 空间智能基准测不准；Idea2Story一键成文

30 Jan 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.01.29 | 难题优先补数学推理；LingBot生成交互平行世界

29 Jan 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.01.28 | AgentDoG筑护栏诊断风险根源；AdaReasoner排工具小模型逆袭

28 Jan 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.01.27 | Agent原生训练刷新SWE-Bench；LLM重塑数据清洗 pipeline

27 Jan 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.01.26 | LongCat练5600亿MoE代理满分；SWE-Pruner剪五成Token更快

26 Jan 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

【周末特辑】1月第4周最火AI论文 | Agentic LLM进化成行动派；群体RL纠偏难度歧视

24 Jan 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.01.23 | BayesianVLA逼模型“读心”；扩散模型“按顺序”更聪明

23 Jan 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.01.22 | LLM变数字特工；视频模型先考后练

22 Jan 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.01.21 | AI修Bug统一打分；MLLM未来预测仍易盲猜

21 Jan 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.01.20 | 沙盒测通才是真后端；分叉合并少字多想

20 Jan 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34...

2026.01.19 | GRPO回报纠偏助啃难题；毒苹果AI未用已扰市

19 Jan 2026

Contributed by Lukas

【赞助商】通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事传送门 🔗 https://www.xiaoyuzhoufm.com/podcast/688a3...

【周末特辑】1月第3周最火AI论文 | VideoDR测模型搜证漂移；BabyVision曝视觉短板

17 Jan 2026

Contributed by Lukas

本期的 5 篇论文如下：[00:29] TOP1(🔥201) | 🔍 Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic...

2026.01.16 | 10B模型逆袭千亿巨头；AI一眼读出城市功能

16 Jan 2026

Contributed by Lukas

本期的 15 篇论文如下：[00:20] 🚀 STEP3-VL-10B Technical Report（STEP3-VL-10B 技术报告）[01:01] 🏙 Urban Socio-Semantic Segmentation...

2026.01.15 | 算法自进化夺冠；LLM远瞻省token

15 Jan 2026

Contributed by Lukas

本期的 15 篇论文如下：[00:20] 🧬 Controlled Self-Evolution for Algorithmic Code Optimization（用于算法代码优化的受控自进化方...

2026.01.14 | 合成数据喂出低资源学霸；AI自演多轮对话更靠谱

14 Jan 2026

Contributed by Lukas

本期的 15 篇论文如下：[00:20] 🌍 Solar Open Technical Report（Solar Open 技术报告）[00:54] 🤖 User-Oriented Multi-Turn Dialogue Gen...

2026.01.13 | VideoDR让模型边搜边推理；BabyVision揭视觉短板

13 Jan 2026

Contributed by Lukas

本期的 15 篇论文如下：[00:20] 🔍 Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasonin...

2026.01.12 | 地图AI强化寻位；多模态Lean形式化

12 Jan 2026

Contributed by Lukas

本期的 15 篇论文如下：[00:20] 🗺 Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization（借助地图思考：用于...

【周末特辑】1月第2周最火AI论文 | GDPO分灶吃饭稳优化；NeoVerse单目视频建4D

11 Jan 2026

Contributed by Lukas

本期的 5 篇论文如下：[00:39] TOP1(🔥126) | 📈 GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimiza...

2026.01.09 | GDPO解耦奖励优化多任务；可学习乘数解锁矩阵尺度

09 Jan 2026

Contributed by Lukas

本期的 15 篇论文如下：[00:21] 📈 GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization（GDPO：面...

2026.01.08 | 熵加权微调保旧学；演化技能网络不断进阶

08 Jan 2026

Contributed by Lukas

本期的 15 篇论文如下：[00:21] ⚖ Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting（熵自适应微调：解...

2026.01.07 | 无限深度任意采样；端到端语音转录分离

07 Jan 2026

Contributed by Lukas

本期的 15 篇论文如下：[00:25] 🔍 InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields（InfiniDe...

2026.01.06 | K-EXAONE MoE；NextFlow统一序列建模多模态

06 Jan 2026

Contributed by Lukas

本期的 15 篇论文如下：[00:21] 🧠 K-EXAONE Technical Report（K-EXAONE技术报告）[00:56] 🚀 NextFlow: Unified Sequential Modeling Acti...

2026.01.05 | Agent流水线提速；4D建模平民化

05 Jan 2026

Contributed by Lukas

本期的 12 篇论文如下：[00:22] 🤖 Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization（Youtu-A...

【周末特辑】1月第1周最火AI论文 | mHC 稳梯度；思维景观 RAG 读长文

03 Jan 2026

Contributed by Lukas

本期的 5 篇论文如下：[00:33] TOP1(🔥132) | 🧠 mHC: Manifold-Constrained Hyper-Connections（mHC：流形约束的超连接）[02:32] TOP2...

2026.01.02 | 语义密度压缩；扩散边画边想

02 Jan 2026

Contributed by Lukas

本期的 3 篇论文如下：[00:19] 🧠 Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space（动态大型概念模型：自...

2026.01.01 | 小模型也能原生外挂；30B-MoE智体逼近大模型

01 Jan 2026

Contributed by Lukas

本期的 15 篇论文如下：[00:22] 🚀 Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models（Youtu-LLM：解锁...

【月末特辑】12月最火AI论文 | 代码智能全链路落地；开源模型推理代理双突破

01 Jan 2026

Contributed by Lukas

本期的 10 篇论文如下：[00:29] TOP1(🔥279) | 🧠 From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intellig...

2025.12.31 | 粗模精雕UltraShape；涂鸦编辑DreamOmni3

31 Dec 2025

Contributed by Lukas

本期的 6 篇论文如下：[00:24] 🧊 UltraShape 1.0: High-Fidelity 3D Shape Generation via Scalable Geometric Refinement（UltraShape 1.0：通过...

2025.12.30 | ERC耦合路由与专家；LiveTalk实时视频对话

30 Dec 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:24] 🔗 Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss（通过辅助损失耦合专家混...

2025.12.29 | 鸟瞰式检索提效小模型；4D扩散一键插入逼真物体

29 Dec 2025

Contributed by Lukas

本期的 13 篇论文如下：[00:27] 🧠 Mindscape-Aware Retrieval Augmented Generation for Improved Long Context Understanding（面向提升长文...

【周末特辑】12月第5周最火AI论文 | DataFlow炼数工厂上线；AI科学家跑不完闭环

27 Dec 2025

Contributed by Lukas

本期的 5 篇论文如下：[00:42] TOP1(🔥188) | ⚙ DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in th...

2025.12.26 | 暗号token涨点视觉推理；3D便签本让视频长脑子

26 Dec 2025

Contributed by Lukas

本期的 6 篇论文如下：[00:19] 🧠 Latent Implicit Visual Reasoning（潜在隐式视觉推理）[00:56] 🎬 Spatia: Video Generation with Up...

2025.12.25 | 四维动态理解刷新VLM；单卡200倍速生成高清视频

25 Dec 2025

Contributed by Lukas

本期的 14 篇论文如下：[00:20] 🧠 Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models（学习在四维空间...

2025.12.24 | 语义蓝图提速视频生成；逐层剖析炼出强策略

24 Dec 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:19] 🎬 SemanticGen: Video Generation in Semantic Space（SemanticGen：在语义空间中的视频生成）[01:01...

2025.12.23 | 数据工厂提效；棱镜假说统合

23 Dec 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:22] ⚙ DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-C...

2025.12.22 | PhysBrain用第一人称视频让AI学会动手；大模型离科学家AI还差得远

22 Dec 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:24] 🧠 PhysBrain: Human Egocentric Data as a Bridge from Vision Language Models to Physical Intelligence（PhysBr...

【周末特辑】12月第4周最火AI论文 | 全能生成Kling-Omni秒出4K影片；Step-GUI让手机代理本地跑

20 Dec 2025

Contributed by Lukas

本期的 5 篇论文如下：[00:37] TOP1(🔥117) | 🎬 Kling-Omni Technical Report（Kling-Omni技术报告）[02:55] TOP2(🔥116) | 🤖 Step-GU...

2025.12.19 | Kling-Omni一统视频生成；LLaDA2.0百亿扩散模型

19 Dec 2025

Contributed by Lukas

本期的 14 篇论文如下：[00:26] 🎬 Kling-Omni Technical Report（Kling-Omni技术报告）[01:02] 🚀 LLaDA2.0: Scaling Up Diffusion Languag...

2025.12.18 | 校准步长奖励砍成本；扩散草稿自回归验证提速

18 Dec 2025

Contributed by Lukas

本期的 14 篇论文如下：[00:25] 🤖 Step-GUI Technical Report（Step-GUI技术报告）[00:59] ⚡ DEER: Draft with Diffusion, Verify with Aut...

2025.12.17 | MMGR揭多模态推理短板；WorldPlay保几何一致实时建模

17 Dec 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:23] 🧠 MMGR: Multi-Modal Generative Reasoning（MMGR：多模态生成式推理评估与基准）[01:14] 🎮 Wor...

2025.12.16 | 代理记忆三维框架；VTP刷新生成纪录

16 Dec 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:20] 🧠 Memory in the Age of AI Agents（人工智能代理时代下的记忆）[00:57] 🚀 Towards Scalable Pre-...

2025.12.15 | 牙科小模型逆袭；扩散模型弃VAE

15 Dec 2025

Contributed by Lukas

本期的 14 篇论文如下：[00:22] 🦷 DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry（DentalGPT：激励牙科领域多模态...

【周末特辑】12月第3周最火AI论文 | 潜轨迹制导视频运动；并行自蒸馏提速推理

13 Dec 2025

Contributed by Lukas

本期的 5 篇论文如下：[00:30] TOP1(🔥117) | 🎬 Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance（Wan-Move：...

2025.12.12 | RL捏3D新纪录；AI奥赛摘银牌

12 Dec 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:25] 🤖 Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation（我们准备好将强化学习...

2025.12.11 | StereoWorld单目秒变立体大片；BiCo跨域拼贴新概念

11 Dec 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:22] 🎥 StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation（StereoWorld：几何感知的单目到立...

2025.12.10 | 潜在轨迹控运动；WebGPU实时溅射

10 Dec 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:24] 🎬 Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance（Wan-Move：通过潜在轨...

2025.12.09 | 并行自蒸馏提速4.6倍；虚部RoPE++长文本双优化

09 Dec 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:20] ⚡ Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning（原生并行...

2025.12.08 | 自对抗一步生成；外挂评审迭代编辑

08 Dec 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:19] ⚡ TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows（TwinFlow：基于自对...

【周末特辑】12月第2周最火AI论文 | 代码智能全链路拆解；开源DeepSeek-V3.2登顶

07 Dec 2025

Contributed by Lukas

本期的 5 篇论文如下：[00:32] TOP1(🔥239) | 🧠 From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intellige...

2025.12.05 | DAComp立Agent新靶；流式化身无限实时

05 Dec 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:22] 📊 DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle（DAComp：跨全数据智能...

2025.12.04 | Qwen3-VL多模态超长上下文；PretrainZero强化主动预训练

04 Dec 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:24] 🧠 Qwen3-VL Technical Report（Qwen3-VL 技术报告）[00:57] 🧠 PretrainZero: Reinforcement Active Pretra...

【月末特辑】11月最火AI论文 | Kandinsky 5.0全家桶开源；视频生成让模型边播边想

03 Dec 2025

Contributed by Lukas

本期的 10 篇论文如下：[00:35] TOP1(🔥219) | 🎨 Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation（Kandinsky 5....

2025.12.02 | 代码智能四步落地；LongVT长视频精准理解

02 Dec 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:20] 🧠 From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence（从代码...

2025.12.01 | Z-Image小参高效夺冠；REASONEDIT先思后画登顶

01 Dec 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:26] 🚀 Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer（Z-Image...

【周末特辑】11月第5周最火AI论文 | 自适应正交稳训练；GAM代理即搜忆

29 Nov 2025

Contributed by Lukas

本期的 5 篇论文如下：[00:51] TOP1(🔥161) | ⚡ ROOT: Robust Orthogonalized Optimizer for Neural Network Training（ROOT：面向神经网络...

2025.11.28 | 潜在奖励模型提速降显存；画布多模态生成碾压SOTA

28 Nov 2025

Contributed by Lukas

本期的 6 篇论文如下：[00:19] 🎬 Video Generation Models Are Good Latent Reward Models（视频生成模型是优秀的潜在奖励模型）...

2025.11.27 | 俄语多模态评测补空白；潜协作提速14%

27 Nov 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:22] 🔍 Multimodal Evaluation of Russian-language Architectures（俄语多模态架构的评估框架）[01:15] 🧠...

2025.11.26 | 大模型育种进化框架开源；MedSAM-3听懂临床精准分割

26 Nov 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:17] 🧬 GigaEvo: An Open Source Optimization Framework Powered By LLMs And Evolution Algorithms（GigaEvo：基于...

2025.11.25 | 即时编译让记忆无损；AutoEnv自动挑环境提两成

25 Nov 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:25] 🧠 General Agentic Memory Via Deep Research（通过深度研究的通用代理记忆）[00:52] 🧪 AutoEnv:...

2025.11.24 | 开源7B模型刷新多模态推理；GeoVista小模型精准地理定位

24 Nov 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:21] 🧠 OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe（OpenMMRea...

【周末特辑】11月第4周最火AI论文 | Kandinsky 5.0开源全家桶；MiroThinker开源智能体

22 Nov 2025

Contributed by Lukas

本期的 5 篇论文如下：[00:41] TOP1(🔥171) | 🎨 Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation（Kandinsky 5.0...

2025.11.21 | V-ReasonBench考视频模型推理；Step-Audio-R1让语音越“想”越强

21 Nov 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:22] 📊 V-ReasonBench: Toward Unified Reasoning Benchmark Suite for Video Generation Models（V-ReasonBench：面向...

2025.11.20 | 视频模型拍推理链，迷宫百发百中；无标注左右互搏，视觉模型自学跃升

20 Nov 2025

Contributed by Lukas

本期的 4 篇论文如下：[00:23] 🎬 Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks（...

2025.11.19 | 像素演员难推理；视觉误导测真章

19 Nov 2025

Contributed by Lukas

本期的 11 篇论文如下：[00:23] 🧠 Can World Simulators Reason? Gen-ViRe: A Generative Visual Reasoning Benchmark（世界模拟器会推理吗...

2025.11.18 | RL奥赛夺金；Uni-MoE 2.0全能跃升

18 Nov 2025

Contributed by Lukas

本期的 14 篇论文如下：[00:17] 🏅 P1: Mastering Physics Olympiads with Reinforcement Learning（用强化学习攻克物理奥赛）[00:56] ...

2025.11.17 | RoPE去噪救长文本；AI速筛离子液体

17 Nov 2025

Contributed by Lukas

本期的 13 篇论文如下：[00:24] 🧹 DoPE: Denoising Rotary Position Embedding（DoPE：面向旋转位置嵌入的去噪处理）[00:58] 🧪 ...

【周末特辑】11月第3周最火AI论文 | 3D游戏智能体开源方案；桌面AI少样本精准操控

15 Nov 2025

Contributed by Lukas

本期的 5 篇论文如下：[00:38] TOP1(🔥135) | 🌍 Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds（Lumine：在3D开...

2025.11.14 | UniVA四合一开源视频通才；Depth Anything 3单ViT通吃3D

14 Nov 2025

Contributed by Lukas

本期的 4 篇论文如下：[00:24] 🎬 UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist（UniVA：面向开源下...

2025.11.13 | 原神数据炼成7B通用AI；零训练轨迹秒变视频遥控器

13 Nov 2025

Contributed by Lukas

本期的 9 篇论文如下：[00:19] 🌍 Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds（Lumine：在3D开放世界中打造...

2025.11.12 | 1.5B小模型反超671B大模型；多智能体质检聊天机器人

12 Nov 2025

Contributed by Lukas

本期的 9 篇论文如下：[00:24] 🧠 Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1...

2025.11.11 | 小窗口勤总结刷新深度研究；先广撒网再啃难题激活代码竞赛

11 Nov 2025

Contributed by Lukas

本期的 13 篇论文如下：[00:25] 🧩 IterResearch: Rethinking Long-Horizon Agents via Markovian State Reconstruction（IterResearch：基于马...

2025.11.10 | DeepEyesV2小模型边看图边写代码；纯数据让AI长出立体眼

10 Nov 2025

Contributed by Lukas

本期的 7 篇论文如下：[00:21] 🧠 DeepEyesV2: Toward Agentic Multimodal Model（DeepEyesV2：迈向智能体多模态模型）[01:13] 🧭 Vi...

【周末特辑】11月第2周最火AI论文 | 视频生成即推理；SVG草图变代码

08 Nov 2025

Contributed by Lukas

本期的 5 篇论文如下：[00:31] TOP1(🔥137) | 🎬 Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm（用视...

2025.11.07 | 视频推理新范式；图像互动促思维

07 Nov 2025

Contributed by Lukas

本期的 12 篇论文如下：[00:21] 🎬 Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm（用视频思考：视...

2025.11.06 | 扩散模型省数据；音视频对口型

06 Nov 2025

Contributed by Lukas

本期的 9 篇论文如下：[00:17] 🚀 Diffusion Language Models are Super Data Learners（扩散语言模型是超级数据学习者）[01:06] 🎬...

Activity Overview

Episodes

2026.02.26 | 分子图生成首破99%化学有效性；DreamID-Omni把多人脸音色混剪错配率砍到8%

2026.02.25 | 数据工程赋能小模型；轻量重排刷新长文本SOTA

2026.02.24 | VBVR百万视频补推理教材；VLANeXt十二配方炼成VLA

2026.02.23 | VESPO防抖离线RL；推理模型学会“点到为止”

【周末特辑】2月第4周最火AI论文 | 少即是够；FAC靶向补特征；噪声基准SQuTR

2026.02.20 | 砍95%注意力画质反升；边压缩边生成FID 1.4

2026.02.19 | 可学习路由+量化加速视频扩散；残差追踪让人形90%抓取

2026.02.18 | GLM-5智能体工程登顶50分；SAE可解释性遭随机基线打脸

2026.02.17 | 查询锚定用户画像；量子原生数据库

2026.02.16 | 特征激活补数据；区域蒸馏藏放大

【周末特辑】2月第3周最火AI论文 | OPUS精准选数据；弱模型反向助攻强模型

2026.02.13 | 自演化AI难守安全；音频大模型统一token

2026.02.12 | 稀疏MoE比肩GPT-5；GENIUS测流体智能

2026.02.11 | OPUS对齐更新选数据；Code2World代码预演GUI

2026.02.10 | ReAlign零训弥合图文隙；MOVA同步生成视音频

2026.02.09 | AI问诊如住院医；互动悟规则才是真智能

【周末特辑】2月第2周最火AI论文 | 分阶段统一动作空间；ERNIE 5.0大一统多模态

2026.02.06 | RLVR去长度偏见；长镜头不换记忆

2026.02.05 | ERNIE 5.0统一模态；FASA稀疏注意力省内存

2026.02.04 | 看图写代码省token；临时组队降成本

2026.02.03 | 分阶段训练统一动作空间；MoE+视觉编码器并行智能体

2026.02.03 | 分阶段训练统一动作空间；MoE+视觉编码器并行智能体

2026.02.02 | ASTRA合成轨迹炼工具；THINKSAFE自对齐保安全

【月末特辑】1月最火AI论文 | mHC稳梯度；GDPO解多奖励

【周末特辑】2月第1周最火AI论文 | LLM当管家，数据变净菜；LongCat训特工，上网打副本

2026.01.30 | 空间智能基准测不准；Idea2Story一键成文

2026.01.29 | 难题优先补数学推理；LingBot生成交互平行世界

2026.01.28 | AgentDoG筑护栏诊断风险根源；AdaReasoner排工具小模型逆袭

2026.01.27 | Agent原生训练刷新SWE-Bench；LLM重塑数据清洗 pipeline

2026.01.26 | LongCat练5600亿MoE代理满分；SWE-Pruner剪五成Token更快

【周末特辑】1月第4周最火AI论文 | Agentic LLM进化成行动派；群体RL纠偏难度歧视

2026.01.23 | BayesianVLA逼模型“读心”；扩散模型“按顺序”更聪明

2026.01.22 | LLM变数字特工；视频模型先考后练

2026.01.21 | AI修Bug统一打分；MLLM未来预测仍易盲猜

2026.01.20 | 沙盒测通才是真后端；分叉合并少字多想

2026.01.19 | GRPO回报纠偏助啃难题；毒苹果AI未用已扰市

【周末特辑】1月第3周最火AI论文 | VideoDR测模型搜证漂移；BabyVision曝视觉短板

2026.01.16 | 10B模型逆袭千亿巨头；AI一眼读出城市功能

2026.01.15 | 算法自进化夺冠；LLM远瞻省token

2026.01.14 | 合成数据喂出低资源学霸；AI自演多轮对话更靠谱

2026.01.13 | VideoDR让模型边搜边推理；BabyVision揭视觉短板

2026.01.12 | 地图AI强化寻位；多模态Lean形式化

【周末特辑】1月第2周最火AI论文 | GDPO分灶吃饭稳优化；NeoVerse单目视频建4D

2026.01.09 | GDPO解耦奖励优化多任务；可学习乘数解锁矩阵尺度

2026.01.08 | 熵加权微调保旧学；演化技能网络不断进阶

2026.01.07 | 无限深度任意采样；端到端语音转录分离

2026.01.06 | K-EXAONE MoE；NextFlow统一序列建模多模态

2026.01.05 | Agent流水线提速；4D建模平民化

【周末特辑】1月第1周最火AI论文 | mHC 稳梯度；思维景观 RAG 读长文

2026.01.02 | 语义密度压缩；扩散边画边想

2026.01.01 | 小模型也能原生外挂；30B-MoE智体逼近大模型

【月末特辑】12月最火AI论文 | 代码智能全链路落地；开源模型推理代理双突破

2025.12.31 | 粗模精雕UltraShape；涂鸦编辑DreamOmni3

2025.12.30 | ERC耦合路由与专家；LiveTalk实时视频对话

2025.12.29 | 鸟瞰式检索提效小模型；4D扩散一键插入逼真物体

【周末特辑】12月第5周最火AI论文 | DataFlow炼数工厂上线；AI科学家跑不完闭环

2025.12.26 | 暗号token涨点视觉推理；3D便签本让视频长脑子

2025.12.25 | 四维动态理解刷新VLM；单卡200倍速生成高清视频

2025.12.24 | 语义蓝图提速视频生成；逐层剖析炼出强策略

2025.12.23 | 数据工厂提效；棱镜假说统合

2025.12.22 | PhysBrain用第一人称视频让AI学会动手；大模型离科学家AI还差得远

【周末特辑】12月第4周最火AI论文 | 全能生成Kling-Omni秒出4K影片；Step-GUI让手机代理本地跑

2025.12.19 | Kling-Omni一统视频生成；LLaDA2.0百亿扩散模型

2025.12.18 | 校准步长奖励砍成本；扩散草稿自回归验证提速

2025.12.17 | MMGR揭多模态推理短板；WorldPlay保几何一致实时建模

2025.12.16 | 代理记忆三维框架；VTP刷新生成纪录

2025.12.15 | 牙科小模型逆袭；扩散模型弃VAE

【周末特辑】12月第3周最火AI论文 | 潜轨迹制导视频运动；并行自蒸馏提速推理

2025.12.12 | RL捏3D新纪录；AI奥赛摘银牌

2025.12.11 | StereoWorld单目秒变立体大片；BiCo跨域拼贴新概念

2025.12.10 | 潜在轨迹控运动；WebGPU实时溅射

2025.12.09 | 并行自蒸馏提速4.6倍；虚部RoPE++长文本双优化

2025.12.08 | 自对抗一步生成；外挂评审迭代编辑

【周末特辑】12月第2周最火AI论文 | 代码智能全链路拆解；开源DeepSeek-V3.2登顶

2025.12.05 | DAComp立Agent新靶；流式化身无限实时

2025.12.04 | Qwen3-VL多模态超长上下文；PretrainZero强化主动预训练

【月末特辑】11月最火AI论文 | Kandinsky 5.0全家桶开源；视频生成让模型边播边想