HuggingFace 每日AI论文速递

2025.11.05 | 向量草图测代码；先画后想补视觉

05 Nov 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:21] 🖼 VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation（VCode：以SVG为符号视...

2025.11.04 | 超稀疏MoE激活万亿参数；视觉模型看图胜GNN

04 Nov 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:23] 🧠 Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation（全激活赋能...

2025.11.03 | OS-Sentinel实时守护手机操作安全；ThinkMorph让小模型边想边画

03 Nov 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:21] 🛡 OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows（OS-S...

【月末特辑】10月最火AI论文 | 幼龙BDH稀疏可解释；迷你递归7兆碾压大模型

02 Nov 2025

Contributed by Lukas

本期的 10 篇论文如下：[00:30] TOP1(🔥522) | 🐣 The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain（幼...

【周末特辑】11月第1周最火AI论文 | 循环模型省参强推理；Concerto 2D-3D自监督涨点

01 Nov 2025

Contributed by Lukas

本期的 5 篇论文如下：[00:35] TOP1(🔥174) | 🔄 Scaling Latent Reasoning via Looped Language Models（通过循环语言模型扩展潜在推...

2025.10.31 | Emu3.5统一预测时空；扩散提示驱动机器人

31 Oct 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:26] 🌍 Emu3.5: Native Multimodal Models are World Learners（Emu3.5：原生多模态世界模型让AI看懂并预...

2025.10.30 | 看图写码7B逆袭；视频思维RL破局

30 Oct 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:22] 👁 JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence（JanusCoder：面向...

2025.10.29 | 通义深度研究报告；小模型折记忆胜671B巨模型

29 Oct 2025

Contributed by Lukas

本期的 10 篇论文如下：[00:23] 🔍 Tongyi DeepResearch Technical Report（通义深度研究报告：面向长程深度信息检索任务的智...

2025.10.28 | Point Transformer无标对齐长空间；代码递归统一粗细粒度

28 Oct 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:23] 🎼 Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations（Concerto：2D-3D联合自...

2025.10.27 | DeepAgent一步推理+ToolPO；视频即提示DiT秒控百种语义

27 Oct 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:27] 🧠 DeepAgent: A General Reasoning Agent with Scalable Toolsets（DeepAgent：具备可扩展工具集的通用...

【周末特辑】10月第4周最火AI论文 | 内部概率+投票剪尾，RPC省样本提精度

26 Oct 2025

Contributed by Lukas

本期的 5 篇论文如下：[00:29] TOP1(🔥135) | 🧠 A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning...

2025.10.24 | AdaSPEC挑40% token提速两成；AutoPage 10美分生成交互网页

24 Oct 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:23] 🎯 AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders（AdaSPEC：面向高效推测...

2025.10.23 | 线性注意力显存降十倍；动态裁剪PPO稳提分

23 Oct 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:19] 🧠 Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning（每一种注意力都...

2025.10.22 | LightMem压缩记忆千倍提速12倍；闭环世界模型微调8万数据反超巨兽

22 Oct 2025

Contributed by Lukas

本期的 14 篇论文如下：[00:19] 🧠 LightMem: Lightweight and Efficient Memory-Augmented Generation（LightMem：轻量高效的记忆增强生...

2025.10.21 | 模型不懂光影折射；小模型也能写报告

21 Oct 2025

Contributed by Lukas

本期的 13 篇论文如下：[00:21] 🪞 PICABench: How Far Are We from Physically Realistic Image Editing?（PICABench：我们离物理真实的图...

2025.10.20 | RPC剪枝提速保准；OmniVinci小数据跨模态称王

20 Oct 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:20] 🧠 A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning（大模型推...

【周末特辑】10月第3周最火AI论文 | 量化噪声变探索，单卡跑RL；冻结编码器放语义，DiT生成新纪录

18 Oct 2025

Contributed by Lukas

本期的 5 篇论文如下：[00:40] TOP1(🔥154) | 🚀 QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs（QeRL：超...

2025.10.17 | AI眼镜预判式服务；视频生成补想象力

17 Oct 2025

Contributed by Lukas

本期的 11 篇论文如下：[00:25] 👓 AI for Service: Proactive Assistance with AI Glasses（AI服务：AI眼镜的主动式协助）[01:06] 🎬...

2025.10.16 | UniMoE一统语音音乐；注意力图点亮大模型推理

16 Oct 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:21] 🎧 UniMoE-Audio: Unified Speech and Music Generation with Dynamic-Capacity MoE（UniMoE-Audio：基于动态容...

2025.10.15 | 像素级自监督ViT刷新生成基准；多智能体评测网文翻译新标尺

15 Oct 2025

Contributed by Lukas

本期的 14 篇论文如下：[00:20] 🖼 Advancing End-to-End Pixel Space Generative Modeling via Self-supervised Pre-training（通过自监督预...

2025.10.14 | 量化误差变奖励，单卡训32B；面向多模态大模型的音视频评测基准

14 Oct 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:23] 🚀 QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs（QeRL：超越效率——...

2025.10.13 | 桌面交互预训练解锁机器人潜能；统一模型赋予相机空间想象力

13 Oct 2025

Contributed by Lukas

本期的 14 篇论文如下：[00:20] 🖥 D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI（D2E：利用桌面数...

【周末特辑】10月第2周最火AI论文 | 递归小模型刷爆推理榜；未来经验点亮零奖励学习

12 Oct 2025

Contributed by Lukas

本期的 5 篇论文如下：[00:33] TOP1(🔥300) | 🧠 Less is More: Recursive Reasoning with Tiny Networks（小而精：用微型网络递归推...

2025.10.10 | 早期经验的Agent Learning；图文交错反思链跃升至24.9%

10 Oct 2025

Contributed by Lukas

本期的 14 篇论文如下：[00:16] 🌱 Agent Learning via Early Experience（基于早期经验的主体学习）[00:50] 🧠 MM-HELIX: Boosting ...

2025.10.09 | Ming-UniVision统一视觉词表；KV-Cache直连让大模型秒聊

09 Oct 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:21] 🔄 Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer（Ming-UniVis...

2025.10.08 | TaTToo用外挂代码干翻大模型；4B小模型32步逼近闭源巨头

08 Oct 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:24] 📊 TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning（TaTToo：面向表格推理...

2025.10.07 | 论文秒变演讲；Video-LMM后训练突破

07 Oct 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:21] 🎬 Paper2Video: Automatic Video Generation from Scientific Papers（论文自动生成学术演讲视频）[0...

2025.10.06 | 15B小模型追平DeepSeek-R1；渐进蒸馏128 token省八成算力

06 Oct 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:28] 🧠 Apriel-1.5-15b-Thinker（Apriel-1.5-15B-Thinker：以小博大实现前沿多模态推理的15B开源模型...

【周末特辑】10月第1周最火AI论文 | Transformer长出大脑的壳；LongLive把长视频做成直播

05 Oct 2025

Contributed by Lukas

本期的 5 篇论文如下：[00:43] TOP1(🔥323) | 🐣 The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain（幼...

2025.10.03 | LongCodeZip删得快准；迈向分钟级高质量视频生成

03 Oct 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:22] 🗜 LongCodeZip: Compress Long Context for Code Language Models（LongCodeZip：面向代码大模型的长上...

2025.10.02 | MCTS破局RLVR瓶颈；GEM开源智能体训练场

02 Oct 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:19] 🧠 DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree...

【月末特辑】9月最火AI论文 | 群体RL共享降本；SAPO让旧机也能训大模型

02 Oct 2025

Contributed by Lukas

本期的 10 篇论文如下：[00:29] TOP1(🔥640) | 🤝 Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing（共享...

2025.10.01 | 自对弈零标注训练；MCP代理深度评测

01 Oct 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:20] 🎮 Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play（Vision-Zero：基于策略化...

2025.09.30 | SLA稀疏注意力砍算力；StableToken抗噪不训模

30 Sep 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:22] ⚡ SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention（SLA：通过可微...

2025.09.29 | 实时长视频边聊边播；分位数基线稳控推理熵

29 Sep 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:20] 🎬 LongLive: Real-time Interactive Long Video Generation（LongLive：实时交互式长视频生成框架）...

【周末特辑】9月第5周最火AI论文 | Qwen3-Omni开源称王; 锁定视觉训解码，Baseer刷新阿文OCR；

27 Sep 2025

Contributed by Lukas

本期的 5 篇论文如下：[00:38] TOP1(🔥116) | 📜 Baseer: A Vision-Language Model for Arabic Document-to-Markdown OCR（Baseer：面向阿拉...

2025.09.26 | SciReasoner八项全能；MMR1模糊区炼出开源多模态

26 Sep 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:20] 🔬 SciReasoner: Laying the Scientific Reasoning Ground Across Disciplines（SciReasoner：跨学科夯实科学...

2025.09.25 | 视频模型零样本全能；隐式思维链省token提效

25 Sep 2025

Contributed by Lukas

本期的 10 篇论文如下：[00:22] 🎥 Video models are zero-shot learners and reasoners（视频模型是零样本学习者与推理者）[01:09...

2025.09.24 | 阿语OCR刷新指标；无标注RL涨分

24 Sep 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:24] 📜 Baseer: A Vision-Language Model for Arabic Document-to-Markdown OCR（Baseer：面向阿拉伯文档OCR的...

2025.09.23 | 少78条示范让AI飙73.5%；免掩膜视频插主体超Pika

23 Sep 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:21] 🚀 LIMI: Less is More for Agency（LIMI：少即是多，打造AI智能体）[00:55] 🎬 OmniInsert: Mask-Fr...

2025.09.22 | 有向图驱动代码生成；双通道视觉统一模型

22 Sep 2025

Contributed by Lukas

本期的 13 篇论文如下：[00:25] 🗺 RPG: A Repository Planning Graph for Unified and Scalable Codebase Generation（RPG：用于统一可扩展...

【周末特辑】9月第4周最火AI论文 | OmniWorld打造4D数据工厂；WebWeaver让AI边搜边写

20 Sep 2025

Contributed by Lukas

本期的 5 篇论文如下：[00:43] TOP1(🔥95) | 🌍 OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling（OmniWorld：面向...

2025.09.19 | 跨平台GUI模型刷榜；FlowRL分布匹配提推理

19 Sep 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:26] 🖥 ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data（ScaleCUA：基于跨平台数...

2025.09.18 | FP8压缩+翻译微调低成本炼阿语大模型；2B-8B小模型洗数据硬刚GPT-4o

18 Sep 2025

Contributed by Lukas

本期的 14 篇论文如下：[00:19] 🐪 Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale（Hala技术报...

2025.09.17 | WebWeaver框架提升可信长文报告；Agentic预训练扩展智能体系统

17 Sep 2025

Contributed by Lukas

本期的 11 篇论文如下：[00:27] 🔍 WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for Open-Ended Deep Research（WebWeaver：...

2025.09.16 | OmniWorld建4D数据底座；UI-S1半在线驯界面代理

16 Sep 2025

Contributed by Lukas

本期的 14 篇论文如下：[00:24] 🌍 OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling（OmniWorld：面向4D世界建模...

2025.09.15 | 数据集升级测互动；模型大小非长程瓶颈

15 Sep 2025

Contributed by Lukas

本期的 14 篇论文如下：[00:25] 📚 IntrEx: A Dataset for Modeling Engagement in Educational Conversations（IntrEx：面向教育对话中参...

【周末特辑】9月第3周最火AI论文 | 群智RL提速大模型；小VLA零预训练控机械

14 Sep 2025

Contributed by Lukas

本期的 5 篇论文如下：[00:40] TOP1(🔥455) | 🤝 Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing（共享...

2025.09.12 | HuMo多模态控人视频；SimpleVLA-RL强化升效

12 Sep 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:27] 🎭 HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning（HuMo：通过协同多模...

2025.09.11 | 强化学习提升推理能力；奖励缩放优化视觉生成

11 Sep 2025

Contributed by Lukas

本期的 10 篇论文如下：[00:24] 🧠 A Survey of Reinforcement Learning for Large Reasoning Models（大型推理模型的强化学习综述）...

2025.09.10 | 强化学习并行思维；视觉搜索推理扩展

10 Sep 2025

Contributed by Lukas

本期的 14 篇论文如下：[00:22] 🧠 Parallel-R1: Towards Parallel Thinking via Reinforcement Learning（Parallel-R1: 通过强化学习实现并...

2025.09.09 | REER提升推理性能；WebExplorer训练智能体

09 Sep 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:21] 💡 Reverse-Engineered Reasoning for Open-Ended Generation（面向开放式生成的逆向工程推理）[00:...

2025.09.08 | 语言模型幻觉源于预训练；大模型图形编程性能提升

08 Sep 2025

Contributed by Lukas

本期的 12 篇论文如下：[00:24] 🤔 Why Language Models Hallucinate（语言模型为何产生幻觉）[00:47] 🎨 Symbolic Graphics Programm...

【周末特辑】9月第2周最火AI论文 | LLM智能体RL综述；AI代码安全基准

06 Sep 2025

Contributed by Lukas

本期的 5 篇论文如下：[00:35] TOP1(🔥139) | 🤖 The Landscape of Agentic Reinforcement Learning for LLMs: A Survey（面向大语言模型的...

2025.09.05 | 大型语言模型语义理解弱；图像编辑模型提升几何估计

05 Sep 2025

Contributed by Lukas

本期的 13 篇论文如下：[00:22] 🤔 Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth（废话学：用深度解读无意义...

2025.09.04 | 机器人任务规划高效；数据推理能力提升

04 Sep 2025

Contributed by Lukas

本期的 5 篇论文如下：[00:24] 🤖 Robix: A Unified Model for Robot Interaction, Reasoning and Planning（Robix：一个用于机器人交互、...

2025.09.03 | 智能体RL提升大模型自主性；SimpleTIR解多轮工具推理

03 Sep 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:19] 🤖 The Landscape of Agentic Reinforcement Learning for LLMs: A Survey（面向大语言模型的智能体强化...

2025.09.02 | PVPO优化推理性能；T2R-bench暴露模型短板

02 Sep 2025

Contributed by Lukas

本期的 6 篇论文如下：[00:23] 🧠 PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning（PVPO：基于预估值策略优...

2025.09.01 | R-4B模型优化思考效率；EO-1提升机器人控制能力

01 Sep 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:24] 🧠 R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce ...

【月末特辑】8月最火AI论文 | 科学AI模型缩小性能差距；图像模型解决文本渲染与编辑

31 Aug 2025

Contributed by Lukas

本期的 10 篇论文如下：[00:30] TOP1(🔥242) | 🧪 Intern-S1: A Scientific Multimodal Foundation Model（Intern-S1：一个科学多模态基...

【周末特辑】8月第5周最火AI论文 | 多模态模型效率提升；自博弈策略提高多样性

30 Aug 2025

Contributed by Lukas

本期的 5 篇论文如下：[00:36] TOP1(🔥161) | 🚀 InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficie...

2025.08.29 | 稳定文本到图像生成；高效数学推理

29 Aug 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:24] ⚖ Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning（Pref-GRP...

2025.08.28 | 推理分解减幻觉；可解释性编码信息

28 Aug 2025

Contributed by Lukas

本期的 14 篇论文如下：[00:25] 🧠 Self-Rewarding Vision-Language Model via Reasoning Decomposition（通过推理分解的自奖励视觉语...

2025.08.27 | 物理模型评估显不足；树算法优化提效降本

27 Aug 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:23] 🔬 CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics（CMPhysBench：...

2025.08.26 | 提升模型推理效率；增强生成语义对齐

26 Aug 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:24] 🚀 InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency（InternVL3...

2025.08.25 | 无微调智能体高效学习；四足机器人长周期探索

25 Aug 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:23] 🚀 AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs（AgentFly：无需微调LLM即可微调LLM智能...

【周末特辑】8月第4周最火AI论文 | 视觉模型新突破；科学多模态领先

24 Aug 2025

Contributed by Lukas

本期的 5 篇论文如下：[00:39] TOP1(🔥172) | 🚀 DINOv3（DINOv3：视觉基础模型新里程碑）[01:39] TOP2(🔥170) | 🧪 Intern-S1: ...

2025.08.22 | 科学多模态缩小差距；GUI自动化解决挑战

23 Aug 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:22] 🧪 Intern-S1: A Scientific Multimodal Foundation Model（Intern-S1：一个科学多模态基础模型）[00:...

2025.08.21 | 金融大模型认知诊断；DuPO优化自验证

22 Aug 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:22] 🧠 From Scores to Skills: A Cognitive Diagnosis Framework for Evaluating Financial Large Language Models（从...

2025.08.20 | 智能体链提升效率；长视频3D重建优化

21 Aug 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:23] 🤖 Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL（智能体...

2025.08.19 | Ovis2.5提升多模态；ComoRAG优化长叙事推理

20 Aug 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:20] ✨ Ovis2.5 Technical Report（Ovis2.5 技术报告）[00:51] 🧠 ComoRAG: A Cognitive-Inspired Memory-Organiz...

2025.08.18 | 超越图像思考；自搜索强化

18 Aug 2025

Contributed by Lukas

本期的 13 篇论文如下：[00:19] 💡 Thyme: Think Beyond Images（Thyme：超越图像的思考）[00:48] 🧠 SSRL: Self-Search Reinforcement ...

【周末特辑】8月第3周最火AI论文 | GLM-4.5统一智能体推理编程；We-Math提升视觉数学推理

17 Aug 2025

Contributed by Lukas

本期的 5 篇论文如下：[00:32] TOP1(🔥139) | 🚀 GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models（GLM-4.5：智能体、推...

2025.08.15 | 数学推理手册提升模型能力；连续令牌生成图像模型

16 Aug 2025

Contributed by Lukas

本期的 12 篇论文如下：[00:23] 📚 We-Math 2.0: A Versatile MathBook System for Incentivizing Visual Mathematical Reasoning（We-Math 2.0：一...

2025.08.14 | 分子推理框架提升性能；视频身份控制轻量高效

14 Aug 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:17] 🧪 Mol-R1: Towards Explicit Long-CoT Reasoning in Molecule Discovery（Mol-R1：迈向分子发现中的显式...

2025.08.13 | 多模态AI突破；3D世界生成

13 Aug 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:22] 🤖 WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent（WebWatcher：突破视觉-语言...

2025.08.12 | ReasonRank提升段落排序推理；WideSearch评估智能体广域搜寻

13 Aug 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:18] 🧠 ReasonRank: Empowering Passage Ranking with Strong Reasoning Ability（ReasonRank：赋予段落排序强大...

2025.08.11 | GLM-4.5统一智能体推理编程；Voost高保真虚拟试穿试脱

12 Aug 2025

Contributed by Lukas

本期的 11 篇论文如下：[00:20] 🚀 GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models（GLM-4.5：智能体、推理与编程（...

【周末特辑】8月第2周最火AI论文 | CoT推理是幻象；Qwen-Image渲染领先

10 Aug 2025

Contributed by Lukas

本期的 5 篇论文如下：[00:33] TOP1(🔥174) | 🤔 Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens（LLM思维链推理...

2025.08.08 | 动态微调优推理;零数据自演进强推理

09 Aug 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:16] ✨ On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification（关于SFT泛...

2025.08.07 | VeriGUI提升代理能力；CoT推理实为模式匹配

07 Aug 2025

Contributed by Lukas

本期的 13 篇论文如下：[00:20] 🤖 VeriGUI: Verifiable Long-Chain GUI Dataset（VeriGUI：可验证的长链GUI数据集）[00:40] 🤔 Is Ch...

2025.08.06 | 高速推理扩散模型；紧凑视觉生成模型

07 Aug 2025

Contributed by Lukas

本期的 13 篇论文如下：[00:17] 🚀 Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference（种子扩散：一种具...

2025.08.05 | 图像文本渲染编辑创新；上下文检索提升故事理解

06 Aug 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:18] 🎨 Qwen-Image Technical Report（Qwen-Image技术报告）[00:39] 🔍 SitEmb-v1.5: Improved Context-Aware De...

2025.08.04 | 扩散语言模型变长去噪，高效省资源；PixNerd图像扩散，高效高质量。

05 Aug 2025

Contributed by Lukas

本期的 11 篇论文如下：[00:22] 🔄 Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models（超越固定长度：扩散大...

【月末特辑】7月最火AI论文 | GSPO稳训练；序列级裁剪降方差；上下文工程综述，动态拼装信息流

04 Aug 2025

Contributed by Lukas

本期的 10 篇论文如下：[00:30] TOP1(🔥257) | 🚀 Group Sequence Policy Optimization（组序列策略优化）[02:21] TOP2(🔥227) | 🧮 ...

【周末特辑】8月第1周最火AI论文 | ARPO用高熵分叉省预算；混元世界一句话生成可编辑3D场景

03 Aug 2025

Contributed by Lukas

本期的 5 篇论文如下：[00:32] TOP1(🔥114) | 🤖 Agentic Reinforced Policy Optimization（智能体强化策略优化）[02:17] TOP2(🔥94)...

2025.08.01 | Seed-Prover融合LLM解决IMO数学题；Phi-Ground提升GUI感知精度。

01 Aug 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:22] 🏆 Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving（Seed-Prover：自动化定理证明的...

2025.07.31 | ScreenCoder自动化UI转代码；Falcon-H1混合架构，提升长序列效率。

01 Aug 2025

Contributed by Lukas

本期的 9 篇论文如下：[00:22] 💻 ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents（S...

2025.07.30 | 混元世界从文字像素生成沉浸3D世界；X-Omni用强化学习提升图像生成质量。

31 Jul 2025

Contributed by Lukas

本期的 8 篇论文如下：[00:23] 🌍 HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels（混元...

2025.07.29 | ARPO提升LLM工具交互性能；ARC-Hunyuan-Video-7B深耕短视频理解。

30 Jul 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:23] 🤖 Agentic Reinforced Policy Optimization（智能体强化策略优化）[00:55] 🧠 ARC-Hunyuan-Video-7B: ...

2025.07.28 | GPTQ揭示为Babai算法，保障精度；TTD-DR以扩散模型生成高质量研究报告。

29 Jul 2025

Contributed by Lukas

本期的 5 篇论文如下：[00:25] 💡 The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm（LLM 量化的几何学：GPTQ 作...

【周末特辑】7月第4周最火AI论文 | GUI-G2：高斯奖励提升GUI定位；MiroMind-M1：开源数学推理LLM

26 Jul 2025

Contributed by Lukas

本期的 5 篇论文如下：[00:36] TOP1(🔥118) | 🎯 GUI-G$^2$: Gaussian Reward Modeling for GUI Grounding（GUI-G$^2$: 基于高斯奖励模型...

2025.07.25 | GSPO解决大模型训练崩溃；MUR提升LLM推理效率。

26 Jul 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:24] 🚀 Group Sequence Policy Optimization（组序列策略优化）[00:53] 🧠 MUR: Momentum Uncertainty guided...

2025.07.24 | MLLMs视觉感知仍不足；Yume模型可生成交互虚拟世界。

25 Jul 2025

Contributed by Lukas

本期的 9 篇论文如下：[00:23] 👁 Pixels, Patterns, but No Poetry: To See The World like Humans（像素、模式，却无诗意：像人类一...

2025.07.23 | TIM模型突破LLM上下文限制；Step-Audio 2提升多模态语音对话。

24 Jul 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:24] ♾ Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning（超越上下文限制：用于长程...

2025.07.22 | MiroMind-M1提升数学推理；GUI-G$^2$高斯奖励助GUI定位。

22 Jul 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:25] 🧮 MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Opt...

2025.07.21 | dLLM新型安全漏洞，现有防御不足；俄语语音合成，数据与标注是核心。

22 Jul 2025

Contributed by Lukas

本期的 10 篇论文如下：[00:20] 😈 The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs（隐藏在面具后的恶魔...

【周末特辑】7月第3周最火AI论文 | 上下文工程提升LLM性能；反射生成模型提高推理效率。

20 Jul 2025

Contributed by Lukas

本期的 5 篇论文如下：[00:39] TOP1(🔥116) | 🧮 A Survey of Context Engineering for Large Language Models（大型语言模型上下文工程...

2025.07.18 | 优化LLMs上下文；提升视觉语言模型效率

19 Jul 2025

Contributed by Lukas

本期的 15 篇论文如下：[00:27] 🧮 A Survey of Context Engineering for Large Language Models（大型语言模型上下文工程综述）[01:...

2025.07.17 | RAG提升LLM推理；PhysX生成物理3D资产

18 Jul 2025

Contributed by Lukas

本期的 13 篇论文如下：[00:26] 🧠 Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs（具身智能RAG与深...

Activity Overview

Episodes

2025.11.05 | 向量草图测代码；先画后想补视觉

2025.11.04 | 超稀疏MoE激活万亿参数；视觉模型看图胜GNN

2025.11.03 | OS-Sentinel实时守护手机操作安全；ThinkMorph让小模型边想边画

【月末特辑】10月最火AI论文 | 幼龙BDH稀疏可解释；迷你递归7兆碾压大模型

【周末特辑】11月第1周最火AI论文 | 循环模型省参强推理；Concerto 2D-3D自监督涨点

2025.10.31 | Emu3.5统一预测时空；扩散提示驱动机器人

2025.10.30 | 看图写码7B逆袭；视频思维RL破局

2025.10.29 | 通义深度研究报告；小模型折记忆胜671B巨模型

2025.10.28 | Point Transformer无标对齐长空间；代码递归统一粗细粒度

2025.10.27 | DeepAgent一步推理+ToolPO；视频即提示DiT秒控百种语义

【周末特辑】10月第4周最火AI论文 | 内部概率+投票剪尾，RPC省样本提精度

2025.10.24 | AdaSPEC挑40% token提速两成；AutoPage 10美分生成交互网页

2025.10.23 | 线性注意力显存降十倍；动态裁剪PPO稳提分

2025.10.22 | LightMem压缩记忆千倍提速12倍；闭环世界模型微调8万数据反超巨兽

2025.10.21 | 模型不懂光影折射；小模型也能写报告

2025.10.20 | RPC剪枝提速保准；OmniVinci小数据跨模态称王

【周末特辑】10月第3周最火AI论文 | 量化噪声变探索，单卡跑RL；冻结编码器放语义，DiT生成新纪录

2025.10.17 | AI眼镜预判式服务；视频生成补想象力

2025.10.16 | UniMoE一统语音音乐；注意力图点亮大模型推理

2025.10.15 | 像素级自监督ViT刷新生成基准；多智能体评测网文翻译新标尺

2025.10.14 | 量化误差变奖励，单卡训32B；面向多模态大模型的音视频评测基准

2025.10.13 | 桌面交互预训练解锁机器人潜能；统一模型赋予相机空间想象力

【周末特辑】10月第2周最火AI论文 | 递归小模型刷爆推理榜；未来经验点亮零奖励学习

2025.10.10 | 早期经验的Agent Learning；图文交错反思链跃升至24.9%

2025.10.09 | Ming-UniVision统一视觉词表；KV-Cache直连让大模型秒聊

2025.10.08 | TaTToo用外挂代码干翻大模型；4B小模型32步逼近闭源巨头

2025.10.07 | 论文秒变演讲；Video-LMM后训练突破

2025.10.06 | 15B小模型追平DeepSeek-R1；渐进蒸馏128 token省八成算力

【周末特辑】10月第1周最火AI论文 | Transformer长出大脑的壳；LongLive把长视频做成直播

2025.10.03 | LongCodeZip删得快准；迈向分钟级高质量视频生成

2025.10.02 | MCTS破局RLVR瓶颈；GEM开源智能体训练场

【月末特辑】9月最火AI论文 | 群体RL共享降本；SAPO让旧机也能训大模型

2025.10.01 | 自对弈零标注训练；MCP代理深度评测

2025.09.30 | SLA稀疏注意力砍算力；StableToken抗噪不训模

2025.09.29 | 实时长视频边聊边播；分位数基线稳控推理熵

【周末特辑】9月第5周最火AI论文 | Qwen3-Omni开源称王; 锁定视觉训解码，Baseer刷新阿文OCR；

2025.09.26 | SciReasoner八项全能；MMR1模糊区炼出开源多模态

2025.09.25 | 视频模型零样本全能；隐式思维链省token提效

2025.09.24 | 阿语OCR刷新指标；无标注RL涨分

2025.09.23 | 少78条示范让AI飙73.5%；免掩膜视频插主体超Pika

2025.09.22 | 有向图驱动代码生成；双通道视觉统一模型

【周末特辑】9月第4周最火AI论文 | OmniWorld打造4D数据工厂；WebWeaver让AI边搜边写

2025.09.19 | 跨平台GUI模型刷榜；FlowRL分布匹配提推理

2025.09.18 | FP8压缩+翻译微调低成本炼阿语大模型；2B-8B小模型洗数据硬刚GPT-4o

2025.09.17 | WebWeaver框架提升可信长文报告；Agentic预训练扩展智能体系统

2025.09.16 | OmniWorld建4D数据底座；UI-S1半在线驯界面代理

2025.09.15 | 数据集升级测互动；模型大小非长程瓶颈

【周末特辑】9月第3周最火AI论文 | 群智RL提速大模型；小VLA零预训练控机械

2025.09.12 | HuMo多模态控人视频；SimpleVLA-RL强化升效

2025.09.11 | 强化学习提升推理能力；奖励缩放优化视觉生成

2025.09.10 | 强化学习并行思维；视觉搜索推理扩展

2025.09.09 | REER提升推理性能；WebExplorer训练智能体

2025.09.08 | 语言模型幻觉源于预训练；大模型图形编程性能提升

【周末特辑】9月第2周最火AI论文 | LLM智能体RL综述；AI代码安全基准

2025.09.05 | 大型语言模型语义理解弱；图像编辑模型提升几何估计

2025.09.04 | 机器人任务规划高效；数据推理能力提升

2025.09.03 | 智能体RL提升大模型自主性；SimpleTIR解多轮工具推理

2025.09.02 | PVPO优化推理性能；T2R-bench暴露模型短板

2025.09.01 | R-4B模型优化思考效率；EO-1提升机器人控制能力

【月末特辑】8月最火AI论文 | 科学AI模型缩小性能差距；图像模型解决文本渲染与编辑

【周末特辑】8月第5周最火AI论文 | 多模态模型效率提升；自博弈策略提高多样性

2025.08.29 | 稳定文本到图像生成；高效数学推理

2025.08.28 | 推理分解减幻觉；可解释性编码信息

2025.08.27 | 物理模型评估显不足；树算法优化提效降本

2025.08.26 | 提升模型推理效率；增强生成语义对齐

2025.08.25 | 无微调智能体高效学习；四足机器人长周期探索

【周末特辑】8月第4周最火AI论文 | 视觉模型新突破；科学多模态领先

2025.08.22 | 科学多模态缩小差距；GUI自动化解决挑战

2025.08.21 | 金融大模型认知诊断；DuPO优化自验证

2025.08.20 | 智能体链提升效率；长视频3D重建优化

2025.08.19 | Ovis2.5提升多模态；ComoRAG优化长叙事推理

2025.08.18 | 超越图像思考；自搜索强化

【周末特辑】8月第3周最火AI论文 | GLM-4.5统一智能体推理编程；We-Math提升视觉数学推理

2025.08.15 | 数学推理手册提升模型能力；连续令牌生成图像模型

2025.08.14 | 分子推理框架提升性能；视频身份控制轻量高效

2025.08.13 | 多模态AI突破；3D世界生成

2025.08.12 | ReasonRank提升段落排序推理；WideSearch评估智能体广域搜寻