HuggingFace 每日AI论文速递
Episodes
2025.11.05 | 向量草图测代码;先画后想补视觉
05 Nov 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:21] 🖼 VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation(VCode:以SVG为符号视...
2025.11.04 | 超稀疏MoE激活万亿参数;视觉模型看图胜GNN
04 Nov 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:23] 🧠 Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation(全激活赋能...
2025.11.03 | OS-Sentinel实时守护手机操作安全;ThinkMorph让小模型边想边画
03 Nov 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:21] 🛡 OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows(OS-S...
【月末特辑】10月最火AI论文 | 幼龙BDH稀疏可解释;迷你递归7兆碾压大模型
02 Nov 2025
Contributed by Lukas
本期的 10 篇论文如下:[00:30] TOP1(🔥522) | 🐣 The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain(幼...
【周末特辑】11月第1周最火AI论文 | 循环模型省参强推理;Concerto 2D-3D自监督涨点
01 Nov 2025
Contributed by Lukas
本期的 5 篇论文如下:[00:35] TOP1(🔥174) | 🔄 Scaling Latent Reasoning via Looped Language Models(通过循环语言模型扩展潜在推...
2025.10.31 | Emu3.5统一预测时空;扩散提示驱动机器人
31 Oct 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:26] 🌍 Emu3.5: Native Multimodal Models are World Learners(Emu3.5:原生多模态世界模型让AI看懂并预...
2025.10.30 | 看图写码7B逆袭;视频思维RL破局
30 Oct 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:22] 👁 JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence(JanusCoder:面向...
2025.10.29 | 通义深度研究报告;小模型折记忆胜671B巨模型
29 Oct 2025
Contributed by Lukas
本期的 10 篇论文如下:[00:23] 🔍 Tongyi DeepResearch Technical Report(通义深度研究报告:面向长程深度信息检索任务的智...
2025.10.28 | Point Transformer无标对齐长空间;代码递归统一粗细粒度
28 Oct 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:23] 🎼 Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations(Concerto:2D-3D联合自...
2025.10.27 | DeepAgent一步推理+ToolPO;视频即提示DiT秒控百种语义
27 Oct 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:27] 🧠 DeepAgent: A General Reasoning Agent with Scalable Toolsets(DeepAgent:具备可扩展工具集的通用...
【周末特辑】10月第4周最火AI论文 | 内部概率+投票剪尾,RPC省样本提精度
26 Oct 2025
Contributed by Lukas
本期的 5 篇论文如下:[00:29] TOP1(🔥135) | 🧠 A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning...
2025.10.24 | AdaSPEC挑40% token提速两成;AutoPage 10美分生成交互网页
24 Oct 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:23] 🎯 AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders(AdaSPEC:面向高效推测...
2025.10.23 | 线性注意力显存降十倍;动态裁剪PPO稳提分
23 Oct 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:19] 🧠 Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning(每一种注意力都...
2025.10.22 | LightMem压缩记忆千倍提速12倍;闭环世界模型微调8万数据反超巨兽
22 Oct 2025
Contributed by Lukas
本期的 14 篇论文如下:[00:19] 🧠 LightMem: Lightweight and Efficient Memory-Augmented Generation(LightMem:轻量高效的记忆增强生...
2025.10.21 | 模型不懂光影折射;小模型也能写报告
21 Oct 2025
Contributed by Lukas
本期的 13 篇论文如下:[00:21] 🪞 PICABench: How Far Are We from Physically Realistic Image Editing?(PICABench:我们离物理真实的图...
2025.10.20 | RPC剪枝提速保准;OmniVinci小数据跨模态称王
20 Oct 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:20] 🧠 A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning(大模型推...
【周末特辑】10月第3周最火AI论文 | 量化噪声变探索,单卡跑RL;冻结编码器放语义,DiT生成新纪录
18 Oct 2025
Contributed by Lukas
本期的 5 篇论文如下:[00:40] TOP1(🔥154) | 🚀 QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs(QeRL:超...
2025.10.17 | AI眼镜预判式服务;视频生成补想象力
17 Oct 2025
Contributed by Lukas
本期的 11 篇论文如下:[00:25] 👓 AI for Service: Proactive Assistance with AI Glasses(AI服务:AI眼镜的主动式协助)[01:06] 🎬...
2025.10.16 | UniMoE一统语音音乐;注意力图点亮大模型推理
16 Oct 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:21] 🎧 UniMoE-Audio: Unified Speech and Music Generation with Dynamic-Capacity MoE(UniMoE-Audio:基于动态容...
2025.10.15 | 像素级自监督ViT刷新生成基准;多智能体评测网文翻译新标尺
15 Oct 2025
Contributed by Lukas
本期的 14 篇论文如下:[00:20] 🖼 Advancing End-to-End Pixel Space Generative Modeling via Self-supervised Pre-training(通过自监督预...
2025.10.14 | 量化误差变奖励,单卡训32B;面向多模态大模型的音视频评测基准
14 Oct 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:23] 🚀 QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs(QeRL:超越效率——...
2025.10.13 | 桌面交互预训练解锁机器人潜能;统一模型赋予相机空间想象力
13 Oct 2025
Contributed by Lukas
本期的 14 篇论文如下:[00:20] 🖥 D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI(D2E:利用桌面数...
【周末特辑】10月第2周最火AI论文 | 递归小模型刷爆推理榜;未来经验点亮零奖励学习
12 Oct 2025
Contributed by Lukas
本期的 5 篇论文如下:[00:33] TOP1(🔥300) | 🧠 Less is More: Recursive Reasoning with Tiny Networks(小而精:用微型网络递归推...
2025.10.10 | 早期经验的Agent Learning;图文交错反思链跃升至24.9%
10 Oct 2025
Contributed by Lukas
本期的 14 篇论文如下:[00:16] 🌱 Agent Learning via Early Experience(基于早期经验的主体学习)[00:50] 🧠 MM-HELIX: Boosting ...
2025.10.09 | Ming-UniVision统一视觉词表;KV-Cache直连让大模型秒聊
09 Oct 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:21] 🔄 Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer(Ming-UniVis...
2025.10.08 | TaTToo用外挂代码干翻大模型;4B小模型32步逼近闭源巨头
08 Oct 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:24] 📊 TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning(TaTToo:面向表格推理...
2025.10.07 | 论文秒变演讲;Video-LMM后训练突破
07 Oct 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:21] 🎬 Paper2Video: Automatic Video Generation from Scientific Papers(论文自动生成学术演讲视频)[0...
2025.10.06 | 15B小模型追平DeepSeek-R1;渐进蒸馏128 token省八成算力
06 Oct 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:28] 🧠 Apriel-1.5-15b-Thinker(Apriel-1.5-15B-Thinker:以小博大实现前沿多模态推理的15B开源模型...
【周末特辑】10月第1周最火AI论文 | Transformer长出大脑的壳;LongLive把长视频做成直播
05 Oct 2025
Contributed by Lukas
本期的 5 篇论文如下:[00:43] TOP1(🔥323) | 🐣 The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain(幼...
2025.10.03 | LongCodeZip删得快准;迈向分钟级高质量视频生成
03 Oct 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:22] 🗜 LongCodeZip: Compress Long Context for Code Language Models(LongCodeZip:面向代码大模型的长上...
2025.10.02 | MCTS破局RLVR瓶颈;GEM开源智能体训练场
02 Oct 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:19] 🧠 DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree...
【月末特辑】9月最火AI论文 | 群体RL共享降本;SAPO让旧机也能训大模型
02 Oct 2025
Contributed by Lukas
本期的 10 篇论文如下:[00:29] TOP1(🔥640) | 🤝 Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing(共享...
2025.10.01 | 自对弈零标注训练;MCP代理深度评测
01 Oct 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:20] 🎮 Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play(Vision-Zero:基于策略化...
2025.09.30 | SLA稀疏注意力砍算力;StableToken抗噪不训模
30 Sep 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:22] ⚡ SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention(SLA:通过可微...
2025.09.29 | 实时长视频边聊边播;分位数基线稳控推理熵
29 Sep 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:20] 🎬 LongLive: Real-time Interactive Long Video Generation(LongLive:实时交互式长视频生成框架)...
【周末特辑】9月第5周最火AI论文 | Qwen3-Omni开源称王; 锁定视觉训解码,Baseer刷新阿文OCR;
27 Sep 2025
Contributed by Lukas
本期的 5 篇论文如下:[00:38] TOP1(🔥116) | 📜 Baseer: A Vision-Language Model for Arabic Document-to-Markdown OCR(Baseer:面向阿拉...
2025.09.26 | SciReasoner八项全能;MMR1模糊区炼出开源多模态
26 Sep 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:20] 🔬 SciReasoner: Laying the Scientific Reasoning Ground Across Disciplines(SciReasoner:跨学科夯实科学...
2025.09.25 | 视频模型零样本全能;隐式思维链省token提效
25 Sep 2025
Contributed by Lukas
本期的 10 篇论文如下:[00:22] 🎥 Video models are zero-shot learners and reasoners(视频模型是零样本学习者与推理者)[01:09...
2025.09.24 | 阿语OCR刷新指标;无标注RL涨分
24 Sep 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:24] 📜 Baseer: A Vision-Language Model for Arabic Document-to-Markdown OCR(Baseer:面向阿拉伯文档OCR的...
2025.09.23 | 少78条示范让AI飙73.5%;免掩膜视频插主体超Pika
23 Sep 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:21] 🚀 LIMI: Less is More for Agency(LIMI:少即是多,打造AI智能体)[00:55] 🎬 OmniInsert: Mask-Fr...
2025.09.22 | 有向图驱动代码生成;双通道视觉统一模型
22 Sep 2025
Contributed by Lukas
本期的 13 篇论文如下:[00:25] 🗺 RPG: A Repository Planning Graph for Unified and Scalable Codebase Generation(RPG:用于统一可扩展...
【周末特辑】9月第4周最火AI论文 | OmniWorld打造4D数据工厂;WebWeaver让AI边搜边写
20 Sep 2025
Contributed by Lukas
本期的 5 篇论文如下:[00:43] TOP1(🔥95) | 🌍 OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling(OmniWorld:面向...
2025.09.19 | 跨平台GUI模型刷榜;FlowRL分布匹配提推理
19 Sep 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:26] 🖥 ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data(ScaleCUA:基于跨平台数...
2025.09.18 | FP8压缩+翻译微调低成本炼阿语大模型;2B-8B小模型洗数据硬刚GPT-4o
18 Sep 2025
Contributed by Lukas
本期的 14 篇论文如下:[00:19] 🐪 Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale(Hala技术报...
2025.09.17 | WebWeaver框架提升可信长文报告;Agentic预训练扩展智能体系统
17 Sep 2025
Contributed by Lukas
本期的 11 篇论文如下:[00:27] 🔍 WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for Open-Ended Deep Research(WebWeaver:...
2025.09.16 | OmniWorld建4D数据底座;UI-S1半在线驯界面代理
16 Sep 2025
Contributed by Lukas
本期的 14 篇论文如下:[00:24] 🌍 OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling(OmniWorld:面向4D世界建模...
2025.09.15 | 数据集升级测互动;模型大小非长程瓶颈
15 Sep 2025
Contributed by Lukas
本期的 14 篇论文如下:[00:25] 📚 IntrEx: A Dataset for Modeling Engagement in Educational Conversations(IntrEx:面向教育对话中参...
【周末特辑】9月第3周最火AI论文 | 群智RL提速大模型;小VLA零预训练控机械
14 Sep 2025
Contributed by Lukas
本期的 5 篇论文如下:[00:40] TOP1(🔥455) | 🤝 Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing(共享...
2025.09.12 | HuMo多模态控人视频;SimpleVLA-RL强化升效
12 Sep 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:27] 🎭 HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning(HuMo:通过协同多模...
2025.09.11 | 强化学习提升推理能力;奖励缩放优化视觉生成
11 Sep 2025
Contributed by Lukas
本期的 10 篇论文如下:[00:24] 🧠 A Survey of Reinforcement Learning for Large Reasoning Models(大型推理模型的强化学习综述)...
2025.09.10 | 强化学习并行思维;视觉搜索推理扩展
10 Sep 2025
Contributed by Lukas
本期的 14 篇论文如下:[00:22] 🧠 Parallel-R1: Towards Parallel Thinking via Reinforcement Learning(Parallel-R1: 通过强化学习实现并...
2025.09.09 | REER提升推理性能;WebExplorer训练智能体
09 Sep 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:21] 💡 Reverse-Engineered Reasoning for Open-Ended Generation(面向开放式生成的逆向工程推理)[00:...
2025.09.08 | 语言模型幻觉源于预训练;大模型图形编程性能提升
08 Sep 2025
Contributed by Lukas
本期的 12 篇论文如下:[00:24] 🤔 Why Language Models Hallucinate(语言模型为何产生幻觉)[00:47] 🎨 Symbolic Graphics Programm...
【周末特辑】9月第2周最火AI论文 | LLM智能体RL综述;AI代码安全基准
06 Sep 2025
Contributed by Lukas
本期的 5 篇论文如下:[00:35] TOP1(🔥139) | 🤖 The Landscape of Agentic Reinforcement Learning for LLMs: A Survey(面向大语言模型的...
2025.09.05 | 大型语言模型语义理解弱;图像编辑模型提升几何估计
05 Sep 2025
Contributed by Lukas
本期的 13 篇论文如下:[00:22] 🤔 Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth(废话学:用深度解读无意义...
2025.09.04 | 机器人任务规划高效;数据推理能力提升
04 Sep 2025
Contributed by Lukas
本期的 5 篇论文如下:[00:24] 🤖 Robix: A Unified Model for Robot Interaction, Reasoning and Planning(Robix:一个用于机器人交互、...
2025.09.03 | 智能体RL提升大模型自主性;SimpleTIR解多轮工具推理
03 Sep 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:19] 🤖 The Landscape of Agentic Reinforcement Learning for LLMs: A Survey(面向大语言模型的智能体强化...
2025.09.02 | PVPO优化推理性能;T2R-bench暴露模型短板
02 Sep 2025
Contributed by Lukas
本期的 6 篇论文如下:[00:23] 🧠 PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning(PVPO:基于预估值策略优...
2025.09.01 | R-4B模型优化思考效率;EO-1提升机器人控制能力
01 Sep 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:24] 🧠 R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce ...
【月末特辑】8月最火AI论文 | 科学AI模型缩小性能差距;图像模型解决文本渲染与编辑
31 Aug 2025
Contributed by Lukas
本期的 10 篇论文如下:[00:30] TOP1(🔥242) | 🧪 Intern-S1: A Scientific Multimodal Foundation Model(Intern-S1:一个科学多模态基...
【周末特辑】8月第5周最火AI论文 | 多模态模型效率提升;自博弈策略提高多样性
30 Aug 2025
Contributed by Lukas
本期的 5 篇论文如下:[00:36] TOP1(🔥161) | 🚀 InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficie...
2025.08.29 | 稳定文本到图像生成;高效数学推理
29 Aug 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:24] ⚖ Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning(Pref-GRP...
2025.08.28 | 推理分解减幻觉;可解释性编码信息
28 Aug 2025
Contributed by Lukas
本期的 14 篇论文如下:[00:25] 🧠 Self-Rewarding Vision-Language Model via Reasoning Decomposition(通过推理分解的自奖励视觉语...
2025.08.27 | 物理模型评估显不足;树算法优化提效降本
27 Aug 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:23] 🔬 CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics(CMPhysBench:...
2025.08.26 | 提升模型推理效率;增强生成语义对齐
26 Aug 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:24] 🚀 InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency(InternVL3...
2025.08.25 | 无微调智能体高效学习;四足机器人长周期探索
25 Aug 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:23] 🚀 AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs(AgentFly:无需微调LLM即可微调LLM智能...
【周末特辑】8月第4周最火AI论文 | 视觉模型新突破;科学多模态领先
24 Aug 2025
Contributed by Lukas
本期的 5 篇论文如下:[00:39] TOP1(🔥172) | 🚀 DINOv3(DINOv3:视觉基础模型新里程碑)[01:39] TOP2(🔥170) | 🧪 Intern-S1: ...
2025.08.22 | 科学多模态缩小差距;GUI自动化解决挑战
23 Aug 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:22] 🧪 Intern-S1: A Scientific Multimodal Foundation Model(Intern-S1:一个科学多模态基础模型)[00:...
2025.08.21 | 金融大模型认知诊断;DuPO优化自验证
22 Aug 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:22] 🧠 From Scores to Skills: A Cognitive Diagnosis Framework for Evaluating Financial Large Language Models(从...
2025.08.20 | 智能体链提升效率;长视频3D重建优化
21 Aug 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:23] 🤖 Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL(智能体...
2025.08.19 | Ovis2.5提升多模态;ComoRAG优化长叙事推理
20 Aug 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:20] ✨ Ovis2.5 Technical Report(Ovis2.5 技术报告)[00:51] 🧠 ComoRAG: A Cognitive-Inspired Memory-Organiz...
2025.08.18 | 超越图像思考;自搜索强化
18 Aug 2025
Contributed by Lukas
本期的 13 篇论文如下:[00:19] 💡 Thyme: Think Beyond Images(Thyme:超越图像的思考)[00:48] 🧠 SSRL: Self-Search Reinforcement ...
【周末特辑】8月第3周最火AI论文 | GLM-4.5统一智能体推理编程;We-Math提升视觉数学推理
17 Aug 2025
Contributed by Lukas
本期的 5 篇论文如下:[00:32] TOP1(🔥139) | 🚀 GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models(GLM-4.5:智能体、推...
2025.08.15 | 数学推理手册提升模型能力;连续令牌生成图像模型
16 Aug 2025
Contributed by Lukas
本期的 12 篇论文如下:[00:23] 📚 We-Math 2.0: A Versatile MathBook System for Incentivizing Visual Mathematical Reasoning(We-Math 2.0:一...
2025.08.14 | 分子推理框架提升性能;视频身份控制轻量高效
14 Aug 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:17] 🧪 Mol-R1: Towards Explicit Long-CoT Reasoning in Molecule Discovery(Mol-R1:迈向分子发现中的显式...
2025.08.13 | 多模态AI突破;3D世界生成
13 Aug 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:22] 🤖 WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent(WebWatcher:突破视觉-语言...
2025.08.12 | ReasonRank提升段落排序推理;WideSearch评估智能体广域搜寻
13 Aug 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:18] 🧠 ReasonRank: Empowering Passage Ranking with Strong Reasoning Ability(ReasonRank:赋予段落排序强大...
2025.08.11 | GLM-4.5统一智能体推理编程;Voost高保真虚拟试穿试脱
12 Aug 2025
Contributed by Lukas
本期的 11 篇论文如下:[00:20] 🚀 GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models(GLM-4.5:智能体、推理与编程(...
【周末特辑】8月第2周最火AI论文 | CoT推理是幻象;Qwen-Image渲染领先
10 Aug 2025
Contributed by Lukas
本期的 5 篇论文如下:[00:33] TOP1(🔥174) | 🤔 Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens(LLM思维链推理...
2025.08.08 | 动态微调优推理;零数据自演进强推理
09 Aug 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:16] ✨ On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification(关于SFT泛...
2025.08.07 | VeriGUI提升代理能力;CoT推理实为模式匹配
07 Aug 2025
Contributed by Lukas
本期的 13 篇论文如下:[00:20] 🤖 VeriGUI: Verifiable Long-Chain GUI Dataset(VeriGUI:可验证的长链GUI数据集)[00:40] 🤔 Is Ch...
2025.08.06 | 高速推理扩散模型;紧凑视觉生成模型
07 Aug 2025
Contributed by Lukas
本期的 13 篇论文如下:[00:17] 🚀 Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference(种子扩散:一种具...
2025.08.05 | 图像文本渲染编辑创新;上下文检索提升故事理解
06 Aug 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:18] 🎨 Qwen-Image Technical Report(Qwen-Image技术报告)[00:39] 🔍 SitEmb-v1.5: Improved Context-Aware De...
2025.08.04 | 扩散语言模型变长去噪,高效省资源;PixNerd图像扩散,高效高质量。
05 Aug 2025
Contributed by Lukas
本期的 11 篇论文如下:[00:22] 🔄 Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models(超越固定长度:扩散大...
【月末特辑】7月最火AI论文 | GSPO稳训练;序列级裁剪降方差;上下文工程综述,动态拼装信息流
04 Aug 2025
Contributed by Lukas
本期的 10 篇论文如下:[00:30] TOP1(🔥257) | 🚀 Group Sequence Policy Optimization(组序列策略优化)[02:21] TOP2(🔥227) | 🧮 ...
【周末特辑】8月第1周最火AI论文 | ARPO用高熵分叉省预算;混元世界一句话生成可编辑3D场景
03 Aug 2025
Contributed by Lukas
本期的 5 篇论文如下:[00:32] TOP1(🔥114) | 🤖 Agentic Reinforced Policy Optimization(智能体强化策略优化)[02:17] TOP2(🔥94)...
2025.08.01 | Seed-Prover融合LLM解决IMO数学题;Phi-Ground提升GUI感知精度。
01 Aug 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:22] 🏆 Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving(Seed-Prover:自动化定理证明的...
2025.07.31 | ScreenCoder自动化UI转代码;Falcon-H1混合架构,提升长序列效率。
01 Aug 2025
Contributed by Lukas
本期的 9 篇论文如下:[00:22] 💻 ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents(S...
2025.07.30 | 混元世界从文字像素生成沉浸3D世界;X-Omni用强化学习提升图像生成质量。
31 Jul 2025
Contributed by Lukas
本期的 8 篇论文如下:[00:23] 🌍 HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels(混元...
2025.07.29 | ARPO提升LLM工具交互性能;ARC-Hunyuan-Video-7B深耕短视频理解。
30 Jul 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:23] 🤖 Agentic Reinforced Policy Optimization(智能体强化策略优化)[00:55] 🧠 ARC-Hunyuan-Video-7B: ...
2025.07.28 | GPTQ揭示为Babai算法,保障精度;TTD-DR以扩散模型生成高质量研究报告。
29 Jul 2025
Contributed by Lukas
本期的 5 篇论文如下:[00:25] 💡 The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm(LLM 量化的几何学:GPTQ 作...
【周末特辑】7月第4周最火AI论文 | GUI-G2:高斯奖励提升GUI定位;MiroMind-M1:开源数学推理LLM
26 Jul 2025
Contributed by Lukas
本期的 5 篇论文如下:[00:36] TOP1(🔥118) | 🎯 GUI-G$^2$: Gaussian Reward Modeling for GUI Grounding(GUI-G$^2$: 基于高斯奖励模型...
2025.07.25 | GSPO解决大模型训练崩溃;MUR提升LLM推理效率。
26 Jul 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:24] 🚀 Group Sequence Policy Optimization(组序列策略优化)[00:53] 🧠 MUR: Momentum Uncertainty guided...
2025.07.24 | MLLMs视觉感知仍不足;Yume模型可生成交互虚拟世界。
25 Jul 2025
Contributed by Lukas
本期的 9 篇论文如下:[00:23] 👁 Pixels, Patterns, but No Poetry: To See The World like Humans(像素、模式,却无诗意:像人类一...
2025.07.23 | TIM模型突破LLM上下文限制;Step-Audio 2提升多模态语音对话。
24 Jul 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:24] ♾ Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning(超越上下文限制:用于长程...
2025.07.22 | MiroMind-M1提升数学推理;GUI-G$^2$高斯奖励助GUI定位。
22 Jul 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:25] 🧮 MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Opt...
2025.07.21 | dLLM新型安全漏洞,现有防御不足;俄语语音合成,数据与标注是核心。
22 Jul 2025
Contributed by Lukas
本期的 10 篇论文如下:[00:20] 😈 The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs(隐藏在面具后的恶魔...
【周末特辑】7月第3周最火AI论文 | 上下文工程提升LLM性能;反射生成模型提高推理效率。
20 Jul 2025
Contributed by Lukas
本期的 5 篇论文如下:[00:39] TOP1(🔥116) | 🧮 A Survey of Context Engineering for Large Language Models(大型语言模型上下文工程...
2025.07.18 | 优化LLMs上下文;提升视觉语言模型效率
19 Jul 2025
Contributed by Lukas
本期的 15 篇论文如下:[00:27] 🧮 A Survey of Context Engineering for Large Language Models(大型语言模型上下文工程综述)[01:...
2025.07.17 | RAG提升LLM推理;PhysX生成物理3D资产
18 Jul 2025
Contributed by Lukas
本期的 13 篇论文如下:[00:26] 🧠 Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs(具身智能RAG与深...