
HuggingFace 每日AI论文速递 (HuggingFace Daily AI Paper Digest)

Technology Science

Episodes

Showing 201-300 of 630

2025.11.05 | Vector sketches test coding; draw first, think later to shore up vision

05 Nov 2025

Contributed by Lukas

The 15 papers in this episode are: [00:21] 🖼 VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation (VCode: with SVG as symbolic...

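2025.11.04 | Ultra-sparse MoE activates a trillion parameters; vision models reading images beat GNNs

04 Nov 2025

Contributed by Lukas

The 15 papers in this episode are: [00:23] 🧠 Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation (every activation boosted...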

2025.11.03 | OS-Sentinel guards mobile operation safety in real time; ThinkMorph lets small models draw while they think

03 Nov 2025

Contributed by Lukas

The 15 papers in this episode are: [00:21] 🛡 OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows (OS-S...

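[Month-End Special] October's hottest AI papers | The Dragon Hatchling BDH is sparse and interpretable; a tiny 7M recursive model crushes large models

02 Nov 2025

Contributed by Lukas

The 10 papers in this episode are: [00:30] TOP1(🔥522) | 🐣 The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain (young...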

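[Weekend Special] Hottest AI papers, week 1 of November | Looped models save parameters and strengthen reasoning; Concerto's 2D-3D self-supervision boosts scores

01 Nov 2025

Contributed by Lukas

The 5 papers in this episode are: [00:35] TOP1(🔥174) | 🔄 Scaling Latent Reasoning via Looped Language Models (scaling latent... via looped language models...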

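2025.10.31 | Emu3.5 unifies prediction across space and time; diffusion prompts drive robots

31 Oct 2025

Contributed by Lukas

The 15 papers in this episode are: [00:26] 🌍 Emu3.5: Native Multimodal Models are World Learners (Emu3.5: a native multimodal world model that lets AI understand and...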

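2025.10.30 | A 7B model that writes code from images stages an upset; RL for video thinking breaks the deadlock

30 Oct 2025

Contributed by Lukas

The 15 papers in this episode are: [00:22] 👁 JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence (JanusCoder: towards...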

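2025.10.29 | Tongyi DeepResearch report; a small model with folded memory beats a 671B giant

29 Oct 2025

Contributed by Lukas

The 10 papers in this episode are: [00:23] 🔍 Tongyi DeepResearch Technical Report (Tongyi DeepResearch report: for long-horizon deep information retrieval tasks...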

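2025.10.28 | Point Transformer grows spatial representations through label-free alignment; recursive code unifies coarse and fine granularity

28 Oct 2025

Contributed by Lukas

The 15 papers in this episode are: [00:23] 🎼 Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations (Concerto: joint 2D-3D self-...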

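2025.10.27 | DeepAgent: single-pass reasoning plus ToolPO; video-as-prompt DiT controls a hundred kinds of semantics in seconds

27 Oct 2025

Contributed by Lukas

The 15 papers in this episode are: [00:27] 🧠 DeepAgent: A General Reasoning Agent with Scalable Toolsets (DeepAgent: a general... with scalable toolsets...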

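[Weekend Special] Hottest AI papers, week 4 of October | Internal probability plus voting with tail pruning: RPC saves samples and improves accuracy

26 Oct 2025

Contributed by Lukas

The 5 papers in this episode are: [00:29] TOP1(🔥135) | 🧠 A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning...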

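2025.10.24 | AdaSPEC selects 40% of tokens for a 20% speedup; AutoPage generates interactive web pages for 10 cents

24 Oct 2025

Contributed by Lukas

The 15 papers in this episode are: [00:23] 🎯 AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders (AdaSPEC: for efficient speculative...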

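2025.10.23 | Linear attention cuts GPU memory tenfold; dynamically clipped PPO steadily raises scores

23 Oct 2025

Contributed by Lukas

The 15 papers in this episode are: [00:19] 🧠 Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning (every kind of attention...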

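2025.10.22 | LightMem compresses memory a thousandfold and speeds up 12x; a closed-loop world model fine-tuned on 80K samples overtakes the giants

22 Oct 2025

Contributed by Lukas

The 14 papers in this episode are: [00:19] 🧠 LightMem: Lightweight and Efficient Memory-Augmented Generation (LightMem: lightweight and efficient memory-augmented...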

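2025.10.21 | Models don't understand light, shadow, and refraction; small models can write reports too

21 Oct 2025

Contributed by Lukas

The 13 papers in this episode are: [00:21] 🪞 PICABench: How Far Are We from Physically Realistic Image Editing? (PICABench: how far are we from physically realistic...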

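2025.10.20 | RPC pruning speeds things up while preserving accuracy; OmniVinci leads cross-modal tasks with little data

20 Oct 2025

Contributed by Lukas

The 15 papers in this episode are: [00:20] 🧠 A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning (large-model...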

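[Weekend Special] Hottest AI papers, week 3 of October | Quantization noise becomes exploration, RL runs on a single GPU; a frozen encoder supplies semantics, DiT sets a new generation record

18 Oct 2025

Contributed by Lukas

The 5 papers in this episode are: [00:40] TOP1(🔥154) | 🚀 QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs (QeRL: beyond...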

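2025.10.17 | AI glasses offer anticipatory service; video generation adds imagination

17 Oct 2025

Contributed by Lukas

The 11 papers in this episode are: [00:25] 👓 AI for Service: Proactive Assistance with AI Glasses (AI for service: proactive assistance with AI glasses) [01:06] 🎬...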

2025.10.16 | UniMoE unifies speech and music; attention maps illuminate LLM reasoning

16 Oct 2025

Contributed by Lukas

The 15 papers in this episode are: [00:21] 🎧 UniMoE-Audio: Unified Speech and Music Generation with Dynamic-Capacity MoE (UniMoE-Audio: based on dynamic...

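2025.10.15 | Pixel-level self-supervised ViT sets new generation benchmarks; a new multi-agent yardstick for evaluating web-novel translation

15 Oct 2025

Contributed by Lukas

The 14 papers in this episode are: [00:20] 🖼 Advancing End-to-End Pixel Space Generative Modeling via Self-supervised Pre-training (via self-supervised pre-...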

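2025.10.14 | Quantization error becomes reward, training 32B on a single GPU; an audio-video evaluation benchmark for multimodal LLMs

14 Oct 2025

Contributed by Lukas

The 15 papers in this episode are: [00:23] 🚀 QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs (QeRL: beyond efficiency...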

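2025.10.13 | Desktop-interaction pretraining unlocks robot potential; a unified model gives cameras spatial imagination

13 Oct 2025

Contributed by Lukas

The 14 papers in this episode are: [00:20] 🖥 D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI (D2E: using desktop...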

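[Weekend Special] Hottest AI papers, week 2 of October | Small recursive models top the reasoning leaderboards; future experience lights up reward-free learning

12 Oct 2025

Contributed by Lukas

The 5 papers in this episode are: [00:33] TOP1(🔥300) | 🧠 Less is More: Recursive Reasoning with Tiny Networks (small but refined: recursive... with tiny networks...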

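2025.10.10 | Agent learning from early experience; interleaved image-text reflective chains jump to 24.9%

10 Oct 2025

Contributed by Lukas

The 14 papers in this episode are: [00:16] 🌱 Agent Learning via Early Experience (agent learning based on early experience) [00:50] 🧠 MM-HELIX: Boosting ...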

2025.10.09 | Ming-UniVision unifies the visual vocabulary; direct KV-cache connections let LLMs converse instantly

09 Oct 2025

Contributed by Lukas

The 15 papers in this episode are: [00:21] 🔄 Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer (Ming-UniVis...

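2025.10.08 | TaTToo beats large models with external code tools; a 4B small model approaches closed-source giants in 32 steps

08 Oct 2025

Contributed by Lukas

The 15 papers in this episode are: [00:24] 📊 TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning (TaTToo: for tabular reasoning...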

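2025.10.07 | Papers become talks in seconds; a breakthrough in Video-LMM post-training

07 Oct 2025

Contributed by Lukas

The 15 papers in this episode are: [00:21] 🎬 Paper2Video: Automatic Video Generation from Scientific Papers (automatically generating academic presentation videos from papers) [0...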

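2025.10.06 | A 15B small model matches DeepSeek-R1; progressive distillation to 128 tokens saves 80% of compute

06 Oct 2025

Contributed by Lukas

The 15 papers in this episode are: [00:28] 🧠 Apriel-1.5-15b-Thinker (Apriel-1.5-15B-Thinker: a 15B open-source model that punches above its weight to achieve frontier multimodal reasoning...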

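[Weekend Special] Hottest AI papers, week 1 of October | The Transformer grows a brain-like shell; LongLive turns long video into a live stream

05 Oct 2025

Contributed by Lukas

The 5 papers in this episode are: [00:43] TOP1(🔥323) | 🐣 The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain (young...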

2025.10.03 | LongCodeZip prunes quickly and accurately; toward minute-long high-quality video generation

03 Oct 2025

Contributed by Lukas

The 15 papers in this episode are: [00:22] 🗜 LongCodeZip: Compress Long Context for Code Language Models (LongCodeZip: for code LLMs, long...

2025.10.02 | MCTS breaks the RLVR bottleneck; GEM, an open-source training ground for agents

02 Oct 2025

Contributed by Lukas

The 15 papers in this episode are: [00:19] 🧠 DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree...

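[Month-End Special] September's hottest AI papers | Collective RL sharing cuts costs; SAPO lets older machines train large models too

02 Oct 2025

Contributed by Lukas

The 10 papers in this episode are: [00:29] TOP1(🔥640) | 🤝 Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing (sharing...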

2025.10.01 | Self-play training with zero annotation; in-depth evaluation of MCP agents

01 Oct 2025

Contributed by Lukas

The 15 papers in this episode are: [00:20] 🎮 Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play (Vision-Zero: based on strategic...

2025.09.30 | SLA sparse attention slashes compute; StableToken resists noise without retraining the model

30 Sep 2025

Contributed by Lukas

The 15 papers in this episode are: [00:22] ⚡ SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention (SLA: via...

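2025.09.29 | Real-time long video that plays as you chat; quantile baselines keep reasoning entropy under control

29 Sep 2025

Contributed by Lukas

The 15 papers in this episode are: [00:20] 🎬 LongLive: Real-time Interactive Long Video Generation (LongLive: a real-time interactive long video generation framework)...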

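[Weekend Special] Hottest AI papers, week 5 of September | Qwen3-Omni takes the open-source crown; freezing vision and training the decoder, Baseer sets a new mark for Arabic OCR

27 Sep 2025

Contributed by Lukas

The 5 papers in this episode are: [00:38] TOP1(🔥116) | 📜 Baseer: A Vision-Language Model for Arabic Document-to-Markdown OCR (Baseer: for...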

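2025.09.26 | SciReasoner, an all-rounder across eight areas; MMR1 forges an open-source multimodal model from the ambiguous zone

26 Sep 2025

Contributed by Lukas

The 15 papers in this episode are: [00:20] 🔬 SciReasoner: Laying the Scientific Reasoning Ground Across Disciplines (SciReasoner: laying a solid cross-disciplinary foundation for scientific...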

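2025.09.25 | Video models are zero-shot all-rounders; implicit chain-of-thought saves tokens and improves efficiency

25 Sep 2025

Contributed by Lukas

The 10 papers in this episode are: [00:22] 🎥 Video models are zero-shot learners and reasoners (video models are zero-shot learners and reasoners) [01:09...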

2025.09.24 | Arabic OCR sets new records; label-free RL raises scores

24 Sep 2025

Contributed by Lukas

The 15 papers in this episode are: [00:24] 📜 Baseer: A Vision-Language Model for Arabic Document-to-Markdown OCR (Baseer: for Arabic document OCR...

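2025.09.23 | As few as 78 demonstrations push AI to 73.5%; mask-free video subject insertion surpasses Pika

23 Sep 2025

Contributed by Lukas

The 15 papers in this episode are: [00:21] 🚀 LIMI: Less is More for Agency (LIMI: less is more, building AI agents) [00:55] 🎬 OmniInsert: Mask-Fr...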

2025.09.22 | Directed graphs drive code generation; a dual-channel unified vision model

22 Sep 2025

Contributed by Lukas

The 13 papers in this episode are: [00:25] 🗺 RPG: A Repository Planning Graph for Unified and Scalable Codebase Generation (RPG: for unified and scalable...

[Weekend Special] Hottest AI papers, week 4 of September | OmniWorld builds a 4D data factory; WebWeaver lets AI write as it searches

20 Sep 2025

Contributed by Lukas

The 5 papers in this episode are: [00:43] TOP1(🔥95) | 🌍 OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling (OmniWorld: for...

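2025.09.19 | Cross-platform GUI models top the leaderboards; FlowRL improves reasoning through distribution matching

19 Sep 2025

Contributed by Lukas

The 15 papers in this episode are: [00:26] 🖥 ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data (ScaleCUA: based on cross-platform...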

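2025.09.18 | FP8 compression plus translation fine-tuning builds Arabic LLMs at low cost; 2B-8B small models clean data and go head-to-head with GPT-4o

18 Sep 2025

Contributed by Lukas

The 14 papers in this episode are: [00:19] 🐪 Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale (Hala technical...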

2025.09.17 | The WebWeaver framework improves trustworthy long-form reports; agentic pretraining scales agent systems

17 Sep 2025

Contributed by Lukas

The 11 papers in this episode are: [00:27] 🔍 WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for Open-Ended Deep Research (WebWeaver:...

2025.09.16 | OmniWorld builds a 4D data foundation; UI-S1 trains interface agents semi-online

16 Sep 2025

Contributed by Lukas

The 14 papers in this episode are: [00:24] 🌍 OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling (OmniWorld: for 4D world modeling...

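2025.09.15 | An upgraded dataset measures engagement; model size is not the long-horizon bottleneck

15 Sep 2025

Contributed by Lukas

The 14 papers in this episode are: [00:25] 📚 IntrEx: A Dataset for Modeling Engagement in Educational Conversations (IntrEx: for educational conversations...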

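[Weekend Special] Hottest AI papers, week 3 of September | Collective-intelligence RL speeds up large models; a small VLA controls machinery with zero pretraining

14 Sep 2025

Contributed by Lukas

The 5 papers in this episode are: [00:40] TOP1(🔥455) | 🤝 Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing (sharing...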

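2025.09.12 | HuMo controls human video with multiple modalities; SimpleVLA-RL boosts performance through reinforcement

12 Sep 2025

Contributed by Lukas

The 15 papers in this episode are: [00:27] 🎭 HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning (HuMo: via collaborative multi-...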

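2025.09.11 | Reinforcement learning improves reasoning ability; reward scaling optimizes visual generation

11 Sep 2025

Contributed by Lukas

The 10 papers in this episode are: [00:24] 🧠 A Survey of Reinforcement Learning for Large Reasoning Models (a survey of reinforcement learning for large reasoning models)...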

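2025.09.10 | Parallel thinking through reinforcement learning; scaling visual search reasoning

10 Sep 2025

Contributed by Lukas

The 14 papers in this episode are: [00:22] 🧠 Parallel-R1: Towards Parallel Thinking via Reinforcement Learning (Parallel-R1: through reinforcement learning...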

2025.09.09 | REER improves reasoning performance; WebExplorer trains agents

09 Sep 2025

Contributed by Lukas

The 15 papers in this episode are: [00:21] 💡 Reverse-Engineered Reasoning for Open-Ended Generation (reverse-engineered reasoning for open-ended generation) [00:...

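2025.09.08 | Language model hallucinations originate in pretraining; LLM graphics programming performance improves

08 Sep 2025

Contributed by Lukas

The 12 papers in this episode are: [00:24] 🤔 Why Language Models Hallucinate (why language models hallucinate) [00:47] 🎨 Symbolic Graphics Programm...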

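[Weekend Special] Hottest AI papers, week 2 of September | A survey of agentic RL for LLMs; an AI code security benchmark

06 Sep 2025

Contributed by Lukas

The 5 papers in this episode are: [00:35] TOP1(🔥139) | 🤖 The Landscape of Agentic Reinforcement Learning for LLMs: A Survey (for large language models...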

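2025.09.05 | LLMs are weak at semantic understanding; image editing models improve geometry estimation

05 Sep 2025

Contributed by Lukas

The 13 papers in this episode are: [00:22] 🤔 Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth (Drivel-ology: interpreting nonsense with depth...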

2025.09.04 | Efficient robot task planning; improved data reasoning ability

04 Sep 2025

Contributed by Lukas

The 5 papers in this episode are: [00:24] 🤖 Robix: A Unified Model for Robot Interaction, Reasoning and Planning (Robix: a... for robot interaction,...

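2025.09.03 | Agentic RL improves LLM autonomy; SimpleTIR tackles multi-turn tool-integrated reasoning

03 Sep 2025

Contributed by Lukas

The 15 papers in this episode are: [00:19] 🤖 The Landscape of Agentic Reinforcement Learning for LLMs: A Survey (agentic reinforcement... for large language models...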

2025.09.02 | PVPO optimizes reasoning performance; T2R-bench exposes model weaknesses

02 Sep 2025

Contributed by Lukas

The 6 papers in this episode are: [00:23] 🧠 PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning (PVPO: pre-estimated value-based policy...

2025.09.01 | The R-4B model optimizes thinking efficiency; EO-1 improves robot control

01 Sep 2025

Contributed by Lukas

The 15 papers in this episode are: [00:24] 🧠 R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce ...

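[Month-End Special] August's hottest AI papers | A scientific AI model narrows the performance gap; an image model tackles text rendering and editing

31 Aug 2025

Contributed by Lukas

The 10 papers in this episode are: [00:30] TOP1(🔥242) | 🧪 Intern-S1: A Scientific Multimodal Foundation Model (Intern-S1: a scientific multimodal...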

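[Weekend Special] Hottest AI papers, week 5 of August | Multimodal model efficiency improves; self-play strategies increase diversity

30 Aug 2025

Contributed by Lukas

The 5 papers in this episode are: [00:36] TOP1(🔥161) | 🚀 InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficie...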

2025.08.29 | Stable text-to-image generation; efficient mathematical reasoning

29 Aug 2025

Contributed by Lukas

The 15 papers in this episode are: [00:24] ⚖ Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning (Pref-GRP...

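2025.08.28 | Reasoning decomposition reduces hallucinations; interpretable information encoding

28 Aug 2025

Contributed by Lukas

The 14 papers in this episode are: [00:25] 🧠 Self-Rewarding Vision-Language Model via Reasoning Decomposition (self-rewarding vision-language... via reasoning decomposition...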

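2025.08.27 | Physics evaluation reveals model shortcomings; tree-based algorithm optimization raises efficiency and cuts costs

27 Aug 2025

Contributed by Lukas

The 15 papers in this episode are: [00:23] 🔬 CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics (CMPhysBench:...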

2025.08.26 | Improving model inference efficiency; strengthening semantic alignment in generation

26 Aug 2025

Contributed by Lukas

The 15 papers in this episode are: [00:24] 🚀 InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency (InternVL3...

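2025.08.25 | Agents learn efficiently without fine-tuning; long-horizon exploration for quadruped robots

25 Aug 2025

Contributed by Lukas

The 15 papers in this episode are: [00:23] 🚀 AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs (AgentFly: fine-tuning LLM agents without fine-tuning LLMs...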

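[Weekend Special] Hottest AI papers, week 4 of August | A new breakthrough in vision models; scientific multimodal takes the lead

24 Aug 2025

Contributed by Lukas

The 5 papers in this episode are: [00:39] TOP1(🔥172) | 🚀 DINOv3 (DINOv3: a new milestone for vision foundation models) [01:39] TOP2(🔥170) | 🧪 Intern-S1: ...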

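2025.08.22 | Scientific multimodal narrows the gap; GUI automation tackles challenges

23 Aug 2025

Contributed by Lukas

The 15 papers in this episode are: [00:22] 🧪 Intern-S1: A Scientific Multimodal Foundation Model (Intern-S1: a scientific multimodal foundation model) [00:...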

2025.08.21 | Cognitive diagnosis for financial LLMs; DuPO optimizes self-verification

22 Aug 2025

Contributed by Lukas

The 15 papers in this episode are: [00:22] 🧠 From Scores to Skills: A Cognitive Diagnosis Framework for Evaluating Financial Large Language Models (from...

2025.08.20 | Chain-of-Agents improves efficiency; optimized 3D reconstruction from long videos

21 Aug 2025

Contributed by Lukas

The 15 papers in this episode are: [00:23] 🤖 Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL (agent...

2025.08.19 | Ovis2.5 advances multimodal capability; ComoRAG optimizes long-narrative reasoning

20 Aug 2025

Contributed by Lukas

The 15 papers in this episode are: [00:20] ✨ Ovis2.5 Technical Report (Ovis2.5 technical report) [00:51] 🧠 ComoRAG: A Cognitive-Inspired Memory-Organiz...

2025.08.18 | Thinking beyond images; self-search reinforcement

18 Aug 2025

Contributed by Lukas

The 13 papers in this episode are: [00:19] 💡 Thyme: Think Beyond Images (Thyme: thinking beyond images) [00:48] 🧠 SSRL: Self-Search Reinforcement ...

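[Weekend Special] Hottest AI papers, week 3 of August | GLM-4.5 unifies agentic, reasoning, and coding abilities; We-Math improves visual mathematical reasoning

17 Aug 2025

Contributed by Lukas

The 5 papers in this episode are: [00:32] TOP1(🔥139) | 🚀 GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models (GLM-4.5: agentic,...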

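2025.08.15 | A math reasoning handbook improves model ability; an image model that generates with continuous tokens

16 Aug 2025

Contributed by Lukas

The 12 papers in this episode are: [00:23] 📚 We-Math 2.0: A Versatile MathBook System for Incentivizing Visual Mathematical Reasoning (We-Math 2.0: a...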

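2025.08.14 | A molecular reasoning framework improves performance; lightweight, efficient identity control for video

14 Aug 2025

Contributed by Lukas

The 15 papers in this episode are: [00:17] 🧪 Mol-R1: Towards Explicit Long-CoT Reasoning in Molecule Discovery (Mol-R1: towards explicit... in molecule discovery...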

2025.08.13 | A multimodal AI breakthrough; 3D world generation

13 Aug 2025

Contributed by Lukas

The 15 papers in this episode are: [00:22] 🤖 WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent (WebWatcher: breaking through vision-language...

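2025.08.12 | ReasonRank improves reasoning in passage ranking; WideSearch evaluates agents on broad search

13 Aug 2025

Contributed by Lukas

The 15 papers in this episode are: [00:18] 🧠 ReasonRank: Empowering Passage Ranking with Strong Reasoning Ability (ReasonRank: giving passage ranking strong...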

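2025.08.11 | GLM-4.5 unifies agentic, reasoning, and coding abilities; Voost delivers high-fidelity virtual try-on and try-off

12 Aug 2025

Contributed by Lukas

The 11 papers in this episode are: [00:20] 🚀 GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models (GLM-4.5: agentic, reasoning, and coding (...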

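[Weekend Special] Hottest AI papers, week 2 of August | CoT reasoning is a mirage; Qwen-Image leads in rendering

10 Aug 2025

Contributed by Lukas

The 5 papers in this episode are: [00:33] TOP1(🔥174) | 🤔 Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens (LLM chain-of-thought reasoning...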

2025.08.08 | Dynamic fine-tuning improves reasoning; zero-data self-evolution strengthens reasoning

09 Aug 2025

Contributed by Lukas

The 15 papers in this episode are: [00:16] ✨ On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification (on SFT...

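2025.08.07 | VeriGUI improves agent capabilities; CoT reasoning is really pattern matching

07 Aug 2025

Contributed by Lukas

The 13 papers in this episode are: [00:20] 🤖 VeriGUI: Verifiable Long-Chain GUI Dataset (VeriGUI: a verifiable long-chain GUI dataset) [00:40] 🤔 Is Ch...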

2025.08.06 | A diffusion model with high-speed inference; a compact visual generation model

07 Aug 2025

Contributed by Lukas

The 13 papers in this episode are: [00:17] 🚀 Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference (Seed Diffusion: a...

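2025.08.05 | Innovation in image text rendering and editing; context-aware retrieval improves story understanding

06 Aug 2025

Contributed by Lukas

The 15 papers in this episode are: [00:18] 🎨 Qwen-Image Technical Report (Qwen-Image technical report) [00:39] 🔍 SitEmb-v1.5: Improved Context-Aware De...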

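2025.08.04 | Variable-length denoising for diffusion language models, efficient and resource-saving; PixNerd image diffusion, efficient and high-quality

05 Aug 2025

Contributed by Lukas

The 11 papers in this episode are: [00:22] 🔄 Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models (beyond fixed length: diffusion large...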

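[Month-End Special] July's hottest AI papers | GSPO stabilizes training; sequence-level clipping reduces variance; a context engineering survey on dynamically assembling information flows

04 Aug 2025

Contributed by Lukas

The 10 papers in this episode are: [00:30] TOP1(🔥257) | 🚀 Group Sequence Policy Optimization (group sequence policy optimization) [02:21] TOP2(🔥227) | 🧮 ...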

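[Weekend Special] Hottest AI papers, week 1 of August | ARPO saves budget by branching at high entropy; HunyuanWorld generates editable 3D scenes from a single sentence

03 Aug 2025

Contributed by Lukas

The 5 papers in this episode are: [00:32] TOP1(🔥114) | 🤖 Agentic Reinforced Policy Optimization (agentic reinforced policy optimization) [02:17] TOP2(🔥94)...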

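2025.08.01 | Seed-Prover integrates LLMs to solve IMO math problems; Phi-Ground improves GUI perception accuracy

01 Aug 2025

Contributed by Lukas

The 15 papers in this episode are: [00:22] 🏆 Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving (Seed-Prover: ... for automated theorem proving...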

2025.07.31 | ScreenCoder automates UI-to-code conversion; Falcon-H1's hybrid architecture improves long-sequence efficiency

01 Aug 2025

Contributed by Lukas

The 9 papers in this episode are: [00:22] 💻 ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents (S...

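2025.07.30 | HunyuanWorld generates immersive 3D worlds from words or pixels; X-Omni uses reinforcement learning to improve image generation quality

31 Jul 2025

Contributed by Lukas

The 8 papers in this episode are: [00:23] 🌍 HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels (Hunyuan...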

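2025.07.29 | ARPO improves LLM tool-interaction performance; ARC-Hunyuan-Video-7B digs deep into short-video understanding

30 Jul 2025

Contributed by Lukas

The 15 papers in this episode are: [00:23] 🤖 Agentic Reinforced Policy Optimization (agentic reinforced policy optimization) [00:55] 🧠 ARC-Hunyuan-Video-7B: ...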

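2025.07.28 | GPTQ is revealed to be Babai's algorithm, with accuracy guarantees; TTD-DR generates high-quality research reports with a diffusion model

29 Jul 2025

Contributed by Lukas

The 5 papers in this episode are: [00:25] 💡 The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm (the geometry of LLM quantization: GPTQ as...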

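[Weekend Special] Hottest AI papers, week 4 of July | GUI-G2: Gaussian rewards improve GUI grounding; MiroMind-M1: an open-source math reasoning LLM

26 Jul 2025

Contributed by Lukas

The 5 papers in this episode are: [00:36] TOP1(🔥118) | 🎯 GUI-G$^2$: Gaussian Reward Modeling for GUI Grounding (GUI-G$^2$: based on Gaussian reward modeling...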

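2025.07.25 | GSPO fixes training collapse in large models; MUR improves LLM reasoning efficiency

26 Jul 2025

Contributed by Lukas

The 15 papers in this episode are: [00:24] 🚀 Group Sequence Policy Optimization (group sequence policy optimization) [00:53] 🧠 MUR: Momentum Uncertainty guided...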

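2025.07.24 | MLLMs still fall short in visual perception; the Yume model can generate interactive virtual worlds

25 Jul 2025

Contributed by Lukas

The 9 papers in this episode are: [00:23] 👁 Pixels, Patterns, but No Poetry: To See The World like Humans (pixels, patterns, but no poetry: like humans...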

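2025.07.23 | The TIM model breaks through LLM context limits; Step-Audio 2 improves multimodal spoken dialogue

24 Jul 2025

Contributed by Lukas

The 15 papers in this episode are: [00:24] ♾ Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning (beyond context limits: for long-horizon...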

2025.07.22 | MiroMind-M1 improves mathematical reasoning; GUI-G$^2$ Gaussian rewards aid GUI grounding

22 Jul 2025

Contributed by Lukas

The 15 papers in this episode are: [00:25] 🧮 MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Opt...

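2025.07.21 | A new kind of safety vulnerability in dLLMs, with existing defenses falling short; for Russian speech synthesis, data and annotation are the core

22 Jul 2025

Contributed by Lukas

The 10 papers in this episode are: [00:20] 😈 The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs (the devil hidden behind the mask...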

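[Weekend Special] Hottest AI papers, week 3 of July | Context engineering improves LLM performance; reflective generative models improve reasoning efficiency

20 Jul 2025

Contributed by Lukas

The 5 papers in this episode are: [00:39] TOP1(🔥116) | 🧮 A Survey of Context Engineering for Large Language Models (context engineering for large language models...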

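2025.07.18 | Optimizing context for LLMs; improving vision-language model efficiency

19 Jul 2025

Contributed by Lukas

The 15 papers in this episode are: [00:27] 🧮 A Survey of Context Engineering for Large Language Models (a survey of context engineering for large language models) [01:...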

2025.07.17 | RAG improves LLM reasoning; PhysX generates physical 3D assets

18 Jul 2025

Contributed by Lukas

The 13 papers in this episode are: [00:26] 🧠 Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs (agentic RAG with deep...
