
AI Podcast

A Deep Dive into DeepSeek-V3.2-Exp: How Does Sparse Attention Improve Long-Context Efficiency?

29 Sep 2025

Description

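This episode of AI电台FM (AI Radio FM) takes an in-depth look at DeepSeek-V3.2-Exp, the experimental sparse-attention model recently released by DeepSeek-AI. We unpack its core technique, DeepSeek Sparse Attention (DSA), and how its lightning indexer and fine-grained token selection mechanism substantially improve training and inference efficiency in long-context scenarios while maintaining model performance. From architecture design to training strategy to hands-on evaluation, expert weedge offers a comprehensive and lively walkthrough.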
