Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing
Podcast Image

AI Podcast

DeepSeek-R1:通过强化学习激励大型语言模型的推理能力

24 Jan 2025

Description

本播客深入探讨DeepSeek-R1模型,该模型通过大规模强化学习显著提升了大型语言模型的推理能力。我们将分析DeepSeek-R1-Zero和DeepSeek-R1的训练过程、性能表现,以及它们在不同任务上的卓越表现。同时,我们还将讨论如何通过知识蒸馏技术,使更小的模型也能具备强大的推理能力。

Audio
Featured in this Episode

No persons identified in this episode.

Transcription

This episode hasn't been transcribed yet

Help us prioritize this episode for transcription by upvoting it.

0 upvotes
🗳️ Sign in to Upvote

Popular episodes get transcribed faster

Comments

There are no comments yet.

Please log in to write the first comment.