Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing
Podcast Image

AI Podcast

AI Radio FM - Machete: Hopper GPU 优化 GEMM 内核

19 Nov 2024

Description

深度探讨Neural Magic的Machete内核,专为NVIDIA Hopper GPU上的混合输入量化而优化,显著提升大型语言模型推理性能。

Audio
Featured in this Episode

No persons identified in this episode.

Transcription

This episode hasn't been transcribed yet

Help us prioritize this episode for transcription by upvoting it.

0 upvotes
🗳️ Sign in to Upvote

Popular episodes get transcribed faster

Comments

There are no comments yet.

Please log in to write the first comment.