Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing
Podcast Image

Two Voice Devs

Episode 231 - DeepSeek AI: Beating the Odds with Older Tech

17 Mar 2025

Description

DeepSeek AI is turning heads, achieving incredible results with older hardware and clever techniques! Join Allen and Roya as they unravel the secrets behind DeepSeek's success, from their unique attention mechanisms to their cost-effective AI training strategies. But is all as it seems? They also tackle the controversies surrounding DeepSeek, including accusations of data plagiarism and concerns about censorship. This episode is a must-listen for anyone interested in the future of AI!Timestamps:0:00 Why DeepSeek is creating buzz1:06 Unveiling DeepSeek's Two Key Models2:59 Understanding the Power of Attention4:12 What is the latent space?5:55 The nail salon example: Multi-Head Attention Explained10:02 The doctor/cook/police analogy: Mixture of Experts Explained13:51 AI vs. AI: DeepSeek's Cost-Saving Training Method16:01 Hallucinations: Is AI Training Too Risky?20:59 What are Reasoning Models and Why Do They Matter?26:53 LLMs are pattern systems explained28:22 How DeepSeek is using old GPUs32:53 OpenAI vs. DeepSeek: The Data Plagiarism Debate39:32 Political Correctness: The Challenge of Guardrails in AI42:16 Why Open Source is Crucial for the Future of AI43:20 Run DeepSeek locally on OLAMA43:56 Final ThoughtsHashtags: #DeepSeek #AI #LLM #Innovation #TechNews #Podcast #ArtificialIntelligence #MachineLearning #Ethics #OpenAI #DataPrivacy #Censorship #TwoVoiceDevs #DeepLearning #ReasoningModel #AIRevolution #ChinaTech

Audio
Featured in this Episode

No persons identified in this episode.

Transcription

This episode hasn't been transcribed yet

Help us prioritize this episode for transcription by upvoting it.

0 upvotes
🗳️ Sign in to Upvote

Popular episodes get transcribed faster

Comments

There are no comments yet.

Please log in to write the first comment.