Eye On A.I.
#239 Tuhin Srivatsa: How Baseten is Disrupting AI Deployment & Scaling in 2025
26 Feb 2025
This episode is sponsored by Thuma. Thuma is a modern design company that specializes in timeless home essentials that are mindfully made with premium materials and intentional details. To get $100 towards your first bed purchase, go to http://thuma.co/eyeonai ————————————————————————————————————————— AI deployment is broken—can it be fixed? In this episode, Tuhin Srivatsa, CEO & Co-Founder of Baseten, reveals how his company is DISRUPTING AI infrastructure, making it easier, faster, and more cost-effective to deploy and scale AI models in production. As enterprises increasingly turn to open-source AI models and grapple with the high costs and complexity of scaling, Baseten offers a game-changing solution that eliminates bottlenecks and simplifies the process. Discover how Baseten is taking on AWS SageMaker, OpenAI, and cloud-based AI deployment platforms to reshape the future of AI model deployment. What You'll Learn in This Episode: Why AI deployment & scaling is one of the biggest challenges in 2025 How Baseten enables enterprises to run AI models faster & more efficiently The shift from closed-source to open-source AI models—and why it matters The hidden costs of AI inference & how to optimize for performance Why most AI models fail in production and how to prevent it The future of AI infrastructure: What comes next for scalable AI Whether you're a machine learning engineer, AI researcher, startup founder, or enterprise leader, this episode is packed with actionable insights to help you scale AI models without the headaches. Don't miss this conversation on the next era of AI deployment! #AI #ArtificialIntelligence #MachineLearning #Baseten #AIDeployment #AIScaling #Inference #MLInfrastructure #TechPodcast Stay Updated: Craig Smith Twitter: https://twitter.com/craigss Eye on A.I. Twitter: https://twitter.com/EyeOn_AI ————————————————————————————————————————— (00:00) Tuhin Srivatsa's Journey in AI & Baseten (01:50) What is AI Infrastructure & Why It Matters (03:30) How Baseten Optimizes AI Model Deployment (05:19) Why Most AI Deployments Fail (And How to Fix It) (09:17) The Future of Open-Source AI Models in Enterprise (11:01) How Baseten Automates AI Scaling & Inference (14:12) Why AI Developers Struggle with Cloud-Based AI Tools (18:47) The Real Cost of AI Inference (And How to Reduce It) (20:44) Why AI Scaling is the Biggest Challenge in 2025 (26:55) Can AI Run on Non-NVIDIA Chips? (The Hardware Debate) (31:23) The Future of AI Model Deployment & Inference (37:05) How AI Agents & Reasoning Models Are Changing the Game (40:39) The Truth About AI Hype vs. Reality (45:04) How to Get Started with Baseten (45:48) The Future of AI Infrastructure
No persons identified in this episode.
This episode hasn't been transcribed yet
Help us prioritize this episode for transcription by upvoting it.
Popular episodes get transcribed faster
Other recent transcribed episodes
Transcribed and ready to explore now
Before the Crisis: How You and Your Relatives Can Prepare for Financial Caregiving
06 Dec 2025
Motley Fool Money
OpenAI's Code Red, Sacks vs New York Times, New Poverty Line?
06 Dec 2025
All-In with Chamath, Jason, Sacks & Friedberg
OpenAI's Code Red, Sacks vs New York Times, New Poverty Line?
06 Dec 2025
All-In with Chamath, Jason, Sacks & Friedberg
Anthropic Finds AI Answers with Interviewer
05 Dec 2025
The Daily AI Show
#2423 - John Cena
05 Dec 2025
The Joe Rogan Experience
Warehouse to wellness: Bob Mauch on modern pharmaceutical distribution
05 Dec 2025
McKinsey on Healthcare