Embark on a wild race with Gemma as we explore the exciting (and sometimes slow) world of running Google's open-source large language model! We'll test drive different methods, from the leisurely pace of Ollama on a local machine to the speedier Groq platform. Join us as we compare these approaches, analyzing performance, costs, and ease of use for developers working with LLMs. Will the tortoise or the hare win this race? Learn more: * Model card: https://console.cloud.google.com/vertex-ai/publishers/google/model-garden/335 * Ollama: https://ollama.com/ * LangChain.js with Ollama: https://js.langchain.com/docs/integrations/llms/ollama * Groq: https://groq.com/ Timestamps: 0:00:00 - Introduction 0:03:05 - Getting to Know Gemma: Exploring the Model Card 0:05:30 - Vertex AI Endpoint: Fast Deployment, But at What Cost? 0:13:40 - Ollama: The Tortoise of Local LLM Hosting 0:17:40 - LangChain Integration: Adding Functionality to Ollama 0:21:44 - Groq: The Hare of LLM Hardware 0:26:06 - Comparing Approaches: Speed vs. Cost vs. Control 0:27:35 - Future of Open LLMs and Google Cloud Next #GemmaSprint This project was supported, in part, by Cloud Credits from Google
No persons identified in this episode.
This episode hasn't been transcribed yet
Help us prioritize this episode for transcription by upvoting it.
Popular episodes get transcribed faster
Other recent transcribed episodes
Transcribed and ready to explore now
Trump $82 Million Bond Spree, Brazil Tariffs 'Too High,' More
16 Nov 2025
Bloomberg News Now
Ex-Fed Gov Resigned After Rules Violations, Trump Buys $82 Mil of Bonds, More
16 Nov 2025
Bloomberg News Now
THIS TRUMP INTERVIEW WAS INSANE!
16 Nov 2025
HasanAbi
Epstein Emails and Trump's Alleged Involvement
15 Nov 2025
Conspiracy Theories Exploring The Unseen
New Epstein Emails Directly Implicate Trump - H3 Show #211
15 Nov 2025
H3 Podcast
Trump Humiliates Himself on FOX as They Call Him Out
15 Nov 2025
IHIP News