Everyday AI Podcast – An AI and ChatGPT Podcast
EP 628: What’s the best LLM for your team? 7 Steps to evaluate and create ROI for AI
09 Oct 2025
How can you measure ROI on GenAI for your team? 🤔Internal evaluations and intentionality. We've helped thousands of orgs put LLMs to work and ACTUALLY save time. On today's show, we're dishing the 7 steps you need to follow. What’s the best LLM for your team? 7 Steps to evaluate and create ROI for AI -- An Everyday AI chat with Jordan WilsonNewsletter: Sign up for our free daily newsletterMore on this Episode: Episode PageJoin the discussion on LinkedIn: Thoughts on this? Join the convo on LinkedIn and connect with other AI leaders.Upcoming Episodes: Check out the upcoming Everyday AI Livestream lineupWebsite: YourEverydayAI.comEmail The Show: [email protected] with Jordan on LinkedInTopics Covered in This Episode:Choosing the Right Large Language ModelEvaluating LLMs for Business ROIFront-End AI Operating Systems ExplainedCommon Traps in AI Model EvaluationPublic Benchmarks for LLM EvaluationSeven-Step LLM Evaluation FrameworkMeasuring Pre-GenAI Human BaselinesBuilding Realistic AI Test DatasetsCalculating ROI for GenAI ImplementationMonthly Retesting and AI Model UpdatesTimestamps:00:00 Choosing the Right AI Model07:02 Adapting Workflows for AI Integration10:58 "Gemini's Versatile Modes Overview"14:30 Avoiding AI Shiny Object Syndrome15:36 AI Evaluation for Reliability and Improvement20:36 "Data Testing Guide Essentials"25:15 Realistic and Messy Data Essentials26:06 "Building Effective AI Workspaces"31:08 AI Evaluation and ROI Calculation34:11 Human Oversight in AI Testing35:52 Evaluating GenAI Use Cases39:00 "NotebookLM: AI-Powered Idea Organizer"Keywords:Large Language Model, LLM, generative AI, AI operating system, front end AI models, AI evaluation, model ROI, model evaluation steps, AI benchmarks, scientific benchmarks, API connection, enterprise AI, ChatGPT, Claude, Gemini, Copilot, team AI adoption, knowledge worker AI, operating system choice, productivity modes, connectors, deep research mode, agent mode, image generation, web search, Canvas mode, advanced voice mode, business process automation, workflow evaluation, change management, AI training, Send Everyday AI and Jordan a text message. (We can't reply back unless you leave contact info) Head to AI.studio/build to create your first app. Head to AI.studio/build to create your first app.
No persons identified in this episode.
This episode hasn't been transcribed yet
Help us prioritize this episode for transcription by upvoting it.
Popular episodes get transcribed faster
Other recent transcribed episodes
Transcribed and ready to explore now
3ª PARTE | 17 DIC 2025 | EL PARTIDAZO DE COPE
01 Jan 1970
El Partidazo de COPE
13:00H | 21 DIC 2025 | Fin de Semana
01 Jan 1970
Fin de Semana
12:00H | 21 DIC 2025 | Fin de Semana
01 Jan 1970
Fin de Semana
10:00H | 21 DIC 2025 | Fin de Semana
01 Jan 1970
Fin de Semana
13:00H | 20 DIC 2025 | Fin de Semana
01 Jan 1970
Fin de Semana
12:00H | 20 DIC 2025 | Fin de Semana
01 Jan 1970
Fin de Semana