Google SRE Prodcast
Episodes
The One With SLOs
07 Jan 2026
Contributed by Lukas
In this episode, we welcome Alex Hidalgo and Brian Singer of nobl9 to discuss Service Level Objectives (SLOs). Alex and Brian talk about how SLOs can ...
The One With Steph Hippo and Observability
16 Dec 2025
Contributed by Lukas
In this episode, Steph Hippo, Platform Engineering Director at Honeycomb, joins The Prodcast to discuss AI and SRE. Steph explains how observability...
The One with Ben Good and Our Kubernetes Friends
30 Jul 2025
Contributed by Lukas
In this special episode hosts Steve McGhee from the Google SRE Prodcast and Kaslin Fields from the Google Kubernetes Podcast, welcome Google Cloud Sol...
The One With AI Agents, Ramón Llamas, and Swapnil Haria
23 Jul 2025
Contributed by Lukas
Google Staff SRE Ramón Llamas and Google Software Engineer Swapnil Haria join our hosts to explore how AI agents are revolutionizing production mana...
The One with Technical Program Managers and Karanveer Anand
16 Jul 2025
Contributed by Lukas
This episode features Google Technical Program Manager (TPM) Karanveer Anand, who joins our hosts to discuss the unique role of TPMs in Site Reliabili...
The One with STPA, Jeffrey Snover, and Theo Klein
02 Jul 2025
Contributed by Lukas
This episode discusses Systems Theoretic Process Analysis (STPA), a method for analyzing complex systems. Theo Klein, a Google SRE, and Jeffrey Snover...
The One with Startups and Adam Fletcher
25 Jun 2025
Contributed by Lukas
In this episode, hosts Steve McGhee and Matt Siegler are joined by guest, Adam Fletcher, CEO and Co-Founder of MarketStreet. They discuss the current ...
The One with SLOs and Sal Furino
18 Jun 2025
Contributed by Lukas
In this episode, Sal Furino, Customer Reliability Engineer at Bloomberg, discusses all things Service Level Objectives (SLOs) with hosts Steve McGhee ...
The One With the Future of SRE and Matt Zelesko
11 Jun 2025
Contributed by Lukas
Matt Zelesko, the head of Site Reliability Engineering at Google, discusses the evolution of SRE, highlighting the shift from traditional operations t...
The One with AI and Todd Underwood
04 Jun 2025
Contributed by Lukas
In this Google Prodcast episode, Todd Underwood, a reliability expert from Anthropic with experience at Google and OpenAI, discusses the current state...
The One With Data Centers and Peter Pellerzi
28 May 2025
Contributed by Lukas
This episode features guest, Peter Pellerzi (Distinguished Engineer, Google). Peter and the hosts, Matt Siegler and Steve McGhee, focus on the physica...
The One With Security and Jessica Theodat
21 May 2025
Contributed by Lukas
Jessica Theodat (Senior SRE & Security Tech Lead, Google) joins hosts Jordan Greenberg and Steve McGhee to discuss the intersection of security and si...
We're back with Season 4!
16 Apr 2025
Contributed by Lukas
In this "bumpisode", hosts and producers of Prodcast (including our new co-host, Matt Siegler!) reflect on the previous season and introduce the new s...
Special Episode: You Missed a Page from Telebot
29 Jan 2025
Contributed by Lukas
This episode features Javi Beltran, a Google engineering lead who created the "Telebot" theme song. With our beloved hosts, Steve McGhee and Jordan Gr...
Imperative vs. Declarative Change Workflows with Dominic Hutton & Niccolo' Cascarano
11 Dec 2024
Contributed by Lukas
In this episode of the Prodcast, guests Dominic Hutton (Staff SRE, HashiCorp) and Niccolo' Cascarano (Senior Staff SRE at Google) join hosts Steve McG...
Human Factors in Complex Systems with Casey Rosenthal and John Allspaw
04 Dec 2024
Contributed by Lukas
This episode features Casey Rosenthal (Founder, Cirrusly.ai) and John Allspaw (Founder and Principal, Adaptive Capacity Labs), joining our hosts Steve...
Embracing Complexity with Christina Schulman & Dr. Laura Maguire
20 Nov 2024
Contributed by Lukas
In this episode of the Prodcast, we are joined by guests Christina Schulman (Staff SRE, Google) and Dr. Laura Maguire (Principal Engineer, Trace Cogn...
Maglev: load balancing at Google with Cody Smith and Trisha Weir
13 Nov 2024
Contributed by Lukas
In this episode, Cody Smith (CTO and Co-founder, Camus Energy) & Trisha Weir (SRE Department Lead, Google) join hosts Steve McGhee and Jordan Greenbe...
Profiling data with Pat Somaru and Narayan Desai
30 Oct 2024
Contributed by Lukas
In this episode, guests Narayan Desai (Principal SRE, Google) and Pat Somaru (Senior Production Engineer, Meta) join hosts Steve McGhee and Florian Ra...
Google Public DNS (8.8.8.8) with Wilmer van der Gaast and Andy Sykes
23 Oct 2024
Contributed by Lukas
This episode features Google engineers Wilmer van der Gaast (Production on-tall) and Andy Sykes (Senior Staff Systems Engineer, SRE), joining hosts St...
SRE in the Retail and Gaming Worlds with Jordan Chernev & Scott Bowers
16 Oct 2024
Contributed by Lukas
Guests Jordan Chernev (Senior Technology Executive) and Scott Bowers (SRE, Gearbox Software) who hail from the retail and gaming industries, respectiv...
Incident Response with Sarah Butt and Vrai Stacey
09 Oct 2024
Contributed by Lukas
Sarah Butt (Principal Engineer, Centralized Incident Response, Salesforce) and Vrai Stacey (Staff Software Engineer, Google) join hosts Steve McGhee a...
Building Reliable Systems with Silvia Botros and Niall Murphy
02 Oct 2024
Contributed by Lukas
Silvia Botros (SRE Architect, Twilio | Author of "High Performance MySQL, 4th edition") and Niall Murphy (Co-founder & CEO, Stanza) join hosts Steve M...
Creating Systems that are Safe with Liz Fong-Jones
25 Sep 2024
Contributed by Lukas
Liz Fong-Jones (former Google SRE and current Field CTO at honeycomb.io) joins hosts Steve McGhee and Jordan Greenberg for a lively discussion centere...
Production Problems Are For All! with Ben Treynor Sloss
18 Sep 2024
Contributed by Lukas
Ben Treynor Sloss (VP of Engineering, Google) joins hosts Steve McGhee and Dr. Jennifer Petoff (Director of Technical Infrastructure Education, Google...
There Remains a Huge Amount of Work to Do, with Healfdene Goguen
11 Sep 2024
Contributed by Lukas
In this episode, Healfdene Goguen (Principal Engineer, Google) joins hosts Steve McGhee and Jordan Greenberg to discuss the vast amount of work to be...
SRE, a Basis of Influence, with Amy Tobey & Vladyslav Ukis
04 Sep 2024
Contributed by Lukas
In this season of Google Prodcast, current and former SREs, both within and outside of Google, chat with hosts Steve McGhee and Jordan Greenberg to di...
Life of An SRE: Life after Google SRE, with Carla Geisser, Cody Smith, and Laura Nolan
07 Nov 2023
Contributed by Lukas
Former Google SREs, or "Xooglers", talk with hosts MP and Steve McGhee about site reliability engineering outside of Google. What's the difference in ...
Life of An SRE with Sabrina Farmer
31 Oct 2023
Contributed by Lukas
Sabrina Farmer, VP of Engineering at Google, talks about her career journey through Site Reliability Engineering. What does management mean? What's ...
Life of An SRE with Dave Reisner
17 Oct 2023
Contributed by Lukas
Dave Reisner talks about his path to Staff SRE, from ArchLinux contributor through DevOps to software engineer. This episode emphasizes the value of s...
Life of an SRE with Stephen Benjamin
10 Oct 2023
Contributed by Lukas
Explore the role and responsibilities of an SRE manager with Stephen Benjamin.
Life of An SRE with Jessica Theodat
03 Oct 2023
Contributed by Lukas
Explore the role and responsibilities of a Senior SRE with Jessica Theodat, as she discusses life-work balance, the value of mentoring, and being a Bl...
Life of An SRE with Shannon Brady and Theo Klein
26 Sep 2023
Contributed by Lukas
Explore the career path of SREs Shannon Brady and Theo Klein as they discusses their paths to Site Reliability Engineering and finding their areas of ...
Life of An SRE with Mariuxi Vasconez and Julian Alarcon
19 Sep 2023
Contributed by Lukas
In this episode, Mariuxi and Julian discuss their paths to SRE: what drew them initially to SRE, and what motivates them to continue developing skills...
Life of An SRE Episode 1: Tom Cranitch and Megan Yin
12 Sep 2023
Contributed by Lukas
How does one become an SRE? And what's the career like? In this episode, Tom and Megan discuss their path to SRE.
Creating the SRE Prodcast with John Reese (JTR)
07 Jun 2022
Contributed by Lukas
Host MP English and former Google SRE John Reese (JTR) chat about the creation of the Prodcast. Visit https://sre.google/prodcast for transcripts and ...
Postmortems with Ayelet Sachto
31 May 2022
Contributed by Lukas
Ayelet Sachto offers advice on creating an actionable, transparent, and blameless postmortem culture. Visit https://sre.google/prodcast for transcript...
Incident Management with Adrienne Walcer
24 May 2022
Contributed by Lukas
Adrienne Walcer discusses how to approach and organize incident management efforts throughout the production lifecycle. Visit https://sre.google/prodc...
On-Call Rotations with Andrew Widdowson (APW)
17 May 2022
Contributed by Lukas
Andrew Widdowson (APW) shares strategies for successful on-call rotations. Visit https://sre.google/prodcast for transcripts and links to further read...
Automation with Pierre Palatin
10 May 2022
Contributed by Lukas
Pierre Palatin dives into different automation strategies, how to build confidence in your system, and why designing the UI may be your biggest challe...
Client-Transparent Migrations with Pavan Adharapurapu
03 May 2022
Contributed by Lukas
Pavan Adharapurapu details how to approach large-scale migrations while optimizing for user experience. Visit https://sre.google/prodcast for transcri...
Rethinking SLOs with Narayan Desai
26 Apr 2022
Contributed by Lukas
Narayan Desai explains why SLOs can be problematic and proposes alternative methods for monitoring complex, large-scale systems. Visit https://sre.goo...
Alerting with Amelia Harrison
19 Apr 2022
Contributed by Lukas
Amelia Harrison advises on when and how to alert, ideal coverage, and tuning. Visit https://sre.google/prodcast for transcripts and links to further r...
Customer-Centric Monitoring with Silvia Esparrachiari
12 Apr 2022
Contributed by Lukas
Silvia Esparrachiari talks about the challenges of monitoring and the importance of understanding your users. Visit https://sre.google/prodcast for tr...
SRE Philosophy with Jennifer Mace (Macey)
05 Apr 2022
Contributed by Lukas
What is SRE, anyway? Jennifer Mace (Macey) gives us her definition of "site reliability engineer," discusses how to manage risk, and shares key questi...