This is Fine! A podcast about resilience engineering and software
Episodes
The 2025 DORA Report w/special guest Fred Hebert
12 Mar 2026
Contributed by Lukas
You can find the 2025 DORA Report here: https://dora.dev/research/2025/dora-report/ Read more of Fred’s work/opinions here: https://ferd.ca/ If you wa...
Building and Revising Adaptive Capacity Sharing for Technical Incident Response with Beth Adele Long
26 Feb 2026
Contributed by Lukas
The Keweenaw snow gauge that Colette mentioned is a tourist attraction. If you want to see where measurements are at for the season you can find them ...
Outsourcing and Resilience
12 Feb 2026
Contributed by Lukas
Colette mentioned Menlo Innovations https://menloinnovations.com/ and Atomic Object https://atomicobject.com/ who both build custom software for folks...
The Messy 9 and Coding with AI - A Panel Discussion
01 Feb 2026
Contributed by Lukas
Special thanks to John Allspaw, Sheeri Cabral, Martin Smith, and David Woods for joining us! Ben Affleck’s been making the promo rounds, but the spec...
Going Solid
17 Jan 2026
Contributed by Lukas
If you’re feeling like you need to do more to respond to our moment: Lots of places to donate to in the Twin Cities are listed here: https://mspmag.co...
The Year in Resilience w/special guest John Allspaw
31 Dec 2025
Contributed by Lukas
Seriously though, can’t wait to gtfo of this year. Palisades fire links: https://www.nbclosangeles.com/investigations/anonymous-letter-demands-indepe...
Incident Status: On Hold w/special guest Will Gallego
28 Nov 2025
Contributed by Lukas
Mentioned multiple times, Em Ruppe’s amazing talk on incident severity: https://www.usenix.org/conference/srecon24americas/presentation/ruppe We talk...
Complex Systems and the Messy Nine w/special guests Dave Woods and John Allspaw
13 Nov 2025
Contributed by Lukas
The writeup on the AWS outage from AWS themselves, if you haven’t seen it: https://aws.amazon.com/message/101925/ Dave’s department at OSU, Cogniti...
All the things about Incident Command
30 Oct 2025
Contributed by Lukas
It’s Spamton G (not J) Spamton, Clint! Get hip to the game characters! https://deltarune.fandom.com/wiki/Spamton There are a couple of incident ...
Root Cause Analysis vs. Resilience Engineering w/special guest Lorin Hochstein
16 Oct 2025
Contributed by Lukas
A history of the 5 whys and root cause analysis from papers. Some critiques of the 5 whys: From John Allspaw: https://www.oreilly.com/radar/the-infinite-...
First Stories/Second Stories
02 Oct 2025
Contributed by Lukas
More robustness than resilience, but worth repeating that you should always check your earthquake go-bag: https://www.earthquakeauthority.com/blog/201...
How (Not) to Introduce Resilience Engineering at Work with special guest Michelle Casey
18 Sep 2025
Contributed by Lukas
Lorikeets are pretty: https://en.wikipedia.org/wiki/Rainbow_lorikeet You think Colette’s kidding about the kangaroo? https://www.youtube.com/watch?v=...
How long should you wait after an incident to do your retro?
25 Jul 2025
Contributed by Lukas
Corn sweat is a real thing: https://www.scientificamerican.com/article/humidity-from-corn-sweat-intensifies-extreme-heat-wave-in-midwest-u-s/ Also, plu...
Lund University - Academic Theory and Practice
10 Jul 2025
Contributed by Lukas
A huge thanks to our panelists: John Allspaw, Jed Needle, and Chad Todd. RISF and TiF will host a live follow-up to this episode on July 31st! ...
What’s the ROI on Reliability and Resilience work?
27 Jun 2025
Contributed by Lukas
Dave Woods’s talk at SRECon 25 was on Complexification and SRE: https://www.youtube.com/watch?v=lmBvUJnGUX4 Jens Rasmussen’s model is really well ...
Runbooks: the Good, Bad and Ugly w/special guest Andrew Hatch
03 Jun 2025
Contributed by Lukas
You can register for the After-the-Episode chat with Andrew at https://resilienceinsoftware.org/networks/events/129997 Tickets are free for members, $1...
What is an incident? How come no one declares them?
21 May 2025
Contributed by Lukas
Michael Wettick’s Lund thesis is great, and Laura Maguire’s paper on the Costs of Coordination that is a shortened version of her dissertation is ...
Chaos Engineering w/special guest Casey Rosenthal
07 May 2025
Contributed by Lukas
The O’Reilly book on Chaos Engineering by Casey and Nora Jones is here: https://www.oreilly.com/library/view/chaos-engineering/9781492043850/ Some of...
Burnout on Aisle 3
26 Apr 2025
Contributed by Lukas
Clint wrote the Socio-Technical Reality Engineer as a blog post; it’s a good read. The Burnout book by the Nagoski sisters is A+++ reading. Those Found...
Resilience, Complexity, and Your Boss: a collab w/Punk Rock Safety
09 Apr 2025
Contributed by Lukas
Ben (Goodheart), Dave (Provan) and Ron (Gantt) have the very awesome podcast Punk Rock Safety (punkrocksafety.com) - you can get your own punk rock sa...
Live From SRECon
28 Mar 2025
Contributed by Lukas
No video for this one because it didn’t really end up working. We had some awesome people with us for this show: Eric Dobbs, Will Gallego, Juan Carlos Ram...
Teaser Episode - Season 2
12 Mar 2025
Contributed by Lukas
The XKCD comic that’s in Colette’s thesis is Dependency. Justin Reock is at DX. https://punkrocksafety.com/ are our mutual podcast friends.
Episode 10 - When They Go Full ITIL on You w/special guest John Allspaw
20 Feb 2025
Contributed by Lukas
You can find John at Adaptive Capacity Labs or his (old) blog at Kitchen Soap. ITIL is… well, it’s a thing. Colette’s “You’re surprised ...
Episode 9 - Learning from Incidents with special guest Alex Elman
12 Feb 2025
Contributed by Lukas
You can find ACL (Adaptive Capacity Labs), the folks who train software engineers how to do LFI and who we speak so fondly of here. Colette mentioned A...
Episode 8 - Why Human Factors and Not Technical Ones
29 Jan 2025
Contributed by Lukas
The spicy Allspaw take that inspired our listener is here: https://www.linkedin.com/posts/jallspaw_a-im-a-bit-salty-today-b-if-you-dont-activity-72879...
Episode 7 - AI and Resilience with special guest Courtney Nash
22 Jan 2025
Contributed by Lukas
The VOID is one of our favorite things! Some of Courtney’s inoculation of the MTTR virus can be found here: An interview with InfoQ. A talk ...
Episode 6 - Can You Buy Resilience? With Special Guest Steve McGhee
08 Jan 2025
Contributed by Lukas
Steve is the host of the Google SRE Prodcast, you should check it out! Colette got her chickens from Greenfire Farms, and her chicken coop from Carolin...
Episode 5 - Curating Your Resilience Engineering 101
22 Dec 2024
Contributed by Lukas
We talk about our favorite recommendations for someone who's just getting into this whole resilience engineering thing. A small note: Clint's voice is ...
Episode 4 - A Look at the 2024 DORA Report
11 Dec 2024
Contributed by Lukas
Fred’s wonderful blog. This year’s DORA report. Lee, Ramsey & Hicks on productivity and performance. Bainbridge’s classic: The Ironies of Automati...
Episode 3 - Lions, Tigers and Metrics, Oh My!
04 Dec 2024
Contributed by Lukas
We answered a set of questions about how to deal with dashboards and MTTR and how to make the best of the situation with the help of special guest Van...
Episode 2 - Does Software Need Safety?
21 Nov 2024
Contributed by Lukas
We talk to the pioneer of resilience engineering in the software world, John Allspaw, about how he discovered this world, and we answer a reader questio...
Episode 1 - Every Second Counts
07 Nov 2024
Contributed by Lukas
The introduction episode of This is Fine! A podcast about resilience engineering in the software world. Clint and Colette discuss conferences and a li...