Marc Brooker

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

Like we don't need to fix these root causes because our on-calls are superheroic and they're going to stay up all night and they're going to, you know, they're going to hack around things and they don't mind being paged a hundred times a week.

1324.817 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

And

1337.298 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

that can feel from the inside like it's a good culture, right?

1338.64 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

Like, oh, wow, these people are super strong owners.

1341.924 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

They're super engaged.

1344.327 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

They really care.

1345.368 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

They're really working hard on call.

1346.329 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

And those are all good signals.

1349.152 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

But then when you look at it from the outside, it's like, wow, we're not actually fixing the causes of things.

1351.115 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

We're just doing this fantastically expensive investment of taking all of these people and their strong ownership and their expertise and spending them just on this break-fix cycle.

1355.4 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

And that's where you need to kind of look at it from the outside and say, well, let's take this energy of this team, fantastic energy, and focus it on improving the service, getting out of the cycle, finding new things to fix, finding new things to build.

1366.855 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

And that can be hard because it can be hard for those folks who've been in that mode to look at it and say, this feels so good.

1385.131 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

It feels really like we're caring about our customers and caring about our product and caring about our business.

1394.754 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

to realize that, oh no, we're actually caring about it at the wrong level and we're not serving our business in the best possible way by being so narrowly and tactically focused on this break-fix cycle.

1401.027 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

And that's where you sort of need to pop them out and say, well, let's spend more time thinking about

1414.374 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

Let's spend more time thinking about the causes of things.

1420.687 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

Let's spend more time addressing these things in a more strategic way.

1425.356 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

And wow, okay, now you've got so much more time to do that because you've broken the cycle and you can improve your product in different ways.

1430.988 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

Yeah, so caching's good, right?

1476.096 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

Like it's, hey, I'm going to take these core ideas from computer science of temporal and spatial locality, and I'm going to exploit those to make my system faster, scale better, et cetera.

1477.458 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment