Martin Kleppmann

Designing Data-intensive Applications with Martin Kleppmann

Distributed system theory just doesn't make any assumptions about that sort of timing if we can avoid it.

Designing Data-intensive Applications with Martin Kleppmann

Or rather, some theory does make those assumptions, but it's a dangerous assumption to make because occasionally the network delay does become much higher than what is typical.

2556.388 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

Another thing is about crashes, for example.

2566.067 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

distributed system theory just says like nodes can crash but what does that actually mean like what in practice does it mean for a node to become unavailable because it might be a software crash but yes it might be a hardware failure it might be somebody unplugging the power cable it might be that

2569.874 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

the node is actually still running, but it's just become disconnected from the network.

2585.922 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

The point of this book chapter really is to defend and justify those theoretical models that we use for analyzing distributed systems and just giving a lot of stories and case studies that show that actually tons of stuff does go wrong.

2589.63 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

and like don't believe anyone who says oh failures are rare it's don't don't worry about it it's fine uh the the moral of this chapter is really that actually you know if you want to make things reliable you really do have to worry about a whole bunch of weird unusual but but certainly possible edge cases timing is another one of those things like you know it's very easy to assume that your clocks are correct and most of the times the clocks are pretty correct

2606.324 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

But we just can't rely on it because actually they're just not precise enough on the whole.

2632.447 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

And so a lot of it is about it's very tempting to make certain assumptions that things are well behaved and in distributed systems, we just have to try to get away from those assumptions if we want the systems to work reliably, even in the face of things going wrong.

2636.775 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

But it was a really fun chapter to write because, you know, it's essentially a big collection of stuff that has gone wrong.

2653.608 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

And so I went through a bunch of postmortems published by various tech companies, for example, in order to see, OK, what was the root cause of how things went wrong and what kind of lessons can we draw from this that apply to the book in general?

2659.494 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

And, you know, there's some fun stuff like the sharks biting undersea cables and damaging them.

2672.427 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

That just, you know, makes for a great story.

2678.093 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

And then I hear that in recent years, the shielding of undersea cables has got better and therefore the sharks are not biting them anymore.

2680.798 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

But instead, the cows on land are stepping on cables and occasionally causing network interruptions that way.

2687.77 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

And, you know, that sort of thing is just, it makes it a bit more fun.

2692.739 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

yeah but i think there's there's no like right answer it's a it's a trade-off between risk and cost broadly speaking and that means a business decision has to be made in terms of where the business wants to lie uh on that trade-off and so the goal of this chapter is really just to give people the information in order to make an educated decision but i don't want to make that decision for people that's for businesses themselves to decide that's very clear

2737.561 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

Yeah, so there are some things that we've been able to take out of the book compared to the first edition.

2779.057 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

In particular, for example, coverage of MapReduce was quite detailed in the first edition, but basically MapReduce is dead.

2784.724 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

Nobody uses it anymore.

2791.353 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment