Martin Kleppmann

Designing Data-intensive Applications with Martin Kleppmann

Its successors, like in the form of Spark and Flink, for example, they are used.

Designing Data-intensive Applications with Martin Kleppmann

And so we still reference MapReduce in the second edition, but more as a learning tool in order to understand how these kind of partition-sharded batch processing systems work.

2797.561 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

So that's one thing where we've been able to reduce the coverage.

2808.335 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

But other areas where we've increased the coverage are, for example, systems in support of AI.

2812.101 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

And so even though this is not an AI book, but there are still data systems concerns that arise when needing to support AI applications, like a classic one is vector indexes, for example.

2818.251 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

And so we've added some coverage of vector indexes to the storage engine chapter.

2829.85 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

Fit in really well there because it already covers various different indexing strategies anyway.

2834.214 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

And so vector indexes, it's just another indexing strategy.

2839.38 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

We also added some coverage of data frames, for example.

2843.164 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

That's not an exclusively AI thing, but data frames are quite a good data representation for training data, for example.

2846.547 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

And that was not one of the data models that we discussed in the first edition, but we decided to add to the second edition because it has actually become a very important data model that people are using alongside all of the classic data models like relational and graph and JSON documents and so on.

2854.235 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

And so there are these places where we've just expanded the coverage a bit to reflect the kinds of systems people are building, for example, to support AI without it changing the direction of the book entirely.

2869.996 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

Absolutely.

2910.614 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

Yeah.

2911.336 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

So the motivation for putting in an ethics section there in the first edition was that I just felt it had been quite ignored as a concern during my time in industry.

2911.656 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

That's like,

2925.367 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

especially in startups, people were very focused on like building a product that their customers would love and really like deprioritizing these sort of ethical questions in the process.

2926.51 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

And so, for example, with the consumer facing products, it might be that the products are very much geared towards essentially data harvesting, collecting behavioral data, because that's what can be monetized in the form of advertising and

2939.046 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

there seemed to be just very little reflection on what was good and bad about these sort of things.

2954.827 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

So I really just wanted to encourage a bit of thinking there.

2960.021 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment