Menu
Sign In Search Podcasts Libraries Charts People & Topics Add Podcast API Blog Pricing

Martin Kleppmann

๐Ÿ‘ค Speaker
607 total appearances

Appearances Over Time

Podcast Appearances

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

Its successors, like in the form of Spark and Flink, for example, they are used.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

And so we still reference MapReduce in the second edition, but more as a learning tool in order to understand how these kind of partition-sharded batch processing systems work.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

So that's one thing where we've been able to reduce the coverage.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

But other areas where we've increased the coverage are, for example, systems in support of AI.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

And so even though this is not an AI book, but there are still data systems concerns that arise when needing to support AI applications, like a classic one is vector indexes, for example.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

And so we've added some coverage of vector indexes to the storage engine chapter.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

Fit in really well there because it already covers various different indexing strategies anyway.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

And so vector indexes, it's just another indexing strategy.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

We also added some coverage of data frames, for example.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

That's not an exclusively AI thing, but data frames are quite a good data representation for training data, for example.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

And that was not one of the data models that we discussed in the first edition, but we decided to add to the second edition because it has actually become a very important data model that people are using alongside all of the classic data models like relational and graph and JSON documents and so on.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

And so there are these places where we've just expanded the coverage a bit to reflect the kinds of systems people are building, for example, to support AI without it changing the direction of the book entirely.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

Absolutely.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

Yeah.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

So the motivation for putting in an ethics section there in the first edition was that I just felt it had been quite ignored as a concern during my time in industry.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

That's like,

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

especially in startups, people were very focused on like building a product that their customers would love and really like deprioritizing these sort of ethical questions in the process.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

And so, for example, with the consumer facing products, it might be that the products are very much geared towards essentially data harvesting, collecting behavioral data, because that's what can be monetized in the form of advertising and

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

there seemed to be just very little reflection on what was good and bad about these sort of things.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

So I really just wanted to encourage a bit of thinking there.