Menu
Sign In Search Podcasts Libraries Charts People & Topics Add Podcast API Blog Pricing

Martin Kleppmann

๐Ÿ‘ค Speaker
607 total appearances

Appearances Over Time

Podcast Appearances

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

It was a Rails app with a Postgres database, basically.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

and some Redis and some similar things like that mixed in.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

So actually, you know, nothing particularly revolutionary.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

We essentially built a graph database on top of Postgres, so there was a little bit of technical interest in there, but, you know, nothing particularly outrageous.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

After our team got disbanded, I switched over to the stream processing team.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

So Kafka had just been developed at LinkedIn and had just been open sourced at the time.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

Yeah, they developed it, right?

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

Oh, it was just being open sourced.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

Yeah, I think it had just been open sourced.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

And then I got to work on SAMSA, which was a stream processing framework on top of Kafka.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

Yes.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

So I think Jay Kreps has a pretty good blog post from that era called The Log, where he explains his motivation behind Kafka and why make it an append-only log rather than like a traditional message queue or something like that.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

I think the motivation was really about data integration, because there were a whole bunch of databases and event generating systems, like activity events from users, for example.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

They were all generating data in a sort of stream shape

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

And then a bunch of downstream systems that wanted to consume this, like wanted to get it into the data warehouse and wanted to be able to get it into the Hadoop cluster at the time in order to run like machine learning and things over it.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

And there was just this data integration problem of actually like, how do you physically get the data out of one system and into another?

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

And Jay designed Kafka as this integration point, essentially like the

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

almost a kind of lowest common denominator, but still a general purpose abstraction for integrating various data sources and to downstream data sinks.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

That's right, yes, because like previously the biggest company I had worked in was Reported with five people.

The Pragmatic Engineer
Designing Data-intensive Applications with Martin Kleppmann

We had a