Martin Kleppmann

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

It was a Rails app with a Postgres database, basically.

601.962 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

and some Redis and some similar things like that mixed in.

605.287 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

So actually, you know, nothing particularly revolutionary.

608.652 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

We essentially built a graph database on top of Postgres, so there was a little bit of technical interest in there, but, you know, nothing particularly outrageous.

611.757 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

After our team got disbanded, I switched over to the stream processing team.

630.558 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

So Kafka had just been developed at LinkedIn and had just been open sourced at the time.

635.783 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

Yeah, they developed it, right?

640.549 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

Oh, it was just being open sourced.

641.309 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

Yeah, I think it had just been open sourced.

642.811 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

And then I got to work on SAMSA, which was a stream processing framework on top of Kafka.

645.013 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

Yes.

668.652 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

So I think Jay Kreps has a pretty good blog post from that era called The Log, where he explains his motivation behind Kafka and why make it an append-only log rather than like a traditional message queue or something like that.

669.353 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

I think the motivation was really about data integration, because there were a whole bunch of databases and event generating systems, like activity events from users, for example.

686.017 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

They were all generating data in a sort of stream shape

697.143 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

And then a bunch of downstream systems that wanted to consume this, like wanted to get it into the data warehouse and wanted to be able to get it into the Hadoop cluster at the time in order to run like machine learning and things over it.

701.272 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

And there was just this data integration problem of actually like, how do you physically get the data out of one system and into another?

713.725 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

And Jay designed Kafka as this integration point, essentially like the

720.512 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

almost a kind of lowest common denominator, but still a general purpose abstraction for integrating various data sources and to downstream data sinks.

725.858 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

That's right, yes, because like previously the biggest company I had worked in was Reported with five people.

749.329 View full episode →

The Pragmatic Engineer

Designing Data-intensive Applications with Martin Kleppmann

We had a

754.235 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment