Marc Brooker

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

broadly that class of problems.

972.959 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

So folks can say, hey, I'm going to build on D-SQL and just not have this whole class of problems.

976.163 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

And I think that's a really kind of powerful outer loop of the post-mortem process is to say, how do we turn all of these lessons into new services and into service improvements?

982.572 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

How do you prevent misbehaving clients from being a problem for the database?

995.691 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

Yeah, so in DSQL's case, we have no pessimistic locking.

1001.225 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

And so within the scope of a transaction, everything that happens in that transaction, all of the reads happen using this mechanism called multiversion concurrency control, where every row in the database, we sort of store a history of versions.

1007.191 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

And so you can read an old version of a row without blocking writers and saying, hey, you can't update this because I just read it.

1023.168 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

And then locally within the query processor that's handling a connection, we spool the writes locally and then you get to commit time and we do this optimistic check of, can I commit this transaction at the transaction commit time?

1030.317 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

And so combining those two mechanisms of having multiversion concurrency control and the scale-out storage that comes with it and the commit time optimistic checks, we can strongly say that there is no way that a reader of a piece of data can block other writers, and there's no way that a writer of data can block readers.

1047.243 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

Writers can block writers, but only...

1072.281 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

Only by changing data, not just by looking at it.

1075.947 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

And so you can say, well, I can cause, sorry, writers can't block writers, but they can prevent other writers' transactions from eventually committing by making a bunch of changes.

1078.872 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

And that is inherent to the definition of the particular database isolation level.

1091.012 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

Yeah, it's actually surprisingly small.

1108.264 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

And it's surprisingly small because if you look at the access patterns for most online databases, even ones that do a lot of write traffic, that write traffic tends to be quite concentrated.

1110.707 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

And it's quite unusual for an online database workload or even an analytics workload

1121.618 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

to make a second version of every row in the database.

1127.905 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

Typically what it's doing is making a, you know, first, second, third, the hundredth version of this row and a 50th version of that row, but the vast majority of data isn't changing.

1132.511 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

And so it's super workload dependent, as is everything in the database world, but the overhead tends to be relatively small.

1142.485 View full episode →

The Peterman Pod

AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker

I would say it's unusual for...

1151.978 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment