The Binary Breakdown
Episodes
NeonDB: A Serverless PostgreSQL Analysis
31 Jul 2025
Contributed by Lukas
NeonDB is a pioneering serverless PostgreSQL database designed for cloud-native environments, fundamentally characterized by its separation of compute...
Anna: A KVS For Any Scale
29 May 2025
Contributed by Lukas
This research paper introduces Anna, a key-value store (KVS) designed for scalable performance across diverse computing environments, from single mult...
Conflict-free Replicated Data Types
21 May 2025
Contributed by Lukas
This academic paper introduces Conflict-free Replicated Data Types (CRDTs), which are abstract data types designed for distributed systems where data ...
CAP Twelve Years Later: How the "Rules" Have Changed
14 May 2025
Contributed by Lukas
This content from InfoQ provides insights for software architects and developers through various formats like newsletters, articles, and conference in...
Raft versus Paxos: An Understandable Consensus Algorithm
07 May 2025
Contributed by Lukas
Raft, a consensus algorithm designed for managing a replicated log in distributed systems. It aims to be more understandable than Paxos, a widely used...
Neo4j Architecture: Graph Database Internals, Performance, and Optimization
01 May 2025
Contributed by Lukas
This compilation of resources offers a comprehensive examination of Neo4j's graph database architecture. It explains how Neo4j differs from relati...
Sentry: Error Monitoring at Scale - Design Principles Analysis
23 Apr 2025
Contributed by Lukas
Sentry is a large-scale, open-source error monitoring platform designed for modern distributed systems. It prioritizes actionable insights by focusing...
Istio Service Mesh: Architecture, Security, and Traffic Management
16 Apr 2025
Contributed by Lukas
These excerpts offer a detailed look at Istio's service mesh architecture, a critical component for managing microservices in cloud-native environ...
CockroachDB: SQL for Global Scale Design Principles
09 Apr 2025
Contributed by Lukas
CockroachDB is a distributed SQL database designed for global scalability and resilience. The database achieves this through a unique architecture bui...
Snowflake: Revolutionizing Cloud Data Warehousing and Analytics
02 Apr 2025
Contributed by Lukas
Snowflake, a cloud-native data warehouse, revolutionizes modern analytics through its unique architecture and capabilities. The platform separates co...
Kubernetes: Container Orchestration, Architecture, and Evolution
26 Mar 2025
Contributed by Lukas
This collection of excerpts comprehensively examines Kubernetes, the leading container orchestration platform. It traces the historical evolution of c...
Elasticsearch: Architecture, Applications, and Emerging Trends
19 Mar 2025
Contributed by Lukas
This compilation of excerpts thoroughly examines Elasticsearch, focusing on its architecture, applications, and future trends. The core architecture a...
Ray: A Distributed Framework for Emerging AI Applications
12 Mar 2025
Contributed by Lukas
This research paper introduces Ray, a distributed framework designed for emerging AI applications, particularly those involving reinforcement learning...
Zanzibar: Google's Global Authorization System
05 Mar 2025
Contributed by Lukas
This paper details Zanzibar, Google's globally distributed authorization system, designed to manage access control lists (ACLs) at a massive scale...
Google Mesa: A Geo-Replicated, Near Real-Time Data Warehouse
26 Feb 2025
Contributed by Lukas
**Mesa** is a highly scalable, geo-replicated data warehousing system developed at Google to handle petabytes of data related to its advertising busin...
Time, Clocks, and the Ordering of Events in a Distributed System
19 Feb 2025
Contributed by Lukas
This paper, "Time, Clocks, and the Ordering of Events in a Distributed System," explores the challenges of defining and managing time in dis...
ZooKeeper: Wait-Free Coordination for Internet-Scale Systems
12 Feb 2025
Contributed by Lukas
This paper details the design and implementation of ZooKeeper, a high-performance coordination service for large-scale distributed systems. ZooKeeper ...
TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems
05 Feb 2025
Contributed by Lukas
This paper details TensorFlow, a large-scale machine learning system developed by Google. TensorFlow uses dataflow graphs to represent computation and...
Firestore: A Serverless NoSQL Database
29 Jan 2025
Contributed by Lukas
This paper details Google Firestore, a NoSQL serverless database built on Spanner. It highlights Firestore's ease of use, scalability, real-time query...
Apache Flink: Stream and Batch Processing in a Single Engine
22 Jan 2025
Contributed by Lukas
This research paper details Apache Flink, an open-source system unifying stream and batch data processing. Flink uses a dataflow model to handle vario...
Kafka: A Distributed Messaging System for Log Processing
15 Jan 2025
Contributed by Lukas
This paper introduces Kafka, a novel distributed messaging system designed for high-throughput log processing. Kafka addresses limitations in existing...
LinkedIn: Using Set Cover to Optimize a Large-Scale Low Latency Distributed Graph
08 Jan 2025
Contributed by Lukas
This research paper details LinkedIn's solution for optimizing low-latency graph computations within their large-scale distributed graph system. To ...
Monolith: A Real-Time Recommendation System
01 Jan 2025
Contributed by Lukas
This research paper details Monolith, a real-time recommendation system developed by Bytedance. Monolith addresses challenges in building scalable rec...
Meta FlexiRaft: Flexible Quorums for Raft Consensus
25 Dec 2024
Contributed by Lukas
This research paper details FlexiRaft, a modified Raft consensus algorithm designed for Meta's petabyte-scale MySQL deployments. The core improvement ...
Spanner: Google’s Globally Distributed Database
07 Dec 2024
Contributed by Lukas
This research paper details Spanner, Google's globally-distributed database system. Spanner achieves strong consistency across its geographically disp...
Meta Minesweeper: Scalable Statistical Root Cause Analysis on App Telemetry
30 Nov 2024
Contributed by Lukas
This research paper introduces Minesweeper, a novel technique for automated root cause analysis (RCA) of software bugs at scale. Leveraging telemetry ...
Cassandra- A Decentralized Structured Storage System
24 Nov 2024
Contributed by Lukas
This paper details Cassandra, a decentralized structured storage system designed for managing massive amounts of structured data across numerous commo...
FoundationDB: A Distributed Unbundled Transactional Key Value Store
17 Nov 2024
Contributed by Lukas
The provided text is an excerpt from a research paper on FoundationDB, an open-source, distributed transactional key-value store. The paper details Fo...
Amazon Aurora: Design Considerations for High Throughput Cloud-Native Relational Databases
09 Nov 2024
Contributed by Lukas
This document describes the design of Amazon Aurora, a cloud-native relational database service built to handle high-throughput, online transaction pr...
Pregel: A System for Large-Scale Graph Processing
09 Nov 2024
Contributed by Lukas
The article is a paper published in 2010 by researchers at Google that introduces Pregel, a large-scale graph processing system. Pregel is designed fo...
Dapper, a Large-Scale Distributed Systems Tracing Infrastructure
08 Nov 2024
Contributed by Lukas
This paper from Google describes the design and implementation of Dapper, Google’s system for tracing requests in distributed systems. The authors e...
Google: The Chubby lock service for loosely-coupled distributed systems
31 Oct 2024
Contributed by Lukas
This document describes the development and implementation of Google's Chubby lock service, a highly available and reliable system that provides coars...
Megastore: Providing Scalable, Highly Available Storage for Interactive Services
28 Oct 2024
Contributed by Lukas
The provided text describes the architecture and design of Megastore, a Google-developed storage system designed to meet the needs of interactive onli...
Bigtable: A Distributed Storage System for Structured Data
28 Oct 2024
Contributed by Lukas
The article, “Bigtable: A Distributed Storage System for Structured Data,” describes a large-scale distributed data storage system developed at Go...
MapReduce: Simplified Data Processing on Large Clusters
28 Oct 2024
Contributed by Lukas
MapReduce is a programming model that simplifies the process of processing large datasets on clusters of commodity machines. It allows users to defin...
The Google File System
28 Oct 2024
Contributed by Lukas
The source is a technical paper that describes the Google File System (GFS), a scalable distributed file system designed to meet Google's data process...
TAO: Facebook’s Distributed Data Store for the Social Graph
28 Oct 2024
Contributed by Lukas
Facebook developed a distributed data store called TAO to efficiently serve the social graph data. TAO prioritizes read optimization, availability, an...
Scaling Memcache at Facebook
28 Oct 2024
Contributed by Lukas
This document details how Facebook engineers scaled Memcached, a popular open-source in-memory caching solution, to accommodate the demands of the wor...
Monarch: Google’s Planet-Scale In-Memory Time Series Database
27 Oct 2024
Contributed by Lukas
This technical paper details the architecture and design of Monarch, a planet-scale in-memory time series database developed at Google. Monarch is ...
Gorilla: A Fast, Scalable, In-Memory Time Series Database
27 Oct 2024
Contributed by Lukas
The provided text describes the architecture and functionality of Gorilla, Facebook's in-memory time series database. Gorilla was developed to addres...
Building a three-tier architecture on a budget
27 Oct 2024
Contributed by Lukas
This document, an AWS blog post, guides users through the process of building a cost-effective, three-tier architecture using serverless technologies ...
Saas Lens: Deploy multi-tenant SaaS workloads using AWS services
27 Oct 2024
Contributed by Lukas
This whitepaper outlines the AWS Well-Architected Framework specifically for Software as a Service (SaaS) applications. It examines how to design and ...
Streaming Media Lens
27 Oct 2024
Contributed by Lukas
This document is a white paper about the AWS Well-Architected Framework, particularly focusing on its application to streaming media workloads. It def...
Dynamo: Amazon’s Highly Available Key-value Store
27 Oct 2024
Contributed by Lukas
This technical paper details the design and implementation of Dynamo, a highly available and scalable key-value storage system developed by Amazon.c...