Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing
Podcast Image

The Binary Breakdown

Education

Activity Overview

Episode publication activity over the past year

Episodes

NeonDB: A Serverless PostgreSQL Analysis

31 Jul 2025

Contributed by Lukas

NeonDB is a pioneering serverless PostgreSQL database designed for cloud-native environments, fundamentally characterized by its separation of compute...

Anna: A KVS For Any Scale

29 May 2025

Contributed by Lukas

This research paper introduces Anna, a key-value store (KVS) designed for scalable performance across diverse computing environments, from single mult...

Conflict-free Replicated Data Types

21 May 2025

Contributed by Lukas

This academic paper introduces Conflict-free Replicated Data Types (CRDTs), which are abstract data types designed for distributed systems where data ...

CAP Twelve Years Later: How the "Rules" Have Changed

14 May 2025

Contributed by Lukas

This content from InfoQ provides insights for software architects and developers through various formats like newsletters, articles, and conference in...

Raft versus Paxos: An Understandable Consensus Algorithm

07 May 2025

Contributed by Lukas

Raft, a consensus algorithm designed for managing a replicated log in distributed systems. It aims to be more understandable than Paxos, a widely used...

Neo4j Architecture: Graph Database Internals, Performance, and Optimization

01 May 2025

Contributed by Lukas

This compilation of resources offers a comprehensive examination of Neo4j's graph database architecture. It explains how Neo4j differs from relati...

Sentry: Error Monitoring at Scale - Design Principles Analysis

23 Apr 2025

Contributed by Lukas

Sentry is a large-scale, open-source error monitoring platform designed for modern distributed systems. It prioritizes actionable insights by focusing...

Istio Service Mesh: Architecture, Security, and Traffic Management

16 Apr 2025

Contributed by Lukas

These excerpts offer a detailed look at Istio's service mesh architecture, a critical component for managing microservices in cloud-native environ...

CockroachDB: SQL for Global Scale Design Principles

09 Apr 2025

Contributed by Lukas

CockroachDB is a distributed SQL database designed for global scalability and resilience. The database achieves this through a unique architecture bui...

Snowflake: Revolutionizing Cloud Data Warehousing and Analytics

02 Apr 2025

Contributed by Lukas

Snowflake, a cloud-native data warehouse, revolutionizes modern analytics through its unique architecture and capabilities. The platform separates co...

Kubernetes: Container Orchestration, Architecture, and Evolution

26 Mar 2025

Contributed by Lukas

This collection of excerpts comprehensively examines Kubernetes, the leading container orchestration platform. It traces the historical evolution of c...

Elasticsearch: Architecture, Applications, and Emerging Trends

19 Mar 2025

Contributed by Lukas

This compilation of excerpts thoroughly examines Elasticsearch, focusing on its architecture, applications, and future trends. The core architecture a...

Ray: A Distributed Framework for Emerging AI Applications

12 Mar 2025

Contributed by Lukas

This research paper introduces Ray, a distributed framework designed for emerging AI applications, particularly those involving reinforcement learning...

Zanzibar: Google's Global Authorization System

05 Mar 2025

Contributed by Lukas

This paper details Zanzibar, Google's globally distributed authorization system, designed to manage access control lists (ACLs) at a massive scale...

Google Mesa: A Geo-Replicated, Near Real-Time Data Warehouse

26 Feb 2025

Contributed by Lukas

**Mesa** is a highly scalable, geo-replicated data warehousing system developed at Google to handle petabytes of data related to its advertising busin...

Time, Clocks, and the Ordering of Events in a Distributed System

19 Feb 2025

Contributed by Lukas

This paper, "Time, Clocks, and the Ordering of Events in a Distributed System," explores the challenges of defining and managing time in dis...

ZooKeeper: Wait-Free Coordination for Internet-Scale Systems

12 Feb 2025

Contributed by Lukas

This paper details the design and implementation of ZooKeeper, a high-performance coordination service for large-scale distributed systems. ZooKeeper ...

TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems

05 Feb 2025

Contributed by Lukas

This paper details TensorFlow, a large-scale machine learning system developed by Google. TensorFlow uses dataflow graphs to represent computation and...

Firestore: A Serverless NoSQL Database

29 Jan 2025

Contributed by Lukas

This paper details Google Firestore, a NoSQL serverless database built on Spanner. It highlights Firestore's ease of use, scalability, real-time query...

Apache Flink: Stream and Batch Processing in a Single Engine

22 Jan 2025

Contributed by Lukas

This research paper details Apache Flink, an open-source system unifying stream and batch data processing. Flink uses a dataflow model to handle vario...

Kafka: A Distributed Messaging System for Log Processing

15 Jan 2025

Contributed by Lukas

This paper introduces Kafka, a novel distributed messaging system designed for high-throughput log processing. Kafka addresses limitations in existing...

LinkedIn: Using Set Cover to Optimize a Large-Scale Low Latency Distributed Graph

08 Jan 2025

Contributed by Lukas

This research paper details LinkedIn's solution for optimizing low-latency graph computations within their large-scale distributed graph system. To ...

Monolith: A Real-Time Recommendation System

01 Jan 2025

Contributed by Lukas

This research paper details Monolith, a real-time recommendation system developed by Bytedance. Monolith addresses challenges in building scalable rec...

Meta FlexiRaft: Flexible Quorums for Raft Consensus

25 Dec 2024

Contributed by Lukas

This research paper details FlexiRaft, a modified Raft consensus algorithm designed for Meta's petabyte-scale MySQL deployments. The core improvement ...

Spanner: Google’s Globally Distributed Database

07 Dec 2024

Contributed by Lukas

This research paper details Spanner, Google's globally-distributed database system. Spanner achieves strong consistency across its geographically disp...

Meta Minesweeper: Scalable Statistical Root Cause Analysis on App Telemetry

30 Nov 2024

Contributed by Lukas

This research paper introduces Minesweeper, a novel technique for automated root cause analysis (RCA) of software bugs at scale. Leveraging telemetry ...

Cassandra- A Decentralized Structured Storage System

24 Nov 2024

Contributed by Lukas

This paper details Cassandra, a decentralized structured storage system designed for managing massive amounts of structured data across numerous commo...

FoundationDB: A Distributed Unbundled Transactional Key Value Store

17 Nov 2024

Contributed by Lukas

The provided text is an excerpt from a research paper on FoundationDB, an open-source, distributed transactional key-value store. The paper details Fo...

Amazon Aurora: Design Considerations for High Throughput Cloud-Native Relational Databases

09 Nov 2024

Contributed by Lukas

This document describes the design of Amazon Aurora, a cloud-native relational database service built to handle high-throughput, online transaction pr...

Pregel: A System for Large-Scale Graph Processing

09 Nov 2024

Contributed by Lukas

The article is a paper published in 2010 by researchers at Google that introduces Pregel, a large-scale graph processing system. Pregel is designed fo...

Dapper, a Large-Scale Distributed Systems Tracing Infrastructure

08 Nov 2024

Contributed by Lukas

This paper from Google describes the design and implementation of Dapper, Google’s system for tracing requests in distributed systems. The authors e...

Google: The Chubby lock service for loosely-coupled distributed systems

31 Oct 2024

Contributed by Lukas

This document describes the development and implementation of Google's Chubby lock service, a highly available and reliable system that provides coars...

Megastore: Providing Scalable, Highly Available Storage for Interactive Services

28 Oct 2024

Contributed by Lukas

The provided text describes the architecture and design of Megastore, a Google-developed storage system designed to meet the needs of interactive onli...

Bigtable: A Distributed Storage System for Structured Data

28 Oct 2024

Contributed by Lukas

The article, “Bigtable: A Distributed Storage System for Structured Data,” describes a large-scale distributed data storage system developed at Go...

MapReduce: Simplified Data Processing on Large Clusters

28 Oct 2024

Contributed by Lukas

MapReduce is a programming model that simplifies the process of processing large datasets on clusters of commodity machines. It allows users to defin...

The Google File System

28 Oct 2024

Contributed by Lukas

The source is a technical paper that describes the Google File System (GFS), a scalable distributed file system designed to meet Google's data process...

TAO: Facebook’s Distributed Data Store for the Social Graph

28 Oct 2024

Contributed by Lukas

Facebook developed a distributed data store called TAO to efficiently serve the social graph data. TAO prioritizes read optimization, availability, an...

Scaling Memcache at Facebook

28 Oct 2024

Contributed by Lukas

This document details how Facebook engineers scaled Memcached, a popular open-source in-memory caching solution, to accommodate the demands of the wor...

Monarch: Google’s Planet-Scale In-Memory Time Series Database

27 Oct 2024

Contributed by Lukas

This technical paper details the architecture and design of Monarch, a planet-scale in-memory time series database developed at Google. Monarch is ...

Gorilla: A Fast, Scalable, In-Memory Time Series Database

27 Oct 2024

Contributed by Lukas

The provided text describes the architecture and functionality of Gorilla, Facebook's in-memory time series database. Gorilla was developed to addres...

Building a three-tier architecture on a budget

27 Oct 2024

Contributed by Lukas

This document, an AWS blog post, guides users through the process of building a cost-effective, three-tier architecture using serverless technologies ...

Saas Lens: Deploy multi-tenant SaaS workloads using AWS services

27 Oct 2024

Contributed by Lukas

This whitepaper outlines the AWS Well-Architected Framework specifically for Software as a Service (SaaS) applications. It examines how to design and ...

Streaming Media Lens

27 Oct 2024

Contributed by Lukas

This document is a white paper about the AWS Well-Architected Framework, particularly focusing on its application to streaming media workloads. It def...

Dynamo: Amazon’s Highly Available Key-value Store

27 Oct 2024

Contributed by Lukas

This technical paper details the design and implementation of Dynamo, a highly available and scalable key-value storage system developed by Amazon.c...