Distributed Bytes

Dynamo, DynamoDB, and Aurora DSQL - Marc's Blog

2025-10-14

People often ask me about the architectural relationship between Amazon Dynamo (as described in the classic 2007 SOSP paper), Amazon DynamoDB (the serverless distributed NoSQL database from AWS), and Aurora DSQL (the serverless distributed SQL database from AWS). There’s a ton to say on the topic, but I’ll start off by comparing how the systems achieve a few key properties.

Linearizability testing S2 with deterministic simulation

2025-09-30

With S2, it is a hard requirement that our Stream API operations exhibit linearizability. Linearizable systems are far simpler to reason about, and many applications are only possible to build on top of data platforms that offer strong consistency guarantees like this. Because it's important, we also need to test it! We can gain confidence that S2 is linearizable by taking an empirical validation approach, using a model checker like Knossos or Porcupine.

How I solved a distributed queue problem after 15 years | DBOS

2025-09-22

Learn how queues make horizontal scaling, scheduling, and flow control easier in cloud systems, and how to make them durable and observable.

Understanding Paxos the intuitive way | Relentless Leader

2025-08-09

We are on a path to build a strong foundation in distributed systems. We have already gone over distributed time; the next topic we will cover is distributed consensus. To build the foundation on distributed consensus, we will go over Paxos. Paxos revolutionized distributed computing by providing the first provably correct solution for achieving consensus among unreliable processors, forming the theoretical foundation for modern distributed systems and databases. Paxos is one of the most important and most difficult to understand algorithms. In this blog I will simplify and explain Paxos in a very intuitive way.

Murat and Aleksey Read Papers: "Real Life Is Uncertain. Consensus Should Be Too!" - YouTube

2025-07-31

Murat Demirbas (https://muratbuffalo.blogspot.com) and Aleksey Charapko (https://charap.co) read and discuss "Real Life Is Uncertain. Consensus Should Be Too...

Learning about distributed systems: where to start?

2025-05-30

This is definitely not a "learn distributed systems in 21 days" post. I recommend a principled, from the foundations-up, studying of distrib...

FLP Result: Impossibility Of Distributed Consensus with One Faulty Process

2025-05-29

The consensus problem involves an asynchronous system of processes, some of which may be unreliable. The problem is for the reliable processes to agree on a binary value. In this paper, it is shown that every protocol for this problem has the possibility of nontermination, even with only one faulty process. By way of contrast, solutions are known for the synchronous case, the “Byzantine Generals” problem.

Just make it scale: An Aurora DSQL story | All Things Distributed

2025-05-28

AWS Senior Principal Engineers, Niko Matsakis and Marc Bowes, take us inside Aurora DSQL's development: scaling write operations without two-phase commit, overcoming garbage collection hurdles, and embracing Rust for both data and control planes.

Reasoning about Distributed Protocols with Smart Casual Verification

2025-05-27

Here at Decentralized Thoughts, we spend a lot of time reasoning about distributed protocols. Often, we focus on solving distributed consensus. Personally it’s my favorite CS problem, but it’s also famously one of the most difficult and subtle problems in distributed computing. Reasoning about distributed algorithms is hard at the...

Apache Iceberg Internals Dive Deep On Performance | Relentless Leader

2025-05-15

In this blog I will go over how Apache Iceberg contributes to the performance of compute engines. Apache Iceberg is an ACID table format designed for large-scale analytics workloads. While its consistency and schema evolution features were covered in a previous blog post, its impact on query performance can be equally transformative. By the end of this document, you will have a deep understanding of how Iceberg enhances performance, the trade-offs involved, and best practices for maximizing efficiency in read-heavy workloads.
