Welcome to Distributed Bytes!

I share interesting articles, videos, papers and more about distributed systems, formal methods and computer science.

Made with by Federico Ponzi

Recent Posts

TernFS: an exabyte scale, multi-region distributed filesystem | XTX Tech Blog

This post motivates TernFS, explains its high-level architecture, and then explores some key implementation details.

TernFS: an exabyte scale, multi-region distributed filesystem | XTX Tech Blog

Just make it scale: An Aurora DSQL story | All Things Distributed

The goal with Aurora DSQL’s design is to break up the database into bite-sized chunks with clear interfaces and explicit contracts. Each component follows the Unix mantra—do one thing, and do it well—but working together they are able to offer all the features users expect from a database (transactions, durability, queries, isolation, consistency, recovery, concurrency, performance, logging, and so on).

Just make it scale: An Aurora DSQL story | All Things Distributed

Aurora DSQL: How authentication and authorization works

In this article, I’m going to explain how connections to Aurora DSQL are authenticated and authorized. This information is meant to be supplemental to what is found in the official Amazon Aurora DSQL documentation.

Aurora DSQL: How authentication and authorization works

Dynamo, DynamoDB, and Aurora DSQL - Marc's Blog

People often ask me about the architectural relationship between Amazon Dynamo (as described in the classic 2007 SOSP paper), Amazon DynamoDB (the serverless distributed NoSQL database from AWS), and Aurora DSQL (the serverless distributed SQL database from AWS). There’s a ton to say on the topic, but I’ll start off on comparing how the systems achieve a few key properties.

Dynamo, DynamoDB, and Aurora DSQL - Marc's Blog

Dynamo, DynamoDB, and Aurora DSQL - Marc's Blog

People often ask me about the architectural relationship between Amazon Dynamo (as described in the classic 2007 SOSP paper), Amazon DynamoDB (the serverless distributed NoSQL database from AWS), and Aurora DSQL (the serverless distributed SQL database from AWS). There’s a ton to say on the topic, but I’ll start off on comparing how the systems achieve a few key properties.

Dynamo, DynamoDB, and Aurora DSQL - Marc's Blog

Linearizability testing S2 with deterministic simulation

With S2, it is a hard requirement that our Stream API operations exhibit linearizability. Linearizable systems are far simpler to reason about, and many applications are only possible to build on top of data platforms that offer strong consistency guarantees like this. Because it's important, we also need to test it! We can gain confidence that S2 is linearizable by taking an empirical validation approach, using a model checker like Knossos, or Porcupine.

Linearizability testing S2 with deterministic simulation

How I solved a distributed queue problem after 15 years | DBOS

Learn how queues make horizontal scaling, scheduling, and flow control easier in cloud systems, and how to make them durable and observable.

How I solved a distributed queue problem after 15 years | DBOS

Understanding Paxos the intuitive way | Relentless Leader

We are on a path to build a strong foundation in distributed systems. We have already gone over distributed time; the next topic we will cover is Distributed Consensus. To build the foundation on distributed consensus, we will go over Paxos. Paxos revolutionized distributed computing by providing the first provably correct solution for achieving consensus among unreliable processors, forming the theoretical foundation for modern distributed systems and databases. Paxos is one of the most important and most difficult to understand algorithm. In this blog I will simplify and explain paxos in a very intuitive way.

Understanding Paxos the intuitive way | Relentless Leader

Murat and Aleksey Read Papers: "Real Life Is Uncertain. Consensus Should Be Too!" - YouTube

Murat Demirbas (https://muratbuffalo.blogspot.com) and Aleksey Charapko (https://charap.co) read and discuss "Real Life Is Uncertain. Consensus Should Be Too...

Murat and Aleksey Read Papers:

Learning about distributed systems: where to start?

This is definitely not a "learn distributed systems in 21 days" post. I recommend a principled, from the foundations-up, studying of distrib...

Learning about distributed systems: where to start?