Distributed Bytes

Formal Methods Beyond Correctness: Isolation & Permissiveness of Distributed Transactions in MongoDB | MongoDB

2026-01-31

Learn how we used modular protocol specification for verifying both correctness and performance of MongoDB’s distributed transactions protocol.

Formal Methods Beyond Correctness: Isolation & Permissiveness of Distributed Transactions in MongoDB | MongoDB

One-off Verified Transpilation with Claude

2026-01-27

We can automatically check correctness properties of a TLA+ specification using TLC, a model checker that will exhaustively explore a spec’s reachable states...

One-off Verified Transpilation with Claude

Scaling PostgreSQL

2026-01-24

How OpenAI scales PostgreSQL

Scaling PostgreSQL to power 800 million ChatGPT users

2026-01-24

For years, PostgreSQL has been one of the most critical, under-the-hood data systems powering core products like ChatGPT and OpenAI’s API. As our user base grows rapidly, the demands on our databases have increased exponentially, too. Over the past year, our PostgreSQL load has grown by more than 10x, and it continues to rise quickly.

Scaling PostgreSQL to power 800 million ChatGPT users

SpiceDB Documentation - Authzed Docs

2026-01-22

Welcome to the SpiceDB and AuthZed docs site.

On Idempotency Keys - Gunnar Morling

2026-01-18

In distributed systems, there’s a common understanding that it is not possible to guarantee exactly-once delivery of messages. What is possible though is exactly-once processing. By adding a unique …

What Does Write Skew Look Like?

2025-11-25

This post is about gaining intuition for Write Skew, and, by extension, Snapshot Isolation. Snapshot Isolation is billed as a transaction isolation level that offers a good mix between performance and correctness.

How to do distributed locking — Martin Kleppmann’s blog

2025-11-24

Redis has been gradually making inroads into areas of data management where there are stronger consistency and durability expectations – which worries me, because this is not what Redis is designed for. Arguably, distributed locking is one of those areas. Let’s examine it in some more detail.

How to do distributed locking — Martin Kleppmann’s blog

Reproducing the AWS Outage Race Condition with a Model Checker | Waqas Younas' blog

2025-11-10

As a small experiment, we’ll use a model checker to see how such a race could happen. Formal verification can’t prevent every failure, but it helps us think more clearly about correctness and reason about subtle concurrency bugs.

Reproducing the AWS Outage Race Condition with a Model Checker | Waqas Younas' blog

TLA+ Modeling of AWS outage DNS race condition

2025-11-06

On Oct 19–20, 2025, AWS’s N. Virginia region suffered a major DynamoDB outage triggered by a DNS automation defect that broke endpoint resol...

TLA+ Modeling of AWS outage DNS race condition

Welcome to Distributed Bytes!

Recent Posts