
Book Overflow Replication, Partitioning, & Transactions - Designing Data-Intensive Applications by Martin Kleppman
Jan 26, 2026
They dig into replication: leader elections, Raft/Paxos, replication lag and strategies like CRDTs. Partitioning and sharding get practical treatment, including hot keys, suffix sharding, and rebalancing. Transactions and isolation are explored, covering ACID components, isolation anomalies, and locking trade-offs. They also debate when eventual consistency is acceptable and modern tooling like managed Postgres.
AI Snips
Chapters
Books
Transcript
Episode notes
Alert On Replication Lag
- Monitor replication lag and alert when it exceeds expected thresholds to detect degraded consistency.
- Treat sudden lag increases as incidents that may affect user-visible behavior.
CRDTs Power Real-Time Collaboration
- Nathan points to CRDTs (conflict-free replicated data types) as practical tools used by Google Docs and Figma for collaborative editing.
- He mentions Martin Kleppman's involvement in AutoMerge, a CRDT project.
Partitioning Is A Balancing Act
- Partitioning (sharding) strategies must balance data distribution, query patterns, and hotspots.
- Poor partition keys create uneven load, complex joins, and costly repartitioning work.



