TY - GEN
T1 - Mako
T2 - 19th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2025
AU - Shen, Weihai
AU - Cui, Yang
AU - Sen, Siddhartha
AU - Angel, Sebastian
AU - Mu, Shuai
N1 - Publisher Copyright:
© 2025 by The USENIX Association. All rights reserved.
PY - 2025
Y1 - 2025
N2 - This paper introduces Mako, a highly available, high-throughput, and horizontally scalable transactional key-value store. Mako performs strongly consistent geo-replication to maintain availability despite entire datacenter failures, uses multi-core machines for fast serializable transaction processing, and shards data to scale out. To achieve these properties, especially to overcome the overheads of distributed transactions in geo-replicated settings, Mako decouples transaction execution and replication. This enables Mako to run transactions speculatively and very fast, and replicate transactions in the background to make them fault-tolerant. The key innovation in Mako is the use of two-phase commit (2PC) speculatively to allow distributed transactions to proceed without having to wait for their decisions to be replicated, while also preventing unbounded cascading aborts if shards fail prior to the end of replication. Our experimental evaluation on Azure shows that Mako processes 3.66M TPC-C transactions per second when data is split across 10 shards, each of which runs with 24 threads. This is an 8.6× higher throughput than state-of-the-art systems optimized for geo-replication.
AB - This paper introduces Mako, a highly available, high-throughput, and horizontally scalable transactional key-value store. Mako performs strongly consistent geo-replication to maintain availability despite entire datacenter failures, uses multi-core machines for fast serializable transaction processing, and shards data to scale out. To achieve these properties, especially to overcome the overheads of distributed transactions in geo-replicated settings, Mako decouples transaction execution and replication. This enables Mako to run transactions speculatively and very fast, and replicate transactions in the background to make them fault-tolerant. The key innovation in Mako is the use of two-phase commit (2PC) speculatively to allow distributed transactions to proceed without having to wait for their decisions to be replicated, while also preventing unbounded cascading aborts if shards fail prior to the end of replication. Our experimental evaluation on Azure shows that Mako processes 3.66M TPC-C transactions per second when data is split across 10 shards, each of which runs with 24 threads. This is an 8.6× higher throughput than state-of-the-art systems optimized for geo-replication.
UR - https://www.scopus.com/pages/publications/105011599776
M3 - Conference contribution
AN - SCOPUS:105011599776
T3 - Proceedings of the 19th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2025
SP - 129
EP - 152
BT - Proceedings of the 19th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2025
PB - USENIX Association
Y2 - 7 July 2025 through 9 July 2025
ER -