Blockchain has come a long way: a system that was initially proposed specifically for cryptocurrencies is now being adapted and adopted as a general-purpose transactional system. As blockchain evolves into another data management system, the natural question is how it compares against distributed database systems. Existing works on this comparison focus on high-level properties, such as security and throughput. They stop short of showing how the underlying design choices contribute to the overall differences. Our work fills this important gap and provides a principled framework for analyzing the emerging trend of blockchain-database fusion. We perform a twin study of blockchains and distributed database systems as two types of transactional systems. We propose a taxonomy that illustrates the dichotomy across four dimensions, namely replication, concurrency, storage, and sharding. Within each dimension, we discuss how the design choices are driven by two goals: security for blockchains, and performance for distributed databases. To expose the impact of different design choices on the overall performance, we conduct an in-depth performance analysis of two blockchains, namely Quorum and Hyperledger Fabric, and two distributed databases, namely TiDB, and etcd. Lastly, we propose a framework for back-of-the-envelope performance forecast of blockchain-database hybrids.
翻译:长途跋涉:一个最初专门提议用于加密的系统正在作为通用交易系统加以改造和采用。当块链进入另一个数据管理系统时,自然的问题是它如何与分布式数据库系统进行比较。关于这种比较的现有工作侧重于高层次的属性,例如安全和吞吐量。它们没有显示基本设计选择是如何造成总体差异的。我们的工作填补了这一重要的空白,并为分析块链数据库融合的新趋势提供了一个原则性框架。我们对作为两类交易系统的块链和分布式数据库系统进行了双轨研究。我们建议了一种分类学,以说明在四个方面,即复制、计算、储存和裁断。在每一个方面,我们讨论设计选择是如何由两个目标驱动的:障碍链的安全以及分布式数据库的性能。为了暴露不同设计选择对总体业绩的影响,我们对两个块链进行深入的绩效分析,即数字链和超移动式数据库,以及两个分布式数据库,即混合数据库的运行状态框架。