In recent years, column stores (or C-stores for short) have emerged as a novel approach to deal with read-mostly data warehousing applications. Experimental evidence suggests that, for certain types of queries, the new features of C-stores result in orders of magnitude improvement over traditional relational engines. At the same time, some C-store proponents argue that C-stores are fundamentally different from traditional engines, and therefore their benefits cannot be incorporated into a relational engine short of a complete rewrite. In this paper we challenge this claim and show that many of the benefits of C-stores can indeed be simulated in traditional engines with no changes whatsoever. We then identify some limitations of our ?pure-simulation? approach for the case of more complex queries. Finally, we predict that traditional relational engines will eventually leverage most of the benefits of C-stores natively, as is currently happening in other domains such as XML data.
翻译:近年来,专栏仓库(或简称C储存室)已成为处理多数数据存储应用程序的新颖方法。实验证据表明,对于某些类型的查询,C类商店的新特点导致传统关系引擎的量级改进。与此同时,一些C类商店的支持者认为,C类商店与传统引擎有根本的不同,因此,它们的好处无法在完全改写之前纳入一个关系引擎。在本文中,我们质疑这一主张,并表明C类商店的许多好处确实可以在传统引擎中模拟,没有任何变化。然后我们找出了我们更复杂查询的“纯模拟?”方法的一些局限性。最后,我们预测,传统关系引擎最终将利用C类商店本地的大部分好处,正如目前在XML数据等其他领域正在发生的那样。