As the amount of data explodes rapidly, more and more corporations are using data centers to make effective decisions and gain a competitive edge. Data analysis applications play a significant role in data centers, and hence it has became increasingly important to understand their behaviors in order to further improve the performance of data center computer systems. In this paper, after investigating three most important application domains in terms of page views and daily visitors, we choose eleven representative data analysis workloads and characterize their micro-architectural characteristics by using hardware performance counters, in order to understand the impacts and implications of data analysis workloads on the systems equipped with modern superscalar out-of-order processors. Our study on the workloads reveals that data analysis applications share many inherent characteristics, which place them in a different class from desktop (SPEC CPU2006), HPC (HPCC), and service workloads, including traditional server workloads (SPECweb2005) and scale-out service workloads (four among six benchmarks in CloudSuite), and accordingly we give several recommendations for architecture and system optimizations. On the basis of our workload characterization work, we released a benchmark suite named DCBench for typical datacenter workloads, including data analysis and service workloads, with an open-source license on our project home page on http://prof.ict.ac.cn/DCBench. We hope that DCBench is helpful for performing architecture and small-to-medium scale system researches for datacenter computing.
翻译:由于数据量迅速爆炸,越来越多的公司正在使用数据中心来作出有效决定并获得竞争优势。数据分析应用程序在数据中心中发挥重要作用,因此,理解其行为已变得越来越重要,以便进一步改善数据中心计算机系统的业绩。 在本文中,在对页面浏览量和每日访客这三个最重要的应用领域进行调查之后,我们选择了11个有代表性的数据分析工作量,并通过使用硬件性能计数器来描述其微观结构特征,以便了解数据分析工作量对配备现代超天平超级外级处理器的系统的影响和影响。我们对工作量的研究表明,数据分析应用程序具有许多内在特点,使数据应用具有与桌面(SPEC CPU2006)、HPC(HPCCC)和服务业工作量不同的类别,包括传统服务器工作量(SECECweb2005)和升级服务工作量(云西特6个基准中的4个基准),因此我们对架构和系统优化提出若干公开建议。关于我们工作量定性工作的基础,我们发布了一个名为DCBeBenchch公司的基准套,用于典型的数据中心工作量(Cechlouds-rental liger lichemal lical lichal) 工作,包括用于进行典型数据分析。