Efficiently exploiting servers in data centers requires performance analysis methods that account not only for the stochastic nature of demand but also for server heterogeneity. Although several recent works proved optimality results for heterogeneity-aware variants of classical load-balancing algorithms in the many-server regime, we still lack a fundamental understanding of the impact of heterogeneity on performance in finite-size systems. In this paper, we consider a load-balancing algorithm that leads to a product-form queueing model and can therefore be analyzed exactly even when the number of servers is finite. We develop new analytical methods that exploit its product-form stationary distribution to understand the joint impact of the speeds and buffer lengths of servers on performance. These analytical results are supported and complemented by numerical evaluations that cover a large variety of scenarios.
翻译:有效利用数据中心的服务器需要绩效分析方法,这些方法不仅考虑到需求的随机性,而且考虑到服务器的异质性。虽然最近的一些工程证明在许多服务器系统中,传统负载平衡算法的异异异性具有最佳性,但我们仍对异性对有限尺寸系统性能的影响缺乏基本了解。在本文中,我们考虑的是导致产品形式排队模式的负载平衡算法,因此即使在服务器数量有限的情况下也可以精确地加以分析。我们开发了新的分析方法,利用其产品形式固定分布来理解服务器速度和缓冲长度对性能的共同影响。这些分析结果得到涵盖多种情景的数字评估的支持和补充。