Modern data centers and high-volume cloud services employ a broad set of heterogeneous servers. In such settings, client requests typically arrive at multiple entry points, and dispatching them to servers is a pressing distributed systems problem. This paper presents an efficient solution to the load balancing problem in such systems that overcomes the shortcomings of previous solutions. The load balancing problem is formulated as a stochastic optimization problem, and an efficient algorithmic solution is derived from a careful mathematical analysis of that formulation. An extensive evaluation on simulated data shows that the solution outperforms previous ones. Moreover, the resulting dispatching policy can be computed very efficiently, making the solution practically viable.