In large-scale distributed systems, balancing the load in an efficient way is crucial in order to achieve low latency. Recently, some load balancing policies have been suggested which are able to achieve a bounded maximum queue length in the large-scale limit. However, these policies have thus far only been studied in case of exponential job sizes. As job sizes are more variable in real systems, we investigate how the performance of these policies (and in particular the value of these bounds) is impacted by the job size distribution. We present a unified analysis which can be used to compute the bound on the queue length in case of phase-type distributed job sizes for four load balancing policies. We find that in most cases, the bound on the maximum queue length can be expressed in closed form. In addition, we obtain job size (in)dependent bounds on the expected response time. Our methodology relies on the use of the cavity process. That is, we conjecture that the cavity process captures the behaviour of the real system as the system size grows large. For each policy, we illustrate the accuracy of the cavity process by means of simulation.
翻译:在大型分布式系统中,以有效的方式平衡负荷对于实现低延迟至关重要。最近,提出了一些负负平衡政策建议,这些政策能够达到大尺度限制的最大排队长度。然而,迄今为止,这些政策仅是在指数性工作规模的情况下研究的。由于工作规模在实际系统中比较多变,我们调查这些政策的绩效(特别是这些界限的价值)如何受到工作规模分布的影响。我们提出了一个统一的分析,可以用来计算四级级类型分布式工作规模对排队长度的界限。我们发现,在大多数情况下,对最大排队长度的束缚可以用封闭的形式表示。此外,我们获得的工作规模(取决于预期的反应时间)取决于预期的反应时间。我们的方法依赖于使用洞穴过程。也就是说,我们推测,随着系统规模的扩大,洞穴过程能够捕捉到实际系统的行为。我们用模拟手段来说明洞穴过程的准确性。