资源需求在时间变化下的利用最大化 (Maximizing Utilization under Time-Varying Resource Requirements)

Low utilization has been one of the key limiting factors for the continued scaling of today's computing systems. One main reason for the low utilization is the temporal variation in the actual resource requirements of jobs, because reserving resources for jobs based on the peak requirements then results in idle resources for non-peak time. To increase utilization, in practice, resources are often overcommitted, so jobs may need to contend for resources at times and experience performance degradation, incurring a cost. To make use of the flexibility in resource allocation provided by overcommitment to the best advantage, it is critical to answer the following fundamental question that has not been well studied: given an acceptable budget for the cost associated with resource contention, how can we achieve the maximum utilization of the system? In this paper, we propose a job model that captures the time-varying resource requirement of each job by a Markov chain. We consider a stochastic formulation of the job dispatch problem in an infinite-server system. Our goal is to maximize system utilization by minimizing the number of active servers (servers running at least one job), subject to a budget on the cost of resource contention. Our main result is the design of a job dispatch policy that is asymptotically optimal as the job arrival rate increases. The novel technical framework that we develop reduces the policy design problem under study to that in a single-server system through policy conversions, which may be of independent interest. The framework allows us to design the asymptotically optimal policy by solving a linear program.

翻译：低利用率是持续扩大当今计算系统的关键限制因素之一。低利用率的一个主要原因是工作实际资源需求的时间变化,因为根据高峰需求为工作保留资源会导致闲置资源,从而导致非高峰时间的闲置资源。实际上,为了增加资源利用率,工作往往过于投入,因此,工作可能需要有时争夺资源,经历业绩退化,造成成本。为了利用过度承诺所提供的资源分配灵活性,以最佳优势最佳利用,必须回答以下尚未研究的基本问题:鉴于与资源争议相关的费用预算可以接受,我们如何能够实现系统的最大利用率?在本文件中,我们提出一个工作模式,抓住Markov系统对每项工作的时间变化资源的需求。我们考虑在一个无限服务器系统中对工作发送问题进行随机拼凑的配方。我们的目标是通过最大限度地减少运行至少一项工作的活跃服务器的数量,最大限度地利用系统,以资源争议所涉预算为准。我们的主要结果是,通过设计单一设计政策框架,通过优化政策框架来降低工作效率,从而降低工作到升级的利率。我们的主要结果是,在设计过程中,通过一个技术工具,通过设计框架来降低工作到升级的利率。