学习以结构化处理速度变化的多服务员工作 (Learning to Schedule Multi-Server Jobs with Fluctuated Processing Speeds)

Multi-server jobs are imperative in modern cloud computing systems. A noteworthy feature of multi-server jobs is that, they usually request multiple computing devices simultaneously for their execution. How to schedule multi-server jobs online with a high system efficiency is a topic of great concern. Firstly, the scheduling decisions have to satisfy the service locality constraints. Secondly, the scheduling decisions needs to be made online without the knowledge of future job arrivals. Thirdly, and most importantly, the actual service rate experienced by a job is usually in fluctuation because of the dynamic voltage and frequency scaling (DVFS) and power oversubscription techniques when multiple types of jobs co-locate. A majority of online algorithms with theoretical performance guarantees are proposed. However, most of them require the processing speeds to be knowable, thereby the job completion times can be exactly calculated. To present a theoretically guaranteed online scheduling algorithm for multi-server jobs without knowing actual processing speeds apriori, in this paper, we propose ESDP (Efficient Sampling-based Dynamic Programming), which learns the distribution of the fluctuated processing speeds over time and simultaneously seeks to maximize the cumulative overall utility. The cumulative overall utility is formulated as the sum of the utilities of successfully serving each multi-server job minus the penalty on the operating, maintaining, and energy cost. ESDP is proved to have a polynomial complexity and a logarithmic regret, which is a State-of-the-Art result. We also validate it with extensive simulations and the results show that the proposed algorithm outperforms several benchmark policies with improvements by up to 73%, 36%, and 28%, respectively.

翻译：在现代云计算系统中,多服务员的工作是必需的。多服务员工作的一个值得注意的特点是,他们通常同时要求多个计算设备来执行这些任务。如何将多服务员的工作安排在网上,系统效率很高,这是一个令人十分关注的问题。首先,时间安排决定必须满足服务地点的限制。第二,在不知晓未来到达工作的情况下,需要在线做出时间安排决定。第三,而且最重要的是,一个工作的实际服务率通常会波动,因为动态电压和频率缩放(DVFS)以及当多种工作同时办公时,他们通常会同时要求使用多计算器。提出了大多数具有理论性绩效保证的在线算法。然而,大多数这些算法都需要处理速度才能被了解,从而可以准确计算完成工作的时间限制。为了在不知晓实际处理速度的情况下对多服务员工作进行理论上的在线时间安排算法,我们在此文件中建议ESDP(基于精选的缩缩缩缩缩缩缩校准基准)和超标定的超标码技术。我们通过多种时间的处理速度来了解波动的分布,同时力求最大限度地提高整个运行成本,同时使整个服务局的利用率。