Traditionally, on-demand, rigid, and malleable applications have been scheduled and executed on separate systems. Ever-growing workload demands and rapidly developing HPC infrastructure have spurred interest in converging these applications on a single HPC system. Although co-locating such hybrid workloads on one system could improve system efficiency, it is difficult to balance the responsiveness of on-demand requests, the incentive for malleable jobs, and the performance of rigid applications. In this study, we present several scheduling mechanisms that address the issues involved in co-scheduling on-demand, rigid, and malleable jobs on a single HPC system. We extensively evaluate and compare their performance under various configurations and workloads. Our experimental results show that the proposed mechanisms are capable of serving on-demand workloads with minimal delay, offering incentives for declaring malleability, and improving system performance.