Datacenter networks routinely support the data transfers of distributed computing frameworks in the form of coflows, i.e., sets of concurrent flows related to a common task. The vast majority of the literature has focused on the problem of scheduling coflows for completion time minimization, i.e., to maximize the average rate at which coflows are dispatched in the network fabric. However, many modern applications generate coflows dedicated to online services and mission-critical computing tasks which have to comply with specific completion deadlines. In this paper, we introduce $\mathtt{WDCoflow}$, a new algorithm to maximize the weighted number of coflows that complete before their deadline. By combining a dynamic programming algorithm along with parallel inequalities, our heuristic solution performs at once coflow admission control and coflow prioritization, imposing a $\sigma$-order on the set of coflows. With extensive simulation, we demonstrate the effectiveness of our algorithm in improving up to $3\times$ more coflows that meet their deadline in comparison the best SotA solution, namely $\mathtt{CS\text{-}MHA}$. Furthermore, when weights are used to differentiate coflow classes, $\mathtt{WDCoflow}$ is able to improve the admission per class up to $4\times$, while increasing the average weighted coflow admission rate.
翻译:数据中心网络通常支持分布式计算框架的数据传输,其中Coflows是指与共同任务相关的并发流集。绝大部分文献都集中在按完成时间最小化的Coflows调度问题上,即最大化网络内部调度平均速率。然而,许多现代应用程序生成的Coflows都是专门用于在线服务和关键任务计算,这些Coflows必须遵守特定的完成时限。本文介绍了一种新算法 $\mathtt{WDCoflow}$,以最大化在截止日期前完成的加权Coflows个数。通过结合动态规划算法和并行不等式,我们的启发式方法一次性执行Coflow入站控制和Coflow优先排序,对Coflows集合施加 $\sigma-$顺序。通过广泛的仿真,我们证明了我们的算法有效性,在与最佳SotA解决方案 $\mathtt{CS\text{-}MHA}$ 相比,我们的算法在满足其截止日期的Coflows方面实现了高达 $3\times$ 的改进。此外,当使用权重区分Coflow类别时, $\mathtt{WDCoflow}$ 能够将每个类的入站控制提高高达 $4\times$ ,同时增加平均加权Coflow入站速率。