Coflow scheduling improves data-intensive application performance by improving their networking performance. State-of-the-art online coflow schedulers in essence approximate the classic Shortest-Job-First (SJF) scheduling by learning the coflow size online. In particular, they use multiple priority queues to simultaneously accomplish two goals: to sieve long coflows from short coflows, and to schedule short coflows with high priorities. Such a mechanism pays high overhead in learning the coflow size: moving a large coflow across the queues delays small and other large coflows, and moving similar-sized coflows across the queues results in inadvertent round-robin scheduling. We propose Philae, a new online coflow scheduler that exploits the spatial dimension of coflows, i.e., a coflow has many flows, to drastically reduce the overhead of coflow size learning. Philae pre-schedules sampled flows of each coflow and uses their sizes to estimate the average flow size of the coflow. It then resorts to Shortest Coflow First, where the notion of shortest is determined using the learned coflow sizes and coflow contention. We show that the sampling-based learning is robust to flow size skew and has the added benefit of much improved scalability from reduced coordinator-local agent interactions. Our evaluation using an Azure testbed, a publicly available production cluster trace from Facebook shows that compared to the prior art Aalo, Philae reduces the coflow completion time (CCT) in average (P90) cases by 1.50x (8.00x) on a 150-node testbed and 2.72x (9.78x) on a 900-node testbed. Evaluation using additional traces further demonstrates Philae's robustness to flow size skew.
翻译:科流列表通过改善网络化性能,提高了数据密集应用程序的运行表现。 最先进的在线在线联流调度员在本质上接近经典的 Shortest- Job- First (SJF) 在线联流规模。 特别是, 他们使用多个优先队列来同时实现两个目标: 筛选从短串流的长串流, 并排短串流, 并安排高优先级的短期连流。 这种机制在学习连流规模时支付高额管理费: 将大量串流移动到队列中, 延迟小和其他大型连流, 并在整个队列中移动类似规模的连流, 结果是不为可上流转。 我们提议Philae, 一个新的在线连流线调度调度员, 即: 利用短期流流流流, 大幅流, 大幅降低连流, 以大幅降低连流的连流。 Phile, 使用基于双流的直流的直径流, 降低前期的流, 使用前期的直径流, 和前期的直径流, 将显示我们现有直流的直流的直流, 快速的直流, 显示为SLelex 流, 流, 递的直路路路路路路路路路路路, 流, 演示的比的比 显示的直路路路路路流, 流, 通过S 流 流 流 流, 流 流, 向 向 显示的直路, 向 递 递 递 递 递 递 递 递 递 递 递 递 递 递 。