We study the balanced $k$-way hypergraph partitioning problem, with a special focus on its practical applications to manycore scheduling. Given a hypergraph on $n$ nodes, our goal is to partition the node set into $k$ parts of size at most $(1+\epsilon)\cdot \frac{n}{k}$ each, while minimizing the cost of the partitioning, defined as the number of cut hyperedges, possibly also weighted by the number of partitions they intersect. We show that this problem cannot be approximated to within a $n^{1/\text{poly} \log\log n}$ factor of the optimal solution in polynomial time if the Exponential Time Hypothesis holds, even for hypergraphs of maximal degree 2. We also study the hardness of the partitioning problem from a parameterized complexity perspective, and in the more general case when we have multiple balance constraints. Furthermore, we consider two extensions of the partitioning problem that are motivated from practical considerations. Firstly, we introduce the concept of hyperDAGs to model precedence-constrained computations as hypergraphs, and we analyze the adaptation of the balanced partitioning problem to this case. Secondly, we study the hierarchical partitioning problem to model hierarchical NUMA (non-uniform memory access) effects in modern computer architectures, and we show that ignoring this hierarchical aspect of the communication cost can yield significantly weaker solutions.
翻译:我们研究了平衡的 $k$-way 超图分割问题,特别关注其在许多核心调度中的实际应用。给定一个 $n$ 节点的超图,我们的目标是将节点集合分成最多 $(1+\epsilon)\cdot \frac{n}{k}$ 个部分,并尽可能地减少分割的成本,成本由切割的超边数定义,可能也由其相交的分区数加权得出。我们证明,如果指数时间假设成立,则即使对于最大度为 2 的超图,该问题也无法在多项式时间内近似到最优解的 $n^{1 / \text {poly}\log\log n}$ 倍。从参数化复杂度的角度以及在有多个平衡约束的更一般情况下,我们还研究了分割问题的难度。此外,我们考虑了两个从实际考虑出发的分割问题扩展。首先,我们引入了超 DAGs 的概念,将有先决约束的计算建模为超图,并分析了平衡分割问题在此情况下的适应性。其次,我们研究了层次分割问题,以模拟现代计算机体系结构中的分层 NUMA(非统一存储访问)效应,并证明忽略通信成本的分层特征可能会导致较弱的解。