Processing massive application graphs on distributed memory systems requires to map the graphs onto the system's processing elements (PEs). This task becomes all the more important when PEs have non-uniform communication costs or the input is highly irregular. Typically, mapping is addressed using partitioning, in a two-step approach or an integrated one. Parallel partitioning tools do exist; yet, corresponding mapping algorithms or their public implementations all have major sequential parts or other severe scaling limitations. In this paper, we propose a parallel algorithm that maps graphs onto the PEs of a hierarchical system. Our solution integrates partitioning and mapping; it models the system hierarchy in a concise way as an implicit labeled tree. The vertices of the application graph are labeled as well, and these vertex labels induce the mapping. The mapping optimization follows the basic idea of parallel label propagation, but we tailor the gain computations of label changes to quickly account for the induced communication costs. Our MPI-based code is the first public implementation of a parallel graph mapping algorithm; to this end, we extend the partitioning library ParHIP. To evaluate our algorithm's implementation, we perform comparative experiments with complex networks in the million- and billion-scale range. In general our mapping tool shows good scalability on up to a few thousand PEs. Compared to other MPI-based competitors, our algorithm achieves the best speed to quality trade-off and our quality results are even better than non-parallel mapping tools.
翻译:分布式内存系统中的大规模应用程序图处理要求将图表映射到系统中的处理元素( PE) 。 当 PE 具有非统一通信成本或输入高度不规则时,这一任务就变得更加重要。 通常地, 映射会用分割法、 两步方法或集成方法来处理。 平行分割工具确实存在; 相应的映射算法或其公共实施都具有重要的相继部分或其他严重缩放限制。 在本文中, 我们提议了一个平行的算法, 将图形映射到系统处理元素的 PE 上。 我们的解算法将分区和映射整合; 它以简洁的方式将系统等级结构建成隐含标签的树。 应用程序图的顶端也贴上标签, 而这些顶端标签会引导绘图。 映射最优化会遵循平行标签传播的基本理念, 但是我们将标签变更的收益计算方法调整为快速计算引致通信成本。 我们基于 MPI 的代码是首次公开实施平行的图形映射算法; 到此端, 我们将共享图书馆 剖析系统以简洁的方式建成为隐含图树树树树树树树。 为了评估我们的系统的质量图, 我们的图质量图, 我们的顶部质量图的顶部的顶部的顶部的图, 将我们进行着比测算的网络的比测算取到更精确的网络, 我们的比的模型的轨道, 上, 我们在千年的比的比的轨道上, 我们的轨道上, 我们的算法将进行到更精确度为10亿次的算取的算取的轨道,, 我们的算取的算取的实验室的算取的实验室的轨道的比的比的比的比的比的比的轨道, 。