This paper considers structures of systems beyond dyadic (pairwise) interactions and investigates mathematical modeling of multi-way interactions and connections as hypergraphs, where captured relationships among system entities are set-valued. To date, in most situations, entities in a hypergraph are considered connected as long as there is at least one common "neighbor". However, minimal commonality sometimes discards the "strength" of connections and interactions among groups. To this end, considering the "width" of a connection, referred to as the $s$-overlap of neighbors, provides more meaningful insights into how closely the communities or entities interact with each other. In addition, $s$-overlap computation is the fundamental kernel to construct the line graph of a hypergraph, a low-order approximation of the hypergraph which can carry significant information about the original hypergraph. Subsequent stages of a data analytics pipeline then can apply highly-tuned graph algorithms on the line graph to reveal important features. Given a hypergraph, computing the $s$-overlaps by exhaustively considering all pairwise entities can be computationally prohibitive. To tackle this challenge, we develop efficient algorithms to compute $s$-overlaps and the corresponding line graph of a hypergraph. We propose several heuristics to avoid execution of redundant work and improve performance of the $s$-overlap computation. Our parallel algorithm, combined with these heuristics, is orders of magnitude (more than $10\times$) faster than the naive algorithm in all cases and the SpGEMM algorithm with filtration in most cases (especially with large $s$ value).
翻译:本文考虑超越dyadi( pairwise) 互动的系统结构, 并调查多路互动和连接的数学模型的数学模型, 即高光谱, 系统实体之间截取的关系被设定为定值。 到目前为止, 高光谱中的实体只要至少有一个共同的“ 邻居”, 就会被视为连接。 然而, 最小的共性有时会丢弃各组之间连接和互动的“ 强度 ” 。 为此, 考虑到连接的“ 强度 ”, 即邻居的超额, 更有意义地揭示社区或实体彼此之间互动的距离。 此外, 美元超额计算是构建高光谱线图的基本内核。 高光谱中, 数据分析管道之后的阶段可以应用高度调的图表算法来显示重要特征。 高超标值计算美元, 以所有正比值的实体之间更密切地计算。 此外, 美元计算成本计算是最高值, 与高超值的计算, 我们的算算算法 。