Hypergraphs are generalisation of graphs in which a hyperedge can connect any number of vertices. It can describe n-ary relationships and high-order information among entities compared to conventional graphs. In this paper, we study the fundamental problem of subgraph matching on hypergraphs (i.e, subhypergraph matching). Existing methods directly extend subgraph matching algorithms to the case of hypergraphs. However, this approach delays hyperedge verification and underutilises the high-order information in hypergraphs, which leads to large search space and high enumeration cost. Furthermore, with the growing size of hypergraphs, it is becoming hard to compute subhypergraph matching sequentially. Thus, we propose an efficient and parallel subhypergraph matching system, HGMatch, to handle subhypergraph matching in massive hypergraphs. We proposes a novel match-by-hyperedge framework to utilise high-order information in hypergraphs and uses set operations for efficient candidates generation. Moreover, we develop an optimised parallel execution engine in HGMatch based on the dataflow model, which features a task-based scheduler and fine-grained dynamic work stealing to achieve bounded memory execution and better load balancing. Experimental evaluation on 10 real-world datasets shows that HGMatch outperforms the extended version of the state-of-the-art subgraph matching algorithms (CFL, DAF, CECI and RapidMatch) by orders of magnitude when using a single thread, and achieves almost linear scalability when the number of threads increases.
翻译:超格是图表的概括化, 高端可以连接任何数量的顶点。 它可以描述实体与常规图形相比的正负关系和高端信息。 在本文中, 我们研究高压比对( 亚高压比对) 的子对齐基问题。 现有方法将子对齐算算法直接扩展至高压比对法。 但是, 这种方法会延迟超格核查, 并低估高端高端信息, 从而导致大量搜索空间和高查点成本。 此外, 随着超格学规模的不断增大, 各实体之间的正变得难以进行子对齐的子对齐。 因此, 我们建议建立一个高效和平行的子对齐匹配系统( 即子对比对齐) 。 现有方法将子对齐算算算算算法的算法直接扩展到高端点信息, 并使用高端点生成高效候选人生成的操作。 此外, 我们根据数据流流模型, 在HGMatch中开发一个优化的平行执行引擎引擎引擎引擎, 将基于基于基于基于基于基于任务流流流流压的直径直径直径直径测的直线比比匹配匹配比值的直径系统, 的直径直径的直径对等的直径路路路路段路段路段路段路段,, 运行的缩缩缩缩缩缩缩缩的算, 以在运行的运行的直径路面的直径路面的直径路路段运行的运行的运行的运行的运行式运行式运行式平式平压式路路路路段路段路段路段路面的运行式路段,,, 。