We propose GraphMineSuite (GMS): the first benchmarking suite for graph mining that facilitates evaluating and constructing high-performance graph mining algorithms. First, GMS comes with a benchmark specification based on extensive literature review, prescribing representative problems, algorithms, and datasets. Second, GMS offers a carefully designed software platform for seamless testing of different fine-grained elements of graph mining algorithms, such as graph representations or algorithm subroutines. The platform includes parallel implementations of more than 40 considered baselines, and it facilitates developing complex and fast mining algorithms. High modularity is possible by harnessing set algebra operations such as set intersection and difference, which enables breaking complex graph mining algorithms into simple building blocks that can be separately experimented with. GMS is supported with a broad concurrency analysis for portability in performance insights, and a novel performance metric to assess the throughput of graph mining algorithms, enabling more insightful evaluation. As use cases, we harness GMS to rapidly redesign and accelerate state-of-the-art baselines of core graph mining problems: degeneracy reordering (by up to >2x), maximal clique listing (by up to >9x), k-clique listing (by 1.1x), and subgraph isomorphism (by up to 2.5x), also obtaining better theoretical performance bounds.
翻译:我们提议GreaphMineSuite(GMS):为图形采矿设计的第一个基准套件,该套套套件有助于评估和构建高性能图形采矿算法。首先,GMS采用基于广泛文献审查的基准规格,规定了代表性问题、算法和数据集。第二,GMS提供了一个精心设计的软件平台,用于无缝测试图形采矿算法的不同细微细元素,如图示或亚程算法。该平台包括平行实施40多个考虑的基线,它有利于开发复杂和快速的采矿算法。通过利用固定交叉和差异等固定的代数操作,可以实现高模块化,从而能够将复杂的图形采矿算法破碎成可以单独试验的简单建筑块。GMS得到支持,通过对业绩洞察的可移植性进行广泛的同值分析,以及采用新的性能衡量标准来评估图形采矿算法的乘数,从而能够进行更深刻的评价。作为案例,我们利用GMS系统快速重新设计和加速州立的图形采矿问题基线。通过利用定序(通过 >2x的交叉和差异)等固定的代数操作,将复杂的图表采矿算法升级(也通过25x获得更好的分级)和排序。