Learning community structures in graphs has broad applications across scientific domains. While graph neural networks (GNNs) have been successful in encoding graph structures, existing GNN-based methods for community detection are limited by requiring knowledge of the number of communities in advance, in addition to lacking a proper probabilistic formulation to handle uncertainty. We propose a simple framework for amortized community detection, which addresses both of these issues by combining the expressive power of GNNs with recent methods for amortized clustering. Our models consist of a graph representation backbone that extracts structural information and an amortized clustering network that naturally handles variable numbers of clusters. Both components combine into well-defined models of the posterior distribution of graph communities and are jointly optimized given labeled graphs. At inference time, the models yield parallel samples from the posterior of community labels, quantifying uncertainty in a principled way. We evaluate several models from our framework on synthetic and real datasets and demonstrate superior performance to previous methods. As a separate contribution, we extend recent amortized probabilistic clustering architectures by adding attention modules, which yield further improvements on community detection tasks.
翻译:图表中的学习社区结构在科学领域具有广泛的应用性。虽然图形神经网络(GNNs)在编码图形结构中取得了成功,但现有的基于GNN的共同体探测方法因事先要求了解社区数量而受到限制,因为除了缺乏处理不确定性的适当概率配方外,还需要事先了解社区数量;我们建议了一个用于摊销社区检测的简单框架,通过将GNNs的表达力与最新摊销组合方法相结合,解决上述两个问题。我们的模型包括一个图形代表主干柱,可以提取结构信息,一个自动处理不同组群数的摊合组合网络。两个组成部分都结合到图表社区后方分布的明确界定模型中,并且共同优化了给定标签的图表。在推断时,模型从社区标签的外表生成平行样本,以有原则的方式量化不确定性。我们从我们的合成和真实数据集框架中评估了几个模型,并展示了优异的性。作为一个单独的贡献,我们通过增加关注模块,推广了近期的和解稳定组合组合组合组合组合结构,从而进一步改进了社区探测任务。