Community detection, aiming to group the graph nodes into clusters with dense inner-connection, is a fundamental graph mining task. Recently, it has been studied on the heterogeneous graph, which contains multiple types of nodes and edges, posing great challenges for modeling the high-order relationship between nodes. With the surge of graph embedding mechanism, it has also been adopted to community detection. A remarkable group of works use the meta-path to capture the high-order relationship between nodes and embed them into nodes' embedding to facilitate community detection. However, defining meaningful meta-paths requires much domain knowledge, which largely limits their applications, especially on schema-rich heterogeneous graphs like knowledge graphs. To alleviate this issue, in this paper, we propose to exploit the context path to capture the high-order relationship between nodes, and build a Context Path-based Graph Neural Network (CP-GNN) model. It recursively embeds the high-order relationship between nodes into the node embedding with attention mechanisms to discriminate the importance of different relationships. By maximizing the expectation of the co-occurrence of nodes connected by context paths, the model can learn the nodes' embeddings that both well preserve the high-order relationship between nodes and are helpful for community detection. Extensive experimental results on four real-world datasets show that CP-GNN outperforms the state-of-the-art community detection methods.
翻译:社区探测旨在将图形节点分组成密集内部连接的群集,是一项基本的图形采矿任务。最近,在包含多种类型节点和边缘的混杂图上研究了它,这对制作节点之间的高度秩序关系模型提出了巨大的挑战。随着图形嵌入机制的激增,它也被采纳到社区探测中。一个了不起的工程群利用元路径捕捉节点之间的高秩序关系,将其嵌入节点嵌入结点的嵌入网点,以便利社区探测。然而,确定有意义的元路径需要许多域知识,这在很大程度上限制了它们的应用,特别是在精密的混合图中。为了缓解这一问题,我们在本文件中提议利用上下文路径捕捉节点之间的高秩序关系,并建立一个基于背景路径的图表神经网络模型(CP-GNNN)模型。它将节点之间的高秩序关系嵌入节点嵌入网络,以便区分不同关系的重要性。通过尽可能扩大节点的预想,特别是精密的混合的混合图像图图图图,我们建议利用上的环境路径来捕捉取节点点的同步连接的预想点点点点点点点点,通过高轨道来保存四个实验模型,从而保持真实的状态的路径。