Attributed graph clustering is one of the most important tasks in graph analysis field, the goal of which is to group nodes with similar representations into the same cluster without manual guidance. Recent studies based on graph contrastive learning have achieved impressive results in processing graph-structured data. However, existing graph contrastive learning based methods 1) do not directly address the clustering task, since the representation learning and clustering process are separated; 2) depend too much on graph data augmentation, which greatly limits the capability of contrastive learning; 3) ignore the contrastive message for subspace clustering. To accommodate the aforementioned issues, we propose a generic framework called Dual Contrastive Attributed Graph Clustering Network (DCAGC). In DCAGC, by leveraging Neighborhood Contrast Module, the similarity of the neighbor nodes will be maximized and the quality of the node representation will be improved. Meanwhile, the Contrastive Self-Expression Module is built by minimizing the node representation before and after the reconstruction of the self-expression layer to obtain a discriminative self-expression matrix for spectral clustering. All the modules of DCAGC are trained and optimized in a unified framework, so the learned node representation contains clustering-oriented messages. Extensive experimental results on four attributed graph datasets show the superiority of DCAGC compared with 16 state-of-the-art clustering methods. The code of this paper is available at https://github.com/wangtong627/Dual-Contrastive-Attributed-Graph-Clustering-Network.
翻译:图形化的图形群集是图形分析字段中最重要的任务之一,其目标之一是将具有类似表达面的节点分组成同一组群,而无需人工指导。最近基于图形对比学习的研究在处理图形结构数据方面取得了令人印象深刻的成果。然而,现有的图形对比学习方法1 并不直接处理组合任务,因为演示学习和分组进程是分开的;2 过于依赖图表数据增强,这大大限制了对比学习的能力;3 忽视子空间群集的对比性信息。为了容纳上述问题,我们提议了一个通用框架,称为“双对立属性图形群集网(DCAGC)。在DCAGC中,通过利用邻接点对比模块,相近节点的相似性将得到最大化,节点代表的质量将得到提高。同时,对比自解调模块的构建方式是尽量减少节点代表性,在自我表达层层层群集之前和重建后,获得一个有区别性的自我表达式的自我表达式矩阵。DCAGC的所有模块都经过培训,并且优化了DA-D级群集在统一的图像结构框架中,这种面向的模型展示了16个图像的模型,展示了可理解的模型,展示了数据库的模型的模型显示了16级压式的模型的模型。