Deep graph clustering, which aims to reveal the underlying graph structure and divide the nodes into different groups, has attracted intensive attention in recent years. However, we observe that, in the process of node encoding, existing methods suffer from representation collapse which tends to map all data into the same representation. Consequently, the discriminative capability of the node representation is limited, leading to unsatisfied clustering performance. To address this issue, we propose a novel self-supervised deep graph clustering method termed Dual Correlation Reduction Network (DCRN) by reducing information correlation in a dual manner. Specifically, in our method, we first design a siamese network to encode samples. Then by forcing the cross-view sample correlation matrix and cross-view feature correlation matrix to approximate two identity matrices, respectively, we reduce the information correlation in the dual-level, thus improving the discriminative capability of the resulting features. Moreover, in order to alleviate representation collapse caused by over-smoothing in GCN, we introduce a propagation regularization term to enable the network to gain long-distance information with the shallow network structure. Extensive experimental results on six benchmark datasets demonstrate the effectiveness of the proposed DCRN against the existing state-of-the-art methods.
翻译:深图群集旨在揭示基本图形结构并将节点分为不同的组别,近年来引起了人们的高度关注。然而,我们注意到,在节点编码过程中,现有方法存在代表比例崩溃,往往将所有数据映射成相同的代表比例。因此,节点代表的差别性能力有限,导致不满意的组合性表现。为解决这一问题,我们建议采用一种新的自我监督的深层图形群集方法,称为“双重关联减少网络”(DCRN),通过双重方式减少信息相关性。具体地说,我们用方法先设计一个Siamese网络对样本进行编码。然后,通过将交叉视图样本关联矩阵和交叉视图特征关联矩阵分别逼近两个身份矩阵,我们减少了双层信息的相关性,从而提高了由此形成的特征的差别性能力。此外,为了减轻因GCN的过度移动而导致的代表比例崩溃,我们引入了传播正规化术语,使网络能够从浅色网络结构获得长距离的信息。然后,通过将六种基准数据集的实验结果对现有的DCR方法进行广泛的实验性展示。