Recent works for attributed network clustering utilize graph convolution to obtain node embeddings and simultaneously perform clustering assignments on the embedding space. It is effective since graph convolution combines the structural and attributive information for node embedding learning. However, a major limitation of such works is that the graph convolution only incorporates the attribute information from the local neighborhood of nodes but fails to exploit the mutual affinities between nodes and attributes. In this regard, we propose a variational co-embedding learning model for attributed network clustering (VCLANC). VCLANC is composed of dual variational auto-encoders to simultaneously embed nodes and attributes. Relying on this, the mutual affinity information between nodes and attributes could be reconstructed from the embedding space and served as extra self-supervised knowledge for representation learning. At the same time, trainable Gaussian mixture model is used as priors to infer the node clustering assignments. To strengthen the performance of the inferred clusters, we use a mutual distance loss on the centers of the Gaussian priors and a clustering assignment hardening loss on the node embeddings. Experimental results on four real-world attributed network datasets demonstrate the effectiveness of the proposed VCLANC for attributed network clustering.
翻译:最近关于归属的网络群集的作品利用图形组合,获得嵌入空间的节点嵌入,同时执行嵌入空间的分组任务。这是有效的,因为图形组合将用于嵌入空间的结构性和属性信息结合起来,但是,这种作品的一个主要局限性是,图形组合只包含来自当地节点周围的属性信息,但未能利用节点和属性之间的相互联系。在这方面,我们提议了一个可变共装学习模式,用于定位网络群集(VLACNC)的功能。 VLACNC由双重变式自动编码器组成,可以同时嵌入节点和属性。在此基础上,节点和属性之间的相互亲近性信息可以从嵌入空间中重建,并用作代表学习的超自监督知识。与此同时,可训练高尔的混合物模型被用作推导出节点组合任务(VLACNCM)的预选功能。为了加强被推断的集群群集(VLAC)的性能,我们使用在高尔斯前列中心之间的相互距离损失来同时嵌入节点和在节点网络上的聚合损失。在新节点和属性网络上进行硬化配置,以显示被归属的虚拟网络的功能。