With the explosive growth of multi-source data, multi-view clustering has attracted great attention in recent years. Most existing multi-view methods operate in raw feature space and heavily depend on the quality of original feature representation. Moreover, they are often designed for feature data and ignore the rich topology structure information. Accordingly, in this paper, we propose a generic framework to cluster both attribute and graph data with heterogeneous features. It is capable of exploring the interplay between feature and structure. Specifically, we first adopt graph filtering technique to eliminate high-frequency noise to achieve a clustering-friendly smooth representation. To handle the scalability challenge, we develop a novel sampling strategy to improve the quality of anchors. Extensive experiments on attribute and graph benchmarks demonstrate the superiority of our approach with respect to state-of-the-art approaches.
翻译:随着多来源数据的爆炸性增长,多视角集群近年来引起了极大的注意,大多数现有多视角方法在原始地物空间运作,严重依赖原始地物表示质量;此外,这些多视角方法往往设计为地物数据,忽视丰富的地貌结构信息;因此,在本文件中,我们提出一个通用框架,将属性数据和图表数据组合在一起,并具有多种特征;能够探索特征和结构之间的相互作用;具体地说,我们首先采用图表过滤技术,消除高频噪音,以便实现有利于集群的平稳代表。为了应对可缩放性挑战,我们制定了新的取样战略,以提高锚点的质量。关于属性和图表基准的广泛实验显示了我们在最新方法方面的优势。