Despite significant progress, there remain three limitations to the previous multi-view clustering algorithms. First, they often suffer from high computational complexity, restricting their feasibility for large-scale datasets. Second, they typically fuse multi-view information via one-stage fusion, neglecting the possibilities in multi-stage fusions. Third, dataset-specific hyperparameter-tuning is frequently required, further undermining their practicability. In light of this, we propose a fast multi-view clustering via ensembles (FastMICE) approach. Particularly, the concept of random view groups is presented to capture the versatile view-wise relationships, through which the hybrid early-late fusion strategy is designed to enable efficient multi-stage fusions. With multiple views extended to many view groups, three levels of diversity (w.r.t. features, anchors, and neighbors, respectively) are jointly leveraged for constructing the view-sharing bipartite graphs in the early-stage fusion. Then, a set of diversified base clusterings for different view groups are obtained via fast graph partitioning, which are further formulated into a unified bipartite graph for final clustering in the late-stage fusion. Notably, FastMICE has almost linear time and space complexity, and is free of dataset-specific tuning. Experiments on 22 multi-view datasets demonstrate its advantages in scalability (for extremely large datasets), superiority (in clustering performance), and simplicity (to be applied) over the state-of-the-art. Code available: https://github.com/huangdonghere/FastMICE.
翻译:尽管取得了显著进展,但先前的多视图组合算法仍有三个局限性。 首先,随机组合法往往具有高度的计算复杂性,限制了大规模数据集的可行性。 其次,它们通常通过一个阶段的合并将多视图信息结合在一起,忽视了多阶段合并的可能性。 第三,经常需要特定数据集的超光度调整,从而进一步损害其实用性。 有鉴于此,我们提议通过组合(FastMICE)方法快速进行多视图组合。 特别是,随机组合的概念是用来捕捉多功能的视图-智慧关系,通过这种关系,设计混合的早期组合战略是为了促成有效的多阶段合并。随着多种观点的扩大,多阶段合并可能忽略多阶段合并的可能性。第三,经常需要特定数据集的超光谱超光谱超光谱调,从而进一步破坏其早期混集图的实用性。 然后,通过快速图形分割获得一套不同视图组的多样化基础组合,这些组合将进一步应用到一个统一的双部分的视图-视觉组合战略,从而使得有效的多阶段混合组合战略能够实现有效的多阶段融合。 将多种观点组合的组合的组合法-最高级的组合数据在最后的组合中,在最后的组合中展示中,最精确的组合数据中,最精确的组合中,最精确的组合中,最精确的轨道-最精确的精确的轨道-最精确的轨道-最精确的轨道-最精确性-最精确性数据。