The field of deep clustering combines deep learning and clustering to learn representations that improve both the learned representation and the performance of the considered clustering method. Most existing deep clustering methods are designed for a single clustering method, e.g., k-means, spectral clustering, or Gaussian mixture models, but it is well known that no clustering algorithm works best in all circumstances. Consensus clustering tries to alleviate the individual weaknesses of clustering algorithms by building a consensus between members of a clustering ensemble. Currently, there is no deep clustering method that can include multiple heterogeneous clustering algorithms in an ensemble to update representations and clusterings together. To close this gap, we introduce the idea of a consensus representation that maximizes the agreement between ensemble members. Further, we propose DECCS (Deep Embedded Clustering with Consensus representationS), a deep consensus clustering method that learns a consensus representation by enhancing the embedded space to such a degree that all ensemble members agree on a common clustering result. Our contributions are the following: (1) We introduce the idea of learning consensus representations for heterogeneous clusterings, a novel notion to approach consensus clustering. (2) We propose DECCS, the first deep clustering method that jointly improves the representation and clustering results of multiple heterogeneous clustering algorithms. (3) We show in experiments that learning a consensus representation with DECCS is outperforming several relevant baselines from deep clustering and consensus clustering. Our code can be found at https://gitlab.cs.univie.ac.at/lukas/deccs
翻译:深海集群领域结合了深层次的学习和集群,以了解能够改善所学代表性和已考虑的集群方法的绩效的表述方式。大多数现有的深层集群方法都是为单一集群方法设计的,例如, k- means、 光谱集群或高斯混合模式,但众所周知,任何组合算法在各种情况下都不会发挥最佳作用。共识集群试图通过在集群整体成员之间建立共识来减轻集群算法的个别弱点。目前,没有深度集群方法可以将多种混合组合算法纳入一个共同的组合,以更新各种表述和集群。为了缩小这一差距,我们提出了协商一致代表制的想法,以最大限度地实现共同群体成员之间的协议。此外,我们提议了DECCS(深入嵌入集的集群与共识代表制的结合),即通过扩大嵌入空间,使所有共同组合成员都能够就共同的集群结果达成一致。我们的贡献如下:(1) 我们提出为混合组合组合进行首次学习共识表达的理念,在采用新的理念,在采用多层次的集群中,我们提议采用一种采用多层次的模型。