Deep clustering has recently emerged as a promising technique for complex image clustering. Despite the significant progress, previous deep clustering works mostly tend to construct the final clustering by utilizing a single layer of representation, e.g., by performing $K$-means on the last fully-connected layer or by associating some clustering loss to a specific layer. However, few of them have considered the possibilities and potential benefits of jointly leveraging multi-layer representations for enhancing the deep clustering performance. In light of this, this paper presents a Deep Clustering via Ensembles (DeepCluE) approach, which bridges the gap between deep clustering and ensemble clustering by harnessing the power of multiple layers in deep neural networks. Particularly, we utilize a weight-sharing convolutional neural network as the backbone, which is trained with both the instance-level contrastive learning (via an instance projector) and the cluster-level contrastive learning (via a cluster projector) in an unsupervised manner. Thereafter, multiple layers of feature representations are extracted from the trained network, upon which a set of diversified base clusterings can be generated via a highly efficient clusterer. Then, the reliability of the clusters in multiple base clusterings is automatically estimated by exploiting an entropy-based criterion, based on which the multiple base clusterings are further formulated into a weighted-cluster bipartite graph. By partitioning this bipartite graph via transfer cut, the final image clustering result can therefore be obtained. Experimental results on six image datasets confirm the advantages of our DeepCluE approach over the state-of-the-art deep clustering approaches.
翻译:尽管取得了巨大进展,但先前的深层集群工作大多倾向于通过利用单一代表层来构建最后的集群,例如,在最后一个完全连接的层上使用K美元,或者将某些集群损失与特定层挂钩。然而,很少有人考虑过联合利用多层代表来提高深层集群性能的可能性和潜在好处。鉴于这一点,本文件通过Ensembbles(DeepCluE)方法展示了深层集群方法,该方法通过利用深神经网络中多个层的优势来弥合深层集群和聚合之间的鸿沟。特别是,我们利用一个权重共享的混合神经网络作为主干线,该主干线通过实例对比学习(通过实例投影机)和集群级对比学习(通过集投影投影机)来共同利用多层代表来提高深层集群性能。此后,从经过培训的国家网络中提取了多层地段地段的特征展示方法,通过高效率的层层层结构网络来生成一组多样化的基团组合。然后,通过高效率的高级的层分组将多层数据分组数据分组数据自动地标制成成一个基础。