Deep subspace clustering has attracted increasing attention in recent years. Almost all existing works must load the entire training set into a single batch to learn the self-expressive coefficients within a deep learning framework. Although these methods achieve promising results, such a learning scheme severely prevents the use of deeper neural network architectures (e.g., ResNet), limiting the representation ability of the models. In this paper, we propose a new deep subspace clustering framework motivated by energy-based models. In contrast to previous approaches, which take the weights of a fully connected layer as the self-expressive coefficients, we propose to learn an energy-based network that produces the self-expressive coefficients through mini-batch training. In this way, it is no longer necessary to load all data into one batch, and it becomes feasible to employ deeper neural network models for subspace clustering. Considering the powerful representation ability of recently popular self-supervised learning, we leverage self-supervised representation learning to learn the dictionary. Finally, we propose a joint framework that learns the self-expressive coefficients and the dictionary simultaneously, and we train the model in an end-to-end manner. Experiments on three publicly available datasets demonstrate that our method significantly outperforms related approaches. For instance, averaged over the three datasets, our method achieves improvements of $13.8\%$, $15.4\%$, and $20.8\%$ in terms of Accuracy, NMI, and ARI, respectively, over SENet, a very recently proposed method that obtains the second-best results in our experiments.
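To make the central idea concrete, the following is a minimal sketch of mini-batch self-expressive learning, in which a network predicts the coefficients instead of storing them as the weights of an $N \times N$ fully connected layer. It assumes a SENet-style query-key parameterization and an $\ell_1$-regularized reconstruction loss; the class names, dimensions, and regularization weight are illustrative assumptions, not the paper's exact formulation.

```python
import torch
import torch.nn as nn

class SelfExpressiveNet(nn.Module):
    """Hypothetical sketch: predicts self-expressive coefficients
    c_ij = g(x_i, x_j) from pairs of features, so only a mini-batch
    (not the full dataset) is needed per training step."""
    def __init__(self, dim, hidden=1024):
        super().__init__()
        self.query = nn.Sequential(nn.Linear(dim, hidden), nn.ReLU(),
                                   nn.Linear(hidden, hidden))
        self.key = nn.Sequential(nn.Linear(dim, hidden), nn.ReLU(),
                                 nn.Linear(hidden, hidden))

    def forward(self, x):
        q, k = self.query(x), self.key(x)            # (B, hidden) each
        c = q @ k.t()                                # (B, B) coefficient matrix
        # Zero the diagonal so no sample trivially reconstructs itself.
        c = c - torch.diag_embed(torch.diagonal(c))
        return c

def self_expressive_loss(x, c, lam=1.0):
    # Reconstruct each sample as a linear combination of the other
    # samples in the mini-batch, with an l1 sparsity penalty on c.
    recon = c @ x
    return ((x - recon) ** 2).sum(dim=1).mean() + lam * c.abs().sum(dim=1).mean()

# Usage on one mini-batch of features (e.g., from a deep backbone).
net = SelfExpressiveNet(dim=128)
x = torch.randn(256, 128)
loss = self_expressive_loss(x, net(x))
loss.backward()
```

Because the coefficients are produced by a shared network rather than stored per sample pair, memory no longer scales with the dataset size, which is what allows deeper backbones such as ResNet to be plugged in.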