Multi-view clustering is an important research topic due to its capability to utilize complementary information from multiple views. However, there are few methods to consider the negative impact caused by certain views with unclear clustering structures, resulting in poor multi-view clustering performance. To address this drawback, we propose self-supervised discriminative feature learning for deep multi-view clustering (SDMVC). Concretely, deep autoencoders are applied to learn embedded features for each view independently. To leverage the multi-view complementary information, we concatenate all views' embedded features to form the global features, which can overcome the negative impact of some views' unclear clustering structures. In a self-supervised manner, pseudo-labels are obtained to build a unified target distribution to perform multi-view discriminative feature learning. During this process, global discriminative information can be mined to supervise all views to learn more discriminative features, which in turn are used to update the target distribution. Besides, this unified target distribution can make SDMVC learn consistent cluster assignments, which accomplishes the clustering consistency of multiple views while preserving their features' diversity. Experiments on various types of multi-view datasets show that SDMVC achieves state-of-the-art performance.
翻译:多观点集群是一个重要的研究专题,因为它能够利用多种观点的补充信息。然而,几乎没有什么方法来考虑某些观点与一些观点不清晰的分组结构所造成的负面影响,从而导致多观点群集性能差。为了解决这一缺陷,我们建议为深层多观点群集(SDMVC)进行自我监督的歧视性特征学习。具体地说,深度自动分类器用于独立学习每个观点的嵌入特征。为了利用多观点补充信息,我们将所有观点的嵌入性特征归结成全球特征,这些特征可以克服某些观点不明确的分组结构的负面影响。以自我监督的方式,获得假标签以构建统一的目标分布,进行多观点歧视特征学习。在这一过程中,可以挖掘全球歧视信息,以监督所有观点,学习更具有歧视性的特征,这反过来被用来更新目标分布。此外,这种统一的目标分布可以使SDMVC学习连贯一致的群集任务,从而在保持其多样性的同时实现多种观点的组合一致性。关于多观点集成的实验显示SDMVC的绩效。