The success of existing multi-view clustering relies on the assumption of sample integrity across multiple views. However, in real-world scenarios, samples of multi-view are partially available due to data corruption or sensor failure, which leads to incomplete multi-view clustering study (IMVC). Although several attempts have been proposed to address IMVC, they suffer from the following drawbacks: i) Existing methods mainly adopt cross-view contrastive learning forcing the representations of each sample across views to be exactly the same, which might ignore view discrepancy and flexibility in representations; ii) Due to the absence of non-observed samples across multiple views, the obtained prototypes of clusters might be unaligned and biased, leading to incorrect fusion. To address the above issues, we propose a Cross-view Partial Sample and Prototype Alignment Network (CPSPAN) for Deep Incomplete Multi-view Clustering. Firstly, unlike existing contrastive-based methods, we adopt pair-observed data alignment as 'proxy supervised signals' to guide instance-to-instance correspondence construction among views. Then, regarding of the shifted prototypes in IMVC, we further propose a prototype alignment module to achieve incomplete distribution calibration across views. Extensive experimental results showcase the effectiveness of our proposed modules, attaining noteworthy performance improvements when compared to existing IMVC competitors on benchmark datasets.
翻译:现有的多视图聚类的成功依赖于多视图中样本完整性的假设。然而,在现实场景中,由于数据损坏或传感器故障,多视图样本的可用性部分受限,这导致了不完整多视图聚类研究(IMVC)。虽然已经提出了一些解决IMVC的方法,但它们存在以下缺点:(i) 现有方法主要采用跨视图对比学习,迫使每个样本在多个视图上的表示完全相同,这可能忽略了视图差异和表示的灵活性;(ii) 由于多个视图中不存在非观察样本,所以所得到的聚类原型可能是不对齐和有偏差的,导致融合不正确。为了解决上述问题,我们提出了跨视图部分样本和原型对齐网络(CPSPAN)进行不完整多视图聚类。首先,不同于现有的对比方法,我们采用以所配对观察到样本的数据对齐作为“代理监督信号”来指导跨视图实例对实例的对应的构建。然后,针对IMVC中原型的偏移,我们进一步提出了一个原型对齐模块,实现跨视图的不完整分布校准。大量实验证明了我们提出的模块的有效性,在基准数据集上与现有IMVC竞争对手相比都获得了显着的性能提升。