With recent advances in data collection from multiple sources, multi-view data has received significant attention. In multi-view data, each view represents a different perspective of data. Since label information is often expensive to acquire, multi-view clustering has gained growing interest, which aims to obtain better clustering solution by exploiting complementary and consistent information across all views rather than only using an individual view. Due to inevitable sensor failures, data in each view may contain error. Error often exhibits as noise or feature-specific corruptions or outliers. Multi-view data may contain any or combination of these error types. Blindly clustering multi-view data i.e., without considering possible error in view(s) could significantly degrade the performance. The goal of error-robust multi-view clustering is to obtain useful outcome even if the multi-view data is corrupted. Existing error-robust multi-view clustering approaches with explicit error removal formulation can be structured into five broad research categories - sparsity norm based approaches, graph based methods, subspace based learning approaches, deep learning based methods and hybrid approaches, this survey summarizes and reviews recent advances in error-robust clustering for multi-view data. Finally, we highlight the challenges and provide future research opportunities.
翻译:由于从多种来源收集数据方面的最近进展,多视角数据受到高度重视。在多视角数据中,每种观点都代表了数据的不同视角。由于标签信息往往非常昂贵,因此多视角分组越来越引起人们的兴趣,目的是通过利用所有观点的互补和一致信息,而不是仅仅使用个人观点,从而获得更好的集群解决方案。由于传感器不可避免的故障,每种观点中的数据都可能包含错误。错误往往表现为噪音或特定特征的腐败或异端。多视角数据可能包含这些错误类型的任何或组合。盲目组合多视角数据,即,不考虑视图中可能存在的错误,可能会显著地降低绩效。错误-机器人多视角组合的目标是获得有用的结果,即使多视角数据被损坏。现有的错误-紫外多视角组合方法与明确错误清除配制可以分为五大类研究 — 基于空间规范的方法、基于图表的方法、基于子空间的学习方法、深层次的学习方法和混合方法,本调查可以总结并审查最近多视角数据错误-robust组合组合的进展。最后,我们强调各种挑战和未来的机会。