Spectral-based subspace clustering methods have proved successful in many challenging applications such as gene sequencing, image recognition, and motion segmentation. In this work, we first propose a novel spectral-based subspace clustering algorithm that seeks to represent each point as a sparse convex combination of a few nearby points. We then extend the algorithm to constrained clustering and active learning settings. Our motivation for developing such a framework stems from the fact that typically either a small amount of labelled data is available in advance; or it is possible to label some points at a cost. The latter scenario is typically encountered in the process of validating a cluster assignment. Extensive experiments on simulated and real data sets show that the proposed approach is effective and competitive with state-of-the-art methods.
翻译:基于光谱的子空间集群方法在基因测序、图像识别和运动分离等许多具有挑战性的应用中证明是成功的。在这项工作中,我们首先提出一个新的基于光谱的子空间集群算法,它试图将每个点作为附近几个点的稀疏混流体组合来代表。然后我们将算法扩大到有限的集群和积极的学习环境。我们制定这种框架的动机来自以下事实:通常有少量贴标签的数据可以提前获得;或者有可能以成本标出某些点。后一种情况通常在验证集群任务的过程中遇到。关于模拟和实际数据集的广泛实验表明,拟议的方法是有效的,与最先进的方法具有竞争力。