This paper studies the subspace clustering problem in which data points collected from high-dimensional ambient space lie in a union of linear subspaces. Subspace clustering becomes challenging when the dimension of intersection between subspaces is large and most of the self-representation based methods are sensitive to the intersection between the span of clusters. In sharp contrast to the self-representation based methods, a recently proposed clustering method termed Innovation Pursuit, computed a set of optimal directions (directions of innovation) to build the adjacency matrix. This paper focuses on the Innovation Pursuit Algorithm to shed light on its impressive performance when the subspaces are heavily intersected. It is shown that in contrast to most of the existing methods which require the subspaces to be sufficiently incoherent with each other, Innovation Pursuit only requires the innovative components of the subspaces to be sufficiently incoherent with each other. These new sufficient conditions allow the clusters to be strongly close to each other. Motivated by the presented theoretical analysis, a simple yet effective projection based technique is proposed which we show with both numerical and theoretical results that it can boost the performance of Innovation Pursuit.
翻译:本文研究从高维环境空间收集的数据点在直线子空间结合中收集的子空间集群问题。当子空间之间的交叉层范围很大,而且大多数自代表法方法对集群之间的交叉点十分敏感时,子空间集群就具有挑战性。与基于自我代表的方法截然不同的是,最近提出的一种称为创新探索的分组方法,它计算了一套最佳方向(创新方向),以构建相近矩阵。本文件侧重于创新追求阿尔高利特姆,以在子空间高度交错时说明其令人印象深刻的性能。它表明,与大多数要求子空间彼此充分不相容的现有方法相比,创新追求仅仅要求子空间的创新组成部分相互不相容。这些新的充分条件使得集群能够相互密切接近。根据提出的理论分析,我们用数字和理论结果展示了一种简单而有效的预测技术,我们用它以数字和理论结果显示它能够促进创新的绩效。