We present Topological Point Cloud Clustering (TPCC), a new method to cluster points in an arbitrary point cloud based on their contribution to global topological features. TPCC synthesizes desirable features from spectral clustering and topological data analysis and is based on considering the spectral properties of a simplicial complex associated to the considered point cloud. As it is based on considering sparse eigenvector computations, TPCC is similarly easy to interpret and implement as spectral clustering. However, by focusing not just on a single matrix associated to a graph created from the point cloud data, but on a whole set of Hodge-Laplacians associated to an appropriately constructed simplicial complex, we can leverage a far richer set of topological features to characterize the data points within the point cloud and benefit from the relative robustness of topological techniques against noise. We test the performance of TPCC on both synthetic and real-world data and compare it with classical spectral clustering.
翻译:我们提出了一种新的方法——拓扑点云聚类(Topological Point Cloud Clustering,TPCC),用于基于点在全局拓扑特征中的贡献来聚类任意点云。TPCC综合了谱聚类和拓扑数据分析的良好特性,并基于考虑到与所考虑的点云相关的单纯形复合物(simplicial complex)的谱特性。由于它是基于稀疏特征向量计算的,因此TPCC的解释和实现与谱聚类类似容易。然而,通过不仅仅关注从点云数据创建的图形式上的单个矩阵,而是关注一整套与构建的单纯形复合物相关的Hodge-Laplacians,我们可以利用更丰富的拓扑特征来表征点云中的数据点,并受到拓扑技术相对于噪声的相对健壮性的好处。我们在合成和实际数据上测试了TPCC的性能,并将其与经典谱聚类进行了比较。