Many attempts have been made in recent decades to integrate machine learning (ML) and topological data analysis. A prominent problem in applying persistent homology to ML tasks is finding a vector representation of a persistence diagram (PD), which is a summary diagram for representing topological features. From the perspective of data fitting, a stable vector representation, namely, persistence B-spline grid (PBSG), is proposed based on the efficient technique of progressive-iterative approximation for least-squares B-spline function fitting. We theoretically prove that the PBSG method is stable with respect to the metric of 1-Wasserstein distance defined on the PD space. The proposed method was tested on a synthetic data set, data sets of randomly generated PDs, data of a dynamical system, and 3D CAD models, showing its effectiveness and efficiency
翻译:近几十年来,人们多次尝试将机器学习(ML)和地形学数据分析结合起来,在将持久性同同质适用于ML任务方面,一个突出的问题是找到持久性图(PD)的矢量代表,这是代表地形特征的简要图表,从数据安装的角度来看,建议采用一种稳定的矢量代表,即持久性B-spline 电网(PBSG),其依据是最小平方B-spline函数适当化的渐进性近近似有效技术,我们理论上证明PBSG方法在PD空间界定的1-Wasserstein距离指标方面是稳定的。提议的方法是在合成数据集、随机生成的PD数据集、动态系统数据和3D CAD模型上测试的,显示了其有效性和效率。