A novel topological-data-analytical (TDA) method is proposed to distinguish, from noise, small holes surrounded by high-density regions of a probability density function whose mass is concentrated near a manifold (or more generally, a CW complex) embedded in a high-dimensional Euclidean space. The proposed method is robust against additive noise and outliers. In particular, sample points are allowed to be perturbed away from the manifold. Traditional TDA tools, like those based on the distance filtration, often struggle to distinguish small features from noise, because of their short persistence. An alternative filtration, called Robust Density-Aware Distance (RDAD) filtration, is proposed to prolong the persistence of small holes surrounded by high-density regions. This is achieved by weighting the distance function by the density in the sense of Bell et al. Distance-to-measure is incorporated to enhance stability and mitigate noise due to the density estimation. The utility of the proposed filtration in identifying small holes, as well as its robustness against noise, are illustrated through an analytical example and extensive numerical experiments. Basic mathematical properties of the proposed filtration are proven.
翻译:提议采用一种新的地形数据分析法(TDA)方法,从噪音和高密度概率密度功能区域环绕的小孔中,将质量集中在高维的欧洲极地空间内嵌入的块状(或更一般地说,CW综合体)内,其质量集中在高维的欧洲极地空间内的一个块状(或CW综合体)周围的概率密度功能所环绕的概率密度函数区域所环绕的小孔与噪音所环绕的噪音和小孔。提议采用的方法对添加性噪声和外端物具有很强的力度,特别是允许将样品点与外层隔开。传统的TDA工具,如以距离过滤为基础的工具,往往因为其短持久性而难以区分小的特性和噪音。建议采用另一种过滤法,即称为Robust Density-Awarn距离(RDADAD)过滤法(RDADAD),以延长高密度区域环绕的小孔的持久性。这是通过对Bell等人和他人感知的密度感官的距离功能进行加权加权和广泛的数学实验来实现。