Through the use of examples, we explain one way in which applied topology has evolved since the birth of persistent homology in the early 2000s. The first applications of topology to data emphasized the global shape of a dataset, such as the three-circle model for $3 \times 3$ pixel patches from natural images, or the configuration space of the cyclo-octane molecule, which is a sphere with a Klein bottle attached via two circles of singularity. In these studies of global shape, short persistent homology bars are disregarded as sampling noise. More recently, however, persistent homology has been used to address questions about the local geometry of data. For instance, how can local geometry be vectorized for use in machine learning problems? Persistent homology and its vectorization methods, including persistence landscapes and persistence images, provide popular techniques for incorporating both local geometry and global topology into machine learning. Our meta-hypothesis is that the short bars are as important as the long bars for many machine learning tasks. In defense of this claim, we survey applications of persistent homology to shape recognition, agent-based modeling, materials science, archaeology, and biology. Additionally, we survey work connecting persistent homology to geometric features of spaces, including curvature and fractal dimension, and various methods that have been used to incorporate persistent homology into machine learning.
翻译:通过举例,我们解释了自2000年代早期持久性同族主义诞生以来应用地貌学演变的一种方法。数据学的最初应用强调数据集的全球形状,例如自然图像3美元3元的三环模型像素补,或环球-奥氏分子的配置空间,这是一个通过两个奇点圈附着的克莱因瓶子的球体。在全球形状的这些研究中,短长的持久性同系物条被忽略为抽样噪音。然而,最近,持续性同系物学被用来解决有关数据当地几何学的问题。例如,如何将本地几何学用于机器学习问题?持久性同系物学及其矢量化方法,包括持久性地貌和持久性图像,为将本地几何和全球性的表面学纳入机器学习提供了流行技术。我们的元理学理论是,短线与许多机器学习任务的长线一样重要。为了维护这一主张,我们调查了持续性同系学的应用,将识别、代理人基质学和持续性研究纳入持续性生物学、持续性生物学研究、持续性生物学、历史、历史学学、历史学、历史学、历史学、历史学、历史学、历史学、历史学、历史学学、历史学学学学学、历史学学学、历史学、历史学、历史学学学学学等。