对等级数据集进行联合几何和地形学分析 (Joint Geometric and Topological Analysis of Hierarchical Datasets)

In a world abundant with diverse data arising from complex acquisition techniques, there is a growing need for new data analysis methods. In this paper we focus on high-dimensional data that are organized into several hierarchical datasets. We assume that each dataset consists of complex samples, and every sample has a distinct irregular structure modeled by a graph. The main novelty in this work lies in the combination of two complementing powerful data-analytic approaches: topological data analysis (TDA) and geometric manifold learning. Geometry primarily contains local information, while topology inherently provides global descriptors. Based on this combination, we present a method for building an informative representation of hierarchical datasets. At the finer (sample) level, we devise a new metric between samples based on manifold learning that facilitates quantitative structural analysis. At the coarser (dataset) level, we employ TDA to extract qualitative structural information from the datasets. We showcase the applicability and advantages of our method on simulated data and on a corpus of hyper-spectral images. We show that an ensemble of hyper-spectral images exhibits a hierarchical structure that fits well the considered setting. In addition, we show that our new method gives rise to superior classification results compared to state-of-the-art methods.

翻译：在一个拥有来自复杂获取技术的丰富数据的世界中,日益需要新的数据分析方法。在本文中,我们侧重于由几个等级数据集组成的高维数据。我们假设每个数据集由复杂的样本组成,每个样本都有不同的非常规结构,以图表为模型。这项工作的主要新颖之处在于两种补充强大的数据分析分析方法的结合:地形数据分析(TDA)和几何多元学习。几何主要包含本地信息,而地形学本身就提供了全球描述仪。基于这一组合,我们提出了一个构建等级数据集信息化代表性的方法。在精细(样本)一级,我们根据多种学习,设计了一个样本之间的新指标,以便利定量结构分析。在剖析(数据集)一级,我们使用TDA从数据集一级提取定性结构信息。我们展示了我们的方法在模拟数据和超光谱图像的组合方面的适用性和优势。我们展示了一种超光谱图像展示了一种等级结构,这种等级结构在精细的层次结构上展示了我们所考虑的排序的方法。此外,我们展示了一种先进的方法。我们展示了一种先进的方法。

相关内容

流形学习

关注 345

流形学习，全称流形学习方法(Manifold Learning)，自2000年在著名的科学杂志《Science》被首次提出以来，已成为信息科学领域的研究热点。在理论和应用上，流形学习方法都具有重要的研究意义。假设数据是均匀采样于一个高维欧氏空间中的低维流形，流形学习就是从高维采样数据中恢复低维流形结构，即找到高维空间中的低维流形，并求出相应的嵌入映射，以实现维数约简或者数据可视化。它是从观测到的现象中去寻找事物的本质，找到产生数据的内在规律。

【图与几何深度学习】Graph and geometric deep learning，49页ppt

专知会员服务

65+阅读 · 2021年4月24日

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

45+阅读 · 2020年12月18日