High-dimensional imaging is becoming increasingly relevant in many fields from astronomy and cultural heritage to systems biology. Visual exploration of such high-dimensional data is commonly facilitated by dimensionality reduction. However, common dimensionality reduction methods do not include spatial information present in images, such as local texture features, into the construction of low-dimensional embeddings. Consequently, exploration of such data is typically split into a step focusing on the attribute space followed by a step focusing on spatial information, or vice versa. In this paper, we present a method for incorporating spatial neighborhood information into distance-based dimensionality reduction methods, such as t-Distributed Stochastic Neighbor Embedding (t-SNE). We achieve this by modifying the distance measure between high-dimensional attribute vectors associated with each pixel such that it takes the pixel's spatial neighborhood into account. Based on a classification of different methods for comparing image patches, we explore a number of different approaches. We compare these approaches from a theoretical and experimental point of view. Finally, we illustrate the value of the proposed methods by qualitative and quantitative evaluation on synthetic data and two real-world use cases.
翻译:高维成像在从天文学和文化遗产到系统生物学等许多领域越来越具有相关性。对此类高维数据的视觉探索通常由维度减少来推动。然而,在构建低维嵌入器时,普通的维度减少方法并不包括图像中的空间信息,如本地质地特征等。因此,对这些数据的探索通常会分为一个步骤,侧重于属性空间,然后以空间信息为重点,或者反之亦然。在本文中,我们提出了一个将空间邻里信息纳入基于远程的维度减少方法的方法,如T-分布式蒸汽邻里嵌入(t-SNE)的方法。我们通过修改与每个像素相关的高维属性矢量之间的距离测量,从而将像素的空间邻里纳入考虑。根据对图像间隔的不同方法的分类,我们探讨一些不同的方法。我们从理论和实验的角度比较这些方法。最后,我们通过对合成数据和两个现实使用案例进行定性和定量评估,来说明拟议方法的价值。