Hyperspectral images, which store a hundred or more spectral bands of reflectance, have become an important data source in natural and social sciences. Hyperspectral images are often generated in large quantities at a relatively coarse spatial resolution. As such, unsupervised machine learning algorithms incorporating known structure in hyperspectral imagery are needed to analyze these images automatically. This work introduces the Spatial-Spectral Image Reconstruction and Clustering with Diffusion Geometry (DSIRC) algorithm for partitioning highly mixed hyperspectral images. DSIRC reduces measurement noise through a shape-adaptive reconstruction procedure. In particular, for each pixel, DSIRC locates spectrally correlated pixels within a data-adaptive spatial neighborhood and reconstructs that pixel's spectral signature using those of its neighbors. DSIRC then locates high-density, high-purity pixels far in diffusion distance (a data-dependent distance metric) from other high-density, high-purity pixels and treats these as cluster exemplars, giving each a unique label. Non-modal pixels are assigned the label of their diffusion distance-nearest neighbor of higher density and purity that is already labeled. Strong numerical results indicate that incorporating spatial information through image reconstruction substantially improves the performance of pixel-wise clustering.
翻译:超光谱图像存储了100个或100个以上的光谱反射波段,已成为自然和社会科学中的一个重要数据源。超光谱图像通常以相对粗略的空间分辨率大量生成。因此,需要不经监督的机器学习算法,将已知的结构纳入超光谱图像中,以便自动分析这些图像。这项工作引入了空间光谱图像重建和集成集成集成集成与分流高度混合的超光谱图像(DSIRC)算法。DSIRC通过一个形状适应性重建程序减少测量噪音。特别是,对于每个像素来说,DSIRC将光谱相关像素定位在一个数据适应性空间区域中,并用其邻居的图像来重建像素的光谱特征。DSIRC随后将高密度、高纯度像素混集成和集成集成(一种依赖数据的远距离测量)引入其他高密度高密度、高纯度像素和这些作为集成体的外层标处理。对于每个像类来说,给每一个独特的标签都有光谱相对应的像素,并重新配置像团的光谱特征,通过其高密度的分辨率的分辨率标签,显示其密度的密度的密度的密度的密度的密度的分辨率的分辨率的分辨率。