The technology of hyperspectral imaging (HSI) records the visual information upon long-range-distributed spectral wavelengths. A representative hyperspectral image acquisition procedure conducts a 3D-to-2D encoding by the coded aperture snapshot spectral imager (CASSI) and requires a software decoder for the 3D signal reconstruction. By observing this physical encoding procedure, two major challenges stand in the way of a high-fidelity reconstruction. (i) To obtain 2D measurements, CASSI dislocates multiple channels by disperser-titling and squeezes them onto the same spatial region, yielding an entangled data loss. (ii) The physical coded aperture leads to a masked data loss by selectively blocking the pixel-wise light exposure. To tackle these challenges, we propose a spatial-spectral (S^2-) Transformer network with a mask-aware learning strategy. First, we simultaneously leverage spatial and spectral attention modeling to disentangle the blended information in the 2D measurement along both two dimensions. A series of Transformer structures are systematically designed to fully investigate the spatial and spectral informative properties of the hyperspectral data. Second, the masked pixels will induce higher prediction difficulty and should be treated differently from unmasked ones. Thereby, we adaptively prioritize the loss penalty attributing to the mask structure by inferring the pixel-wise reconstruction difficulty upon the mask-encoded prediction. We theoretically discusses the distinct convergence tendencies between masked/unmasked regions of the proposed learning strategy. Extensive experiments demonstrates that the proposed method achieves superior reconstruction performance. Additionally, we empirically elaborate the behaviour of spatial and spectral attentions under the proposed architecture, and comprehensively examine the impact of the mask-aware learning.
翻译:超光谱成像( HISI) 技术记录长距离分布光谱波长的视觉信息。 具有代表性的超光谱图像获取程序由代码孔径光光谱成像仪( CASSI) 进行3D至2D编码, 需要3D信号重建软件解码器。 通过观察这种物理编码程序, 两大挑战存在于高不洁度重建的道路上。 (一) 为了获得2D测量, CASSI通过撒布和挤压到同一空间区域, 从而造成数据折叠损失。 (二) 物理编码孔径通过有选择地阻断像子光谱光谱光谱显示器光谱光谱显示数据丢失。 首先, 我们同时利用空间和光谱关注模型, 在2D测量中将混合的信息分解为两个维度。 一系列变换结构正在系统地设计, 来彻底调查在高空和光谱的精确度光谱上显示数据重建过程中的清晰度变化, 将显示我们从高空和光谱结构中学习高空和高光谱变变变变变的变的变的变的变的变的变的变的变的变的变的变的变的变的变的变的变的变的变, 。