Most computer vision and machine learning-based approaches for historical document analysis are tailored to grayscale or RGB images and thus, mostly exploit their spatial information. Multispectral (MS) and hyperspectral (HS) images contain, next to the spatial information, much richer spectral information than RGB images (usually spreading beyond the visible spectral range) that can facilitate more effective feature extraction, more accurate classification and recognition, and thus, improved analysis. Although utilization of rich spectral information can improve historical document analysis tremendously, there are still some potential limitations of HS imagery such as camera-induced noise and blur that require a carefully designed preprocessing step. Here, we propose novel blind HS image deblurring methods tailored to document images. We exploit a low-rank property of HS images (i.e., by projecting an HS image to a lower dimensional subspace) and utilize a text tailor image prior to performing a PSF estimation and deblurring of subspace components. The preliminary results show that the proposed approach gives good results over all spectral bands, removing successfully image artefacts introduced by blur and noise and significantly increasing the number of bands that can be used in further analysis.
翻译:多光谱(MS)和超光谱(HS)图像除了空间信息外,还包含远比RGB图像(通常在可见光谱范围以外传播)更丰富的光谱信息,有助于更有效地提取特征,更准确地分类和识别,从而改进分析。虽然利用丰富的光谱信息可以极大地改进历史文件分析,但HS图像(如照相机引发的噪音和模糊)仍有一些潜在的局限性,需要认真设计一个预处理步骤。在这里,我们提出了针对文件图像的新颖的盲光光谱(MS)和超光谱(HS)图像爆破方法。我们利用了HS图像的低级属性(例如,将HS图像投射到一个低维次空间),并在进行PSF估计和分解子空间组件之前使用文本裁缝图像。初步结果显示,拟议的方法可以给所有光谱带带来良好结果,通过模糊和噪音成功地清除图像制品,并大大增加可用于进一步分析的波段数量。</s>