Constructing high-quality character image datasets is challenging because real-world images are often affected by image degradation. There are limitations when applying current image restoration methods to such real-world character images, since (i) the categories of noise in character images are different from those in general images; (ii) real-world character images usually contain more complex image degradation, e.g., mixed noise at different noise levels. To address these problems, we propose a real-world character restoration network (RCRN) to effectively restore degraded character images, where character skeleton information and scale-ensemble feature extraction are utilized to obtain better restoration performance. The proposed method consists of a skeleton extractor (SENet) and a character image restorer (CiRNet). SENet aims to preserve the structural consistency of the character and normalize complex noise. Then, CiRNet reconstructs clean images from degraded character images and their skeletons. Due to the lack of benchmarks for real-world character image restoration, we constructed a dataset containing 1,606 character images with real-world degradation to evaluate the validity of the proposed method. The experimental results demonstrate that RCRN outperforms state-of-the-art methods quantitatively and qualitatively.
翻译:构建高质量字符图像数据集具有挑战性,因为真实世界图像往往受到图像退化的影响。在对真实世界图像应用当前图像恢复方法时存在局限性,因为(一) 字符图像中的噪音类别不同于一般图像;(二) 真实世界字符图像通常包含更为复杂的图像降解,例如在不同噪音水平上混杂的噪音。为解决这些问题,我们建议建立一个真实世界字符恢复网络(RCRN),以有效恢复退化的字符图像,利用这些图像使用字符骨架信息以及比例组合特征提取,以获得更好的恢复性能。拟议方法包括骨架提取器(SENet)和字符图像恢复器(CiRNet)。SENet旨在维护字符的结构性一致性和常规复杂噪音。然后,CiRNet从退化的字符图像及其骨架中重建清洁图像。由于缺乏真实世界图像恢复的基准,我们建立了一个包含1 606个具有真实世界退化特征图像的数据集,用以评价拟议方法的有效性。实验结果显示, RCRN 超越了状态的定性和定量方法。