Digitized documents such as scientific articles, tax forms, invoices, contract papers, and historic texts, are widely used nowadays. These images could be degraded or damaged due to various reasons including poor lighting conditions when capturing the image, shadow while scanning them, distortion like noise and blur, aging, ink stain, bleed through, watermark, stamp, etc. Document image enhancement and restoration play a crucial role in many automated document analysis and recognition tasks, such as content extraction using optical character recognition (OCR). With recent advances in deep learning, many methods are proposed to enhance the quality of these document images. In this paper, we review deep learning-based methods, datasets, and metrics for different document image enhancement problems. We provide a comprehensive overview of deep learning-based methods for six different document image enhancement tasks, including binarization, debluring, denoising, defading, watermark removal, and shadow removal. We summarize the main state-of-the-art works for each task and discuss their features, challenges, and limitations. We introduce multiple document image enhancement tasks that have received no to little attention, including over and under exposure correction and bleed-through removal, and identify several other promising research directions and opportunities for future research.
翻译:科学文章、税务表格、发票、合同文件和历史文本等数字化文件如今被广泛使用,这些图像可能由于各种原因被退化或损坏,其中包括在捕捉图像时光亮条件差、扫描时的阴影、噪音和模糊等扭曲、老化、墨迹污、流血、水印、印章等。文件图像的增强和恢复在许多自动文件分析和识别任务中发挥着关键作用,例如利用光学特征识别(OCR)进行内容提取。随着最近深层次学习的进展,提出了许多方法来提高这些文件图像的质量。在本文件中,我们审查了基于深层学习的方法、数据集和不同文件图像增强问题的衡量标准。我们全面概述了基于深层学习的方法,用于六种不同的文件图像增强任务,包括二元化、拆除、去除、去除、去除、去除、去除水标记和清除阴影。我们总结了每项任务的主要艺术品状况,并讨论了其特征、挑战和限制。我们介绍了许多基于文件的改进任务,但很少引起注意,包括暴露过后和流血式清除,并确定了其他有希望的机会。