CascadeTabNet:结束对基于图像的文档的表格最终检测和结构识别的方法 (CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents)

An automatic table recognition method for interpretation of tabular data in document images majorly involves solving two problems of table detection and table structure recognition. The prior work involved solving both problems independently using two separate approaches. More recent works signify the use of deep learning-based solutions while also attempting to design an end to end solution. In this paper, we present an improved deep learning-based end to end approach for solving both problems of table detection and structure recognition using a single Convolution Neural Network (CNN) model. We propose CascadeTabNet: a Cascade mask Region-based CNN High-Resolution Network (Cascade mask R-CNN HRNet) based model that detects the regions of tables and recognizes the structural body cells from the detected tables at the same time. We evaluate our results on ICDAR 2013, ICDAR 2019 and TableBank public datasets. We achieved 3rd rank in ICDAR 2019 post-competition results for table detection while attaining the best accuracy results for the ICDAR 2013 and TableBank dataset. We also attain the highest accuracy results on the ICDAR 2019 table structure recognition dataset. Additionally, we demonstrate effective transfer learning and image augmentation techniques that enable CNNs to achieve very accurate table detection results. Code and dataset has been made available at: https://github.com/DevashishPrasad/CascadeTabNet

翻译：对文件图像中的表格数据进行解释的自动表格识别方法主要涉及解决表格检测和表格结构识别的两个问题。之前的工作涉及使用两种不同的方法独立解决这两个问题。最近的工作表明使用了深层次的基于学习的解决方案,同时也试图设计最终解决方案。在本文件中,我们提出了一个更深层次的基于学习的终端方法,以解决表格检测和结构识别两个问题,使用单一的 Conculation Neal网络模型(CNN),我们建议Cascade TabNet:一个基于Cascade 掩码的CNN高分辨率网络(Cascade 遮罩 R-CNN HRNet)基于模型,该模型可以检测表格区域,并同时从检测的表格中识别结构体细胞。我们评估了我们关于ICDAR 2013、 ICDAR 2019 和表库公共数据集的结果。我们在ICDAR 2019 后竞争结果探测表格检测结果达到三等,同时取得2013 ICDAR 和表Bank 高分辨率网络数据集。我们还在 ICDAR 2019 表格结构识别/CRAD 上取得了最高准确的成绩。我们学习了SDRCSDSDSD 。

相关内容

ICDAR

关注 131

ICDAR是一个非常成功的旗舰会议系列，是文献分析界研究人员、科学家和实践者的最大和最重要的国际会议。该会议得到了IAPR-TC 10/11的认可，成立于近30年前。第15届文件分析与识别国际会议（ICDAR 2019）由澳大利亚悉尼科技大学（UTS）主办，并在悉尼国际会议中心（ICC）举行。接受的论文由会议出版服务（CPS）出版，并提交给IEEE Xplore数字图书馆。官网链接：http://icdar2019.org/

【深度学习社区检测】Deep Learning for Community Detection: Progress, Challenges and Opportunities

专知会员服务

28+阅读 · 2020年6月13日

【文献综述】Text Detection and Recognition in the Wild: A Review 自然文本检测与识别

专知会员服务

46+阅读 · 2020年6月11日

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

专知会员服务

33+阅读 · 2020年5月12日

【2020关键词提取】医学报告的关键词提取和结构化，Keyword extraction and structuralization of medical reports

专知会员服务

33+阅读 · 2020年5月2日