Table structure recognition is indispensable for enabling machines to comprehend tables; its primary purpose is to identify a table's internal structure. However, because tables vary widely in structure and style, parsing tabular data into a structured, machine-readable format is highly challenging. In this work, we follow the split-and-merge paradigm and propose an accurate table structure recognizer, termed SEMv2 (SEM: Split, Embed and Merge). Unlike previous work on the ``split'' stage, we address the instance-level discrimination of table separation lines and introduce a separation line detection strategy based on conditional convolution. Specifically, we design the ``split'' stage in a top-down manner: it first detects each table separation line instance and then dynamically predicts a separation line mask for each instance. The final shape of each separation line can then be accurately obtained by processing its mask in a row-wise/column-wise manner. To comprehensively evaluate SEMv2, we also present a more challenging dataset for table structure recognition, dubbed iFLYTAB, which covers tables of multiple styles in various scenarios such as photos and scanned documents. Extensive experiments on publicly available datasets (e.g., SciTSR, PubTabNet and iFLYTAB) demonstrate the efficacy of the proposed approach. The code and iFLYTAB dataset will be made publicly available upon acceptance of this paper.
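As a rough illustration of what processing a separation line mask "in a row-wise manner" could look like, the sketch below takes a predicted mask for a single column separator and reduces each image row to one x-coordinate. This is not the authors' implementation: the function name `separator_positions`, the `threshold` parameter, and the choice of a probability-weighted mean are all assumptions made for illustration.

```python
from typing import List, Optional


def separator_positions(
    mask: List[List[float]], threshold: float = 0.5
) -> List[Optional[float]]:
    """Reduce a column-separator mask to one x-coordinate per row.

    `mask` is an H x W grid of per-pixel probabilities for ONE separator
    instance (hypothetical input format). For each row, pixels below
    `threshold` are ignored and the remaining ones vote for the line
    position via a probability-weighted mean of their column indices.
    Rows with no confident pixel yield None.
    """
    positions: List[Optional[float]] = []
    for row in mask:
        # Keep only confident pixels in this row.
        weights = [p if p >= threshold else 0.0 for p in row]
        total = sum(weights)
        if total == 0.0:
            positions.append(None)  # separator not visible in this row
        else:
            positions.append(sum(x * w for x, w in enumerate(weights)) / total)
    return positions


if __name__ == "__main__":
    # A slightly slanted separator: it drifts one pixel to the right in row 1.
    demo_mask = [
        [0.0, 1.0, 1.0, 0.0, 0.0],
        [0.0, 0.0, 1.0, 1.0, 0.0],
        [0.0, 0.0, 0.0, 0.0, 0.0],
    ]
    print(separator_positions(demo_mask))
```

Because every row gets its own coordinate, a slanted or curved separator is traced as a polyline rather than forced onto a single vertical line; a row separator would be handled the same way with the roles of rows and columns swapped.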