We present a new table structure recognition (TSR) approach, called TSRFormer, to robustly recognizing the structures of complex tables with geometrical distortions from various table images. Unlike previous methods, we formulate table separation line prediction as a line regression problem instead of an image segmentation problem and propose a new two-stage DETR based separator prediction approach, dubbed \textbf{Sep}arator \textbf{RE}gression \textbf{TR}ansformer (SepRETR), to predict separation lines from table images directly. To make the two-stage DETR framework work efficiently and effectively for the separation line prediction task, we propose two improvements: 1) A prior-enhanced matching strategy to solve the slow convergence issue of DETR; 2) A new cross attention module to sample features from a high-resolution convolutional feature map directly so that high localization accuracy is achieved with low computational cost. After separation line prediction, a simple relation network based cell merging module is used to recover spanning cells. With these new techniques, our TSRFormer achieves state-of-the-art performance on several benchmark datasets, including SciTSR, PubTabNet and WTW. Furthermore, we have validated the robustness of our approach to tables with complex structures, borderless cells, large blank spaces, empty or spanning cells as well as distorted or even curved shapes on a more challenging real-world in-house dataset.
翻译:我们提出了一个新的表格结构识别(TSRFormer)方法,即TSRFormer(TSRFormer),以强有力地识别不同表格图像中带有几何扭曲的复杂表格结构。与以往的方法不同,我们将表格分隔线预测作为一种线回归问题,而不是图像分割问题,并提出一个新的基于 DERTR 的双阶段分隔器预测(TSRFormer) 方法,称为 dubbed \ textbf{Sep}ator \ textbf{regressquenion \ textbf{TR),以直接预测与表格图像的分隔线的分隔线。为使两阶段的DETR框架能够高效和有效地为分隔线预测任务工作,我们提出了两项改进:(1) 一种先前强化的匹配战略,以解决DETR的缓慢趋同问题;(2) 一种新的交叉关注模块,从高分辨率的演动特征地图上采集样本特征,以便用低计算成本实现高的本地化。在分离线预测后,一个基于单元格合并模块的简单关系网络连接网路段组合模式用于恢复跨单元格。有了这些新技术,我们的TRTRFSFermer-real-ruder-rual-rual-ruder-de-rual-de-ruder-st-st-ruder-rual-st-rual-rual-rual-st-st-st-st-st-st-rub-stal-st-st-st-st-st-rub-rub-rub-ruction-st-st-st-st-st-st-st-st-st-st-st-st-st-ruction-ruction-st-st-st-st-st-st-st-st-st-st-st-st-st-st-st-st-st-st-st-st-st-st-st-st-st-st-d-d-sal-d-d-d-d-stal-stal-stal-st-st-st-stal-stal-st-st-st-st-st-st-st-st-st-st-st-st-st-st-st-st-st-st-st-st-st-st-st-d-