基于层次化建模的快速准确表格识别方法 (Hierarchical Modeling Approach to Fast and Accurate Table Recognition) - 专知论文

会员服务 ·

0

识别 · 表格识别 · 格识别 · 识别方法 · 单元 ·

Hierarchical Modeling Approach to Fast and Accurate Table Recognition

翻译：基于层次化建模的快速准确表格识别方法

Takaya Kawakatsu

The extraction and use of diverse knowledge from numerous documents is a pressing challenge in intelligent information retrieval. Documents contain elements that require different recognition methods. Table recognition typically consists of three subtasks, namely table structure, cell position and cell content recognition. Recent models have achieved excellent recognition with a combination of multi-task learning, local attention, and mutual learning. However, their effectiveness has not been fully explained, and they require a long period of time for inference. This paper presents a novel multi-task model that utilizes non-causal attention to capture the entire table structure, and a parallel inference algorithm for faster cell content inference. The superiority is demonstrated both visually and statistically on two large public datasets.

翻译：从海量文档中提取并利用多样化知识是智能信息检索领域亟待解决的挑战。文档包含需要不同识别方法的多种元素。表格识别通常包含三个子任务：表格结构识别、单元格位置识别和单元格内容识别。现有模型通过结合多任务学习、局部注意力机制和相互学习策略已取得优异识别效果，但其有效性尚未得到充分解释，且推理时间较长。本文提出一种新颖的多任务模型，该模型利用非因果注意力机制捕捉完整表格结构，并采用并行推理算法加速单元格内容推断。在两个大型公开数据集上的可视化与统计结果均证明了该方法的优越性。

0

相关内容

【AAAI2025】TimeDP：通过领域提示学习生成多领域时间序列

【AAAI2025】TimeDP：通过领域提示学习生成多领域时间序列

专知会员服务

14+阅读 · 1月10日

ChatAug: 利用ChatGPT进行文本数据增强

ChatAug: 利用ChatGPT进行文本数据增强

专知会员服务

81+阅读 · 2023年3月4日

CIKM 2021 | FKGE：差分隐私的联邦知识图谱嵌入

专知会员服务

22+阅读 · 2021年8月20日

[CVPR 2021] 基于傅里叶轮廓嵌入的任意形状文本检测（有源码）

专知会员服务

18+阅读 · 2021年5月8日

【CVPR 2020 Oral】小样本类增量学习

专知会员服务

112+阅读 · 2020年6月26日

【KDD2020-Tutorial】深度学习异常检测，180页ppt

【KDD2020-Tutorial】深度学习异常检测，180页ppt

专知

49+阅读 · 2020年8月28日

【ACMMM2020-北航】KBGN:用于视觉对话中自适应视觉-文本推理的知识桥图网络

【ACMMM2020-北航】KBGN:用于视觉对话中自适应视觉-文本推理的知识桥图网络

专知

10+阅读 · 2020年8月12日

Python图像处理，366页pdf，Image Operators Image Processing in Python

Python图像处理，366页pdf，Image Operators Image Processing in Python

专知

15+阅读 · 2020年7月23日

零样本图像识别综述论文

零样本图像识别综述论文

专知

22+阅读 · 2020年4月4日

LibRec 每周算法：LDA主题模型

LibRec 每周算法：LDA主题模型

LibRec智能推荐

29+阅读 · 2017年12月4日

有效融合多源异构数据的集成分类器研究

国家自然科学基金

5+阅读 · 2015年12月31日

不确定知识图谱中面向结构查询的众包清洗研究

国家自然科学基金

4+阅读 · 2015年12月31日

面向异分布数据的主动学习方法

国家自然科学基金

12+阅读 · 2015年12月31日

基于组合Hodge理论的图像视频质量评价方法

国家自然科学基金

0+阅读 · 2014年12月31日

基于融合先验知识的机器学习的多传感器融合研究

国家自然科学基金

16+阅读 · 2013年12月31日

Semantic Refinement with LLMs for Graph Representations

Arxiv

0+阅读 · 12月24日

Language-Guided Grasp Detection with Coarse-to-Fine Learning for Robotic Manipulation

Arxiv

0+阅读 · 12月24日

Mirroring Users: Towards Building Preference-aligned User Simulator with User Feedback in Recommendation

Arxiv

0+阅读 · 12月20日

Stable and Efficient Single-Rollout RL for Multimodal Reasoning

Arxiv

0+阅读 · 12月20日

Human-like Content Analysis for Generative AI with Language-Grounded Sparse Encoders

Arxiv

0+阅读 · 12月19日

VIP会员

文章信息

相关主题

相关VIP内容

【AAAI2025】TimeDP：通过领域提示学习生成多领域时间序列

【AAAI2025】TimeDP：通过领域提示学习生成多领域时间序列

专知会员服务

14+阅读 · 1月10日

ChatAug: 利用ChatGPT进行文本数据增强

ChatAug: 利用ChatGPT进行文本数据增强

专知会员服务

81+阅读 · 2023年3月4日

CIKM 2021 | FKGE：差分隐私的联邦知识图谱嵌入

专知会员服务

22+阅读 · 2021年8月20日

[CVPR 2021] 基于傅里叶轮廓嵌入的任意形状文本检测（有源码）

专知会员服务

18+阅读 · 2021年5月8日

【CVPR 2020 Oral】小样本类增量学习

专知会员服务

112+阅读 · 2020年6月26日

热门VIP内容

开通专知VIP会员享更多权益服务

【书籍】从零开始构建文本生成图像生成器：基于 Transformers 与扩散模型

人工智能与未来指挥

【伯克利博士论文】将大语言模型绑定至虚拟人格：实现人类行为模拟

稀疏自编码器综述：解释大语言模型的内部机制

相关资讯

【KDD2020-Tutorial】深度学习异常检测，180页ppt

【KDD2020-Tutorial】深度学习异常检测，180页ppt

专知

49+阅读 · 2020年8月28日

【ACMMM2020-北航】KBGN:用于视觉对话中自适应视觉-文本推理的知识桥图网络

【ACMMM2020-北航】KBGN:用于视觉对话中自适应视觉-文本推理的知识桥图网络

专知

10+阅读 · 2020年8月12日

Python图像处理，366页pdf，Image Operators Image Processing in Python

Python图像处理，366页pdf，Image Operators Image Processing in Python

专知

15+阅读 · 2020年7月23日

零样本图像识别综述论文

零样本图像识别综述论文

专知

22+阅读 · 2020年4月4日

LibRec 每周算法：LDA主题模型

LibRec 每周算法：LDA主题模型

LibRec智能推荐

29+阅读 · 2017年12月4日

相关论文

Semantic Refinement with LLMs for Graph Representations

Arxiv

0+阅读 · 12月24日

Language-Guided Grasp Detection with Coarse-to-Fine Learning for Robotic Manipulation

Arxiv

0+阅读 · 12月24日

Mirroring Users: Towards Building Preference-aligned User Simulator with User Feedback in Recommendation

Arxiv

0+阅读 · 12月20日

Stable and Efficient Single-Rollout RL for Multimodal Reasoning

Arxiv

0+阅读 · 12月20日

Human-like Content Analysis for Generative AI with Language-Grounded Sparse Encoders

Arxiv

0+阅读 · 12月19日

相关基金

有效融合多源异构数据的集成分类器研究

国家自然科学基金

5+阅读 · 2015年12月31日

不确定知识图谱中面向结构查询的众包清洗研究

国家自然科学基金

4+阅读 · 2015年12月31日

面向异分布数据的主动学习方法

国家自然科学基金

12+阅读 · 2015年12月31日

基于组合Hodge理论的图像视频质量评价方法

国家自然科学基金

0+阅读 · 2014年12月31日

基于融合先验知识的机器学习的多传感器融合研究

国家自然科学基金

16+阅读 · 2013年12月31日

微信扫码咨询专知VIP会员