Doc-GCN: Heterogeneous Graph Convolutional Networks for Document Layout Analysis - Zhuanzhi Papers


September 19, 2022

Doc-GCN: Heterogeneous Graph Convolutional Networks for Document Layout Analysis

Siwen Luo, Yihao Ding, Siqu Long, Josiah Poon, Soyeon Caren Han

From arXiv; accepted by COLING 2022

Recognizing the layout of unstructured digital documents is crucial when parsing them into a structured, machine-readable format for downstream applications. Recent studies in Document Layout Analysis (DLA) usually rely on computer vision models to understand documents while ignoring other vital information, such as contextual information or the relations between document components. Our Doc-GCN presents an effective way to harmonize and integrate heterogeneous aspects for Document Layout Analysis. We first construct graphs to explicitly describe four main aspects: syntactic, semantic, density, and appearance/visual information. Then, we apply graph convolutional networks to represent each aspect of information and use pooling to integrate them. Finally, we aggregate the aspects and feed them into 2-layer MLPs for document layout component classification. Our Doc-GCN achieves new state-of-the-art results on three widely used DLA datasets.
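The pipeline described in the abstract (one graph per aspect, a graph convolution per graph, pooling to fuse the aspects, then a 2-layer MLP classifier) can be illustrated with a minimal, dependency-free sketch. This is not the authors' implementation: the function names, the single convolution layer per aspect, the element-wise max pooling across aspects, the hidden size, and the random untrained weights are all assumptions made for illustration, as is the choice to treat each document component as a graph node.

```python
import math
import random


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def relu(m):
    """Element-wise ReLU on a matrix."""
    return [[max(0.0, v) for v in row] for row in m]


def normalize_adjacency(adj):
    """Return D^-1/2 (A + I) D^-1/2, the standard symmetric GCN normalization."""
    n = len(adj)
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)] for i in range(n)]
    deg = [sum(row) for row in a_hat]  # >= 1 thanks to the self-loop
    return [[a_hat[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)] for i in range(n)]


def gcn_layer(adj_norm, feats, weight):
    """One graph convolution: ReLU(A_norm @ X @ W)."""
    return relu(matmul(matmul(adj_norm, feats), weight))


def random_matrix(rows, cols, rng):
    """Untrained placeholder weights (assumption: a real model would learn these)."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def classify_components(aspects, num_classes, hidden=8, seed=0):
    """Predict a class index per node.

    aspects: list of (adjacency, node_features) pairs, one per aspect
    (e.g. syntactic, semantic, density, visual); all share the same nodes.
    """
    rng = random.Random(seed)
    per_aspect = []
    for adj, feats in aspects:
        weight = random_matrix(len(feats[0]), hidden, rng)
        per_aspect.append(gcn_layer(normalize_adjacency(adj), feats, weight))
    n = len(per_aspect[0])
    # Hypothetical fusion step: element-wise max pooling across aspects.
    fused = [[max(h[i][k] for h in per_aspect) for k in range(hidden)] for i in range(n)]
    # 2-layer MLP: hidden layer with ReLU, then a linear output layer.
    w1 = random_matrix(hidden, hidden, rng)
    w2 = random_matrix(hidden, num_classes, rng)
    logits = matmul(relu(matmul(fused, w1)), w2)
    return [max(range(num_classes), key=lambda c: row[c]) for row in logits]


# Toy usage: three components connected in a chain, described by two aspects.
adj = [[0, 1, 0], [1, 0, 1], [0, 1, 0]]
feats_a = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
feats_b = [[0.2], [0.5], [0.9]]
labels = classify_components([(adj, feats_a), (adj, feats_b)], num_classes=4)
print(labels)
```

Because the weights are random, the printed labels carry no meaning; the sketch only shows how per-aspect graph representations could be fused before classification.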
