长期信息提取的主要原因 (Uncovering Main Causalities for Long-tailed Information Extraction) - 专知论文

会员服务 ·

0

INFORMS · 信息抽取 · entity · MoDELS · 推断 ·

2021 年 9 月 11 日

Uncovering Main Causalities for Long-tailed Information Extraction

翻译：长期信息提取的主要原因

Guoshun Nan,Jiaqi Zeng,Rui Qiao,Zhijiang Guo,Wei Lu

from arxiv, Accepted as a long paper in the main conference of EMNLP 2021

Information Extraction (IE) aims to extract structural information from unstructured texts. In practice, long-tailed distributions caused by the selection bias of a dataset, may lead to incorrect correlations, also known as spurious correlations, between entities and labels in the conventional likelihood models. This motivates us to propose counterfactual IE (CFIE), a novel framework that aims to uncover the main causalities behind data in the view of causal inference. Specifically, 1) we first introduce a unified structural causal model (SCM) for various IE tasks, describing the relationships among variables; 2) with our SCM, we then generate counterfactuals based on an explicit language structure to better calculate the direct causal effect during the inference stage; 3) we further propose a novel debiasing approach to yield more robust predictions. Experiments on three IE tasks across five public datasets show the effectiveness of our CFIE model in mitigating the spurious correlation issues.

翻译：信息提取(IE)旨在从非结构化文本中提取结构性信息。在实践中,由数据集选择偏差造成的长期详细分布可能导致传统概率模型中实体和标签之间不正确的关联,也称为虚假关联。这促使我们提出反事实性 IE(CFIE)的新框架,目的是从因果推理的角度发现数据背后的主要因果关系。具体地说,1)我们首先为各种信息发布任务引入一个统一的结构性因果模型,描述变量之间的关系;2)与我们的数据通报(SCM)相比,我们随后产生基于明确语言结构的反事实,以更好地计算推论阶段的直接因果效应;3)我们进一步提出新的偏差方法,以得出更可靠的预测。在五个公共数据集中就信息发布三项信息通报任务进行的实验表明,CFIE模型在减轻虚假关联问题上的有效性。

1

相关内容

INFORMS

《计算机信息》杂志发表高质量的论文，扩大了运筹学和计算的范围，寻求有关理论、方法、实验、系统和应用方面的原创研究论文、新颖的调查和教程论文，以及描述新的和有用的软件工具的论文。官网链接：https://pubsonline.informs.org/journal/ijoc

一文概览 CVPR2021 最新18篇 Oral 论文

专知会员服务

26+阅读 · 2021年3月7日

【MIT】硬负样本的对比学习

【MIT】硬负样本的对比学习

专知会员服务

40+阅读 · 2020年10月14日

【MIT】反偏差对比学习，Debiased Contrastive Learning

【MIT】反偏差对比学习，Debiased Contrastive Learning

专知会员服务

91+阅读 · 2020年7月4日

【斯坦福】从电子病历EHR构建知识图谱，Robustly Extracting Medical Knowledge from EHRs:A Case Study of Learning a Health Knowledge Graph

【斯坦福】从电子病历EHR构建知识图谱，Robustly Extracting Medical Knowledge from EHRs:A Case Study of Learning a Health Knowledge Graph

专知会员服务

56+阅读 · 2020年6月2日

【Manning新书】现代Java实战，592页pdf

【Manning新书】现代Java实战，592页pdf

专知会员服务

101+阅读 · 2020年5月22日

【2020关键词提取】医学报告的关键词提取和结构化，Keyword extraction and structuralization of medical reports

【2020关键词提取】医学报告的关键词提取和结构化，Keyword extraction and structuralization of medical reports

专知会员服务

33+阅读 · 2020年5月2日

【CVPR2020】视觉跟踪的概率回归，Probabilistic Regression for Visual Tracking

【CVPR2020】视觉跟踪的概率回归，Probabilistic Regression for Visual Tracking

专知会员服务

37+阅读 · 2020年3月27日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【ACL2020放榜!】事件抽取、关系抽取、NER、Few-Shot 相关论文整理

【ACL2020放榜!】事件抽取、关系抽取、NER、Few-Shot 相关论文整理

深度学习自然语言处理

18+阅读 · 2020年5月22日

【论文】Awesome Relation Extraction Paper（关系抽取）（PART III）

【论文】Awesome Relation Extraction Paper（关系抽取）（PART III）

AINLP

25+阅读 · 2019年8月21日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

人工智能 | SCI期刊专刊信息3条

人工智能 | SCI期刊专刊信息3条

Call4Papers

5+阅读 · 2019年1月10日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

人工智能 | 国际会议截稿信息9条

人工智能 | 国际会议截稿信息9条

Call4Papers

4+阅读 · 2018年3月13日

论文浅尝 | Learning with Noise: Supervised Relation Extraction

论文浅尝 | Learning with Noise: Supervised Relation Extraction

开放知识图谱

3+阅读 · 2018年1月4日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

ED2: An Environment Dynamics Decomposition Framework for World Model Construction

Arxiv

0+阅读 · 2021年12月6日

Uncertainty-Aware Reliable Text Classification

Arxiv

8+阅读 · 2021年7月15日

Kleister: Key Information Extraction Datasets Involving Long Documents with Complex Layouts

Kleister: Key Information Extraction Datasets Involving Long Documents with Complex Layouts

Arxiv

3+阅读 · 2021年5月12日

Dynamic Neuro-Symbolic Knowledge Graph Construction for Zero-shot Commonsense Question Answering

Dynamic Neuro-Symbolic Knowledge Graph Construction for Zero-shot Commonsense Question Answering

Arxiv

6+阅读 · 2020年10月30日

Open Knowledge Enrichment for Long-tail Entities

Arxiv

6+阅读 · 2020年2月15日

Attention Guided Graph Convolutional Networks for Relation Extraction

Arxiv

4+阅读 · 2019年10月11日

Long-tail Relation Extraction via Knowledge Graph Embeddings and Graph Convolution Networks

Long-tail Relation Extraction via Knowledge Graph Embeddings and Graph Convolution Networks

Arxiv

8+阅读 · 2019年3月4日

QA4IE: A Question Answering based Framework for Information Extraction

Arxiv

4+阅读 · 2019年1月28日

Joint entity recognition and relation extraction as a multi-head selection problem

Arxiv

3+阅读 · 2018年12月17日

Event Extraction with Generative Adversarial Imitation Learning

Arxiv

13+阅读 · 2018年4月21日

VIP会员

文章信息

相关主题

相关VIP内容

一文概览 CVPR2021 最新18篇 Oral 论文

专知会员服务

26+阅读 · 2021年3月7日

【MIT】硬负样本的对比学习

【MIT】硬负样本的对比学习

专知会员服务

40+阅读 · 2020年10月14日

【MIT】反偏差对比学习，Debiased Contrastive Learning

【MIT】反偏差对比学习，Debiased Contrastive Learning

专知会员服务

91+阅读 · 2020年7月4日

【斯坦福】从电子病历EHR构建知识图谱，Robustly Extracting Medical Knowledge from EHRs:A Case Study of Learning a Health Knowledge Graph

【斯坦福】从电子病历EHR构建知识图谱，Robustly Extracting Medical Knowledge from EHRs:A Case Study of Learning a Health Knowledge Graph

专知会员服务

56+阅读 · 2020年6月2日

【Manning新书】现代Java实战，592页pdf

【Manning新书】现代Java实战，592页pdf

专知会员服务

101+阅读 · 2020年5月22日

【2020关键词提取】医学报告的关键词提取和结构化，Keyword extraction and structuralization of medical reports

【2020关键词提取】医学报告的关键词提取和结构化，Keyword extraction and structuralization of medical reports

专知会员服务

33+阅读 · 2020年5月2日

【CVPR2020】视觉跟踪的概率回归，Probabilistic Regression for Visual Tracking

【CVPR2020】视觉跟踪的概率回归，Probabilistic Regression for Visual Tracking

专知会员服务

37+阅读 · 2020年3月27日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《城市滨海地区：理解复杂多变环境下的指挥控制框架》50页报告

《理解城市战及其在俄乌战争中的表现》报告

美空军“顶点2025”实验：推进AI在C2、动态目标锁定与联盟集成中的应用

《建设式兵棋模拟作为战术集群配置优化的关键组成部分》

相关资讯

【ACL2020放榜!】事件抽取、关系抽取、NER、Few-Shot 相关论文整理

【ACL2020放榜!】事件抽取、关系抽取、NER、Few-Shot 相关论文整理

深度学习自然语言处理

18+阅读 · 2020年5月22日

【论文】Awesome Relation Extraction Paper（关系抽取）（PART III）

【论文】Awesome Relation Extraction Paper（关系抽取）（PART III）

AINLP

25+阅读 · 2019年8月21日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

人工智能 | SCI期刊专刊信息3条

人工智能 | SCI期刊专刊信息3条

Call4Papers

5+阅读 · 2019年1月10日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

人工智能 | 国际会议截稿信息9条

人工智能 | 国际会议截稿信息9条

Call4Papers

4+阅读 · 2018年3月13日

论文浅尝 | Learning with Noise: Supervised Relation Extraction

论文浅尝 | Learning with Noise: Supervised Relation Extraction

开放知识图谱

3+阅读 · 2018年1月4日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

相关论文

ED2: An Environment Dynamics Decomposition Framework for World Model Construction

Arxiv

0+阅读 · 2021年12月6日

Uncertainty-Aware Reliable Text Classification

Arxiv

8+阅读 · 2021年7月15日

Kleister: Key Information Extraction Datasets Involving Long Documents with Complex Layouts

Kleister: Key Information Extraction Datasets Involving Long Documents with Complex Layouts

Arxiv

3+阅读 · 2021年5月12日

Dynamic Neuro-Symbolic Knowledge Graph Construction for Zero-shot Commonsense Question Answering

Dynamic Neuro-Symbolic Knowledge Graph Construction for Zero-shot Commonsense Question Answering

Arxiv

6+阅读 · 2020年10月30日

Open Knowledge Enrichment for Long-tail Entities

Arxiv

6+阅读 · 2020年2月15日

Attention Guided Graph Convolutional Networks for Relation Extraction

Arxiv

4+阅读 · 2019年10月11日

Long-tail Relation Extraction via Knowledge Graph Embeddings and Graph Convolution Networks

Long-tail Relation Extraction via Knowledge Graph Embeddings and Graph Convolution Networks

Arxiv

8+阅读 · 2019年3月4日

QA4IE: A Question Answering based Framework for Information Extraction

Arxiv

4+阅读 · 2019年1月28日

Joint entity recognition and relation extraction as a multi-head selection problem

Arxiv

3+阅读 · 2018年12月17日

Event Extraction with Generative Adversarial Imitation Learning

Arxiv

13+阅读 · 2018年4月21日

微信扫码咨询专知VIP会员