Document-level relation extraction (DocRE) aims at extracting the semantic relations among entity pairs in a document. In DocRE, a subset of the sentences in a document, called the evidence sentences, might be sufficient for predicting the relation between a specific entity pair. To make better use of the evidence sentences, in this paper, we propose a three-stage evidence-enhanced DocRE framework consisting of joint relation and evidence extraction, evidence-centered relation extraction (RE), and fusion of extraction results. We first jointly train an RE model with a simple and memory-efficient evidence extraction model. Then, we construct pseudo documents based on the extracted evidence sentences and run the RE model again. Finally, we fuse the extraction results of the first two stages using a blending layer and make a final prediction. Extensive experiments show that our proposed framework achieves state-of-the-art performance on the DocRED dataset, outperforming the second-best method by 0.76/0.82 Ign F1/F1. In particular, our method significantly improves the performance on inter-sentence relations by 1.23 Inter F1.
翻译:文件级关系提取(DocRE)的目的是在一份文件中提取实体对对对之间的语义关系。在DocRE中,文件中的一个称为证据句子的子集,可能足以预测特定实体对对的对应关系。为了更好地利用证据判决,我们在本文件中提议了一个由联合关系和证据提取、证据-以证据为中心的关系提取(RE)和提取结果融合组成的加强证据的DocRE三阶段框架。我们首先用一个简单和记忆高效的证据提取模型联合培训RE模型。然后,我们根据提取的证据句子建立假文件,并再次运行RE模型。最后,我们利用混合层整合前两个阶段的提取结果,并作出最后预测。广泛的实验表明,我们提议的框架在DocRED数据集上取得了最新的最新业绩,比第二最佳方法高0.76/0.82 Ign F1/F1。特别是,我们的方法大大改进了1.23个Inter F1的跨点关系绩效。