This paper explores how the Distantly Supervised Relation Extraction (DS-RE) can benefit from the use of a Universal Graph (UG), the combination of a Knowledge Graph (KG) and a large-scale text collection. A straightforward extension of a current state-of-the-art neural model for DS-RE with a UG may lead to degradation in performance. We first report that this degradation is associated with the difficulty in learning a UG and then propose two training strategies: (1) Path Type Adaptive Pretraining, which sequentially trains the model with different types of UG paths so as to prevent the reliance on a single type of UG path; and (2) Complexity Ranking Guided Attention mechanism, which restricts the attention span according to the complexity of a UG path so as to force the model to extract features not only from simple UG paths but also from complex ones. Experimental results on both biomedical and NYT10 datasets prove the robustness of our methods and achieve a new state-of-the-art result on the NYT10 dataset. The code and datasets used in this paper are available at https://github.com/baodaiqin/UGDSRE.
翻译:本文探讨远端监督关系提取(DS-RE)如何能从通用图(UG)的使用、知识图(KG)的组合和大规模文本收集中受益。DS-RE当前最新神经模型与UG的简单扩展可能会导致性能退化。我们首先报告,这种退化与学习UG的困难相关,然后提出两项培训战略:(1) 路径类型适应性预备训练,它依次以不同类型UG路径对模型进行培训,以防止对单一类型UG路径的依赖;(2) 复杂程度排列引导注意机制,它根据UG路径的复杂性限制关注范围,以便迫使模型不仅从简单的UG路径,而且从复杂的路径提取特征。生物医学和NYT10数据集的实验结果证明了我们方法的稳健性,并在NYT10数据集上实现了新的状态艺术结果。本文中使用的代码和数据设置在http://s/GDA/EGA/GA. 中提供。