Stepping from sentence-level to document-level, the research on relation extraction (RE) confronts increasing text length and more complicated entity interactions. Consequently, it is more challenging to encode the key information sources--relevant contexts and entity types. However, existing methods only implicitly learn to model these critical information sources while being trained for RE. As a result, they suffer the problems of ineffective supervision and uninterpretable model predictions. In contrast, we propose to explicitly teach the model to capture relevant contexts and entity types by supervising and augmenting intermediate steps (SAIS) for RE. Based on a broad spectrum of carefully designed tasks, our proposed SAIS method not only extracts relations of better quality due to more effective supervision, but also retrieves the corresponding supporting evidence more accurately so as to enhance interpretability. By assessing model uncertainty, SAIS further boosts the performance via evidence-based data augmentation and ensemble inference while reducing the computational cost. Eventually, SAIS delivers state-of-the-art RE results on three benchmarks (DocRED, CDR, and GDA) and outperforms the runner-up by 5.04% relatively in F1 score in evidence retrieval on DocRED.
翻译:从判决一级到文件一级,关于关系提取(RE)的研究面临越来越多的文字长度和更为复杂的实体互动,因此,对关键信息来源相关背景和实体类型进行编码比较困难,然而,现有方法只是含蓄地在接受RE培训时学习模型这些关键信息来源。结果,它们遭遇了无效监督和无法解释的模型预测等问题。相比之下,我们提议明确教授模型,通过监督和增强RE的中间步骤(SAIS)来捕捉相关背景和实体类型。根据经过仔细设计的广泛任务,我们拟议的SAIS方法不仅提取出由于更有效的监督而具有更好质量的关系,而且还检索相应的辅助证据,以便提高可解释性。通过评估模型不确定性,SAIS进一步通过基于证据的数据增强和共性推断提高绩效,同时降低计算成本。最后,SAIS在三个基准(DocRED、CD和GDA)上提供最新的REA结果,并在相对的F1分级证据中比RED的RD在5-04%前优于RED1分级证据检索。