Recent literature focuses on utilizing the entity information in the sentence-level relation extraction (RE), but this risks leaking superficial and spurious clues of relations. As a result, RE still suffers from unintended entity bias, i.e., the spurious correlation between entity mentions (names) and relations. Entity bias can mislead the RE models to extract the relations that do not exist in the text. To combat this issue, some previous work masks the entity mentions to prevent the RE models from overfitting entity mentions. However, this strategy degrades the RE performance because it loses the semantic information of entities. In this paper, we propose the CORE (Counterfactual Analysis based Relation Extraction) debiasing method that guides the RE models to focus on the main effects of textual context without losing the entity information. We first construct a causal graph for RE, which models the dependencies between variables in RE models. Then, we propose to conduct counterfactual analysis on our causal graph to distill and mitigate the entity bias, that captures the causal effects of specific entity mentions in each instance. Note that our CORE method is model-agnostic to debias existing RE systems during inference without changing their training processes. Extensive experimental results demonstrate that our CORE yields significant gains on both effectiveness and generalization for RE. The source code is provided at: https://github.com/vanoracai/CoRE.
翻译:最近的文献侧重于在句级关系提取(RE)中利用实体信息,但这有可能泄露表面和虚假的关系线索。因此,RE仍然受到意外实体偏差的影响,即实体提及(名称)和关系之间的虚假关联。实体偏差会误导RE模型,以解析案文中不存在的关系。为解决这一问题,该实体先前的一些工作掩盖了该实体提到防止RE模型过分装配实体提及的问题。然而,这一战略降低了RE的绩效,因为它失去了实体的语义信息。在本文件中,我们提议CORE(基于RelationExpresson的对事实的分析)消除偏见,以指导RE模式侧重于文本环境的主要影响,而同时又不丧失实体信息。我们首先为RE建立一个因果图,以模型显示RE模型和变异模型中的变量之间的依赖性关系。然后,我们提议对我们的因果关系图进行反事实分析,以淡化和减轻实体的偏差,以捕捉具体实体在每例中提及的因果效应。我们指出,在CORE方法中,我们的REA分析方法是显著的模型/变现结果。