The Emotion-Cause Pair Extraction (ECPE) task aims to extract emotions and causes as pairs from documents. We observe that the relative distance distribution of emotions and causes is extremely imbalanced in the typical ECPE dataset. Existing methods have set a fixed size window to capture relations between neighboring clauses. However, they neglect the effective semantic connections between distant clauses, leading to poor generalization ability towards position-insensitive data. To alleviate the problem, we propose a novel Multi-Granularity Semantic Aware Graph model (MGSAG) to incorporate fine-grained and coarse-grained semantic features jointly, without regard to distance limitation. In particular, we first explore semantic dependencies between clauses and keywords extracted from the document that convey fine-grained semantic features, obtaining keywords enhanced clause representations. Besides, a clause graph is also established to model coarse-grained semantic relations between clauses. Experimental results indicate that MGSAG surpasses the existing state-of-the-art ECPE models. Especially, MGSAG outperforms other models significantly in the condition of position-insensitive data.
翻译:情感-原因派尔提取(ECPE)任务旨在从文档中提取情感和原因。我们观察到,情感和原因的相对距离分布在典型的ECPE数据集中极不平衡。现有的方法设置了一个固定大小窗口,以捕捉相邻条款之间的关系。然而,它们忽视了遥远条款之间有效的语义联系,导致对位置不敏感数据的概括性能力差。为了缓解这一问题,我们提议了一个新型的多毛性语义认知图形模型(MGSAG),以在不考虑距离限制的情况下,将精细和粗粗粗的语义特征联合纳入其中。特别是,我们首先探索了从传达微细的语义特征的文档中提取的条款和关键词之间的语义依赖性,获得了关键词强化条款表述。此外,还建立了一条条款图,以模拟粗微的语系关系条款。实验结果表明,MSAGAG超过了现有先进的ECPE模型。特别是,MSAGGG在数据敏感位置上大大超越了其他模型。