Sequence generation demonstrates promising performance in recent information extraction efforts, by incorporating large-scale pre-trained Seq2Seq models. This paper investigates the merits of employing sequence generation in relation extraction, finding that with relation names or synonyms as generation targets, their textual semantics and the correlation (in terms of word sequence pattern) among them affect model performance. We then propose Relation Extraction with Label Augmentation (RELA), a Seq2Seq model with automatic label augmentation for RE. By saying label augmentation, we mean prod semantically synonyms for each relation name as the generation target. Besides, we present an in-depth analysis of the Seq2Seq model's behavior when dealing with RE. Experimental results show that RELA achieves competitive results compared with previous methods on four RE datasets.
翻译:序列生成表明,在最近的信息提取工作中,通过纳入大规模预先培训的Seq2Seqeq模型,取得了有希望的成绩。本文调查了在提取时采用序列生成的优点,发现在将名称或同义词作为生成目标时,它们之间的文字语义和相关性(在字序列模式方面)会影响模型性能。然后我们提议与Label增殖(REA)关系提取(REA)关系(RELA)关系,这是Seq2Seqeq 模型,自动增加RE的标签。我们说,标签扩增是指每个关系名称的语义同义,作为生成目标。此外,我们对Seq2Seqeq处理RE时的行为进行了深入分析。实验结果显示,REA在四个RE数据集上取得了与以往方法相比的竞争结果。