跨语言关系抽取的提示学习 (Prompt-Learning for Cross-Lingual Relation Extraction)

Relation Extraction (RE) is a crucial task in Information Extraction, which entails predicting relationships between entities within a given sentence. However, extending pre-trained RE models to other languages is challenging, particularly in real-world scenarios where Cross-Lingual Relation Extraction (XRE) is required. Despite recent advancements in Prompt-Learning, which involves transferring knowledge from Multilingual Pre-trained Language Models (PLMs) to diverse downstream tasks, there is limited research on the effective use of multilingual PLMs with prompts to improve XRE. In this paper, we present a novel XRE algorithm based on Prompt-Tuning, referred to as Prompt-XRE. To evaluate its effectiveness, we design and implement several prompt templates, including hard, soft, and hybrid prompts, and empirically test their performance on competitive multilingual PLMs, specifically mBART. Our extensive experiments, conducted on the low-resource ACE05 benchmark across multiple languages, demonstrate that our Prompt-XRE algorithm significantly outperforms both vanilla multilingual PLMs and other existing models, achieving state-of-the-art performance in XRE. To further show the generalization of our Prompt-XRE on larger data scales, we construct and release a new XRE dataset- WMT17-EnZh XRE, containing 0.9M English-Chinese pairs extracted from WMT 2017 parallel corpus. Experiments on WMT17-EnZh XRE also show the effectiveness of our Prompt-XRE against other competitive baselines. The code and newly constructed dataset are freely available at \url{https://github.com/HSU-CHIA-MING/Prompt-XRE}.

翻译：关系抽取（RE）是信息提取中的重要任务，涉及预测给定句子中实体之间的关系。然而，将预训练的RE模型扩展到其他语言尤其在需要跨语言关系抽取（XRE）的实际情况下具有挑战性。尽管最近Prompt-Learning取得了一定的成就，Prompt-Learning涉及从多语言预训练语言模型（PLMs）向各种下游任务传递知识，但是在有效使用提示的多语言PLMs方面应用于改善XRE的研究有限。在本文中，我们提出了一种基于Prompt-Tuning的XRE算法，称为Prompt-XRE。为了评估其有效性，我们设计并实现了几个提示模板，包括硬提示，软提示和混合提示，并在竞争性的多语言PLMs mBART上进行实证测试。我们广泛的实验在多种语言上在低资源ACE05基准测试上进行，结果显示，我们的Prompt-XRE算法在XRE方面显著优于原始的多语言PLMs和其他现有模型，实现了XRE的最新性能。为了进一步证明我们的Prompt-XRE对更大数据规模的泛化能力，我们构建和发布了一个新的XRE数据集-WMT17-EnZh XRE，其中包含从WMT 2017平行语料库中提取的0.9M英汉对。在WMT17-EnZh XRE的实验中，我们的Prompt-XRE相比于其他竞争基线表现更为出色。代码和新构建的数据集可在 \url{https://github.com/HSU-CHIA-MING/Prompt-XRE}上免费获取。