Zero-shot relation triplet extraction (ZeroRTE) aims to extract relation triplets from unstructured texts under the zero-shot setting, where the relation sets at the training and testing stages are disjoint. Previous state-of-the-art method handles this challenging task by leveraging pretrained language models to generate data as additional training samples, which increases the training cost and severely constrains the model performance. To address the above issues, we propose a novel method named PCRED for ZeroRTE with Potential Candidate Relation Selection and Entity Boundary Detection. The remarkable characteristic of PCRED is that it does not rely on additional data and still achieves promising performance. The model adopts a relation-first paradigm, recognizing unseen relations through candidate relation selection. With this approach, the semantics of relations are naturally infused in the context. Entities are extracted based on the context and the semantics of relations subsequently. We evaluate our model on two ZeroRTE datasets. The experiment results show that our method consistently outperforms previous works. Our code will be available at https://anonymous.4open.science/r/PCRED.
翻译:零点关系三重提取(ZeroRTE)旨在从零点设置下未结构化的文本中提取三重关系,在零点设置下,培训和测试阶段的整套关系是脱节的。以前最先进的方法利用预先培训的语言模型作为额外的培训样本来生成数据,这增加了培训成本,严重限制了模型的性能。为了解决上述问题,我们提议了一个名为PCRED的新颖方法,用于与潜在候选人进行零点选择和实体边界探测。PCRED的显著特征是,它不依赖额外数据,仍然能够取得有希望的业绩。该模型采用了一种关系第一模式,通过选择候选人关系来认识未见的关系。采用这一方法,关系的语义自然在环境中被使用。实体根据背景和随后的关系语义进行提取。我们用两个ZeroRTE数据集来评估我们的模型。实验结果显示,我们的方法始终超越了以前的工作。我们的代码将在 https://anonimous.4open.science/r/PCRED) 。