Zero-shot relation triplet extraction (ZeroRTE) aims to extract relation triplets from unstructured texts, while the relation sets at the training and testing stages are disjoint. Previous state-of-the-art method handles this challenging task by leveraging pretrained language models to generate data as additional training samples, which increases the training cost and severely constrains the model performance. We tackle this task from a new perspective and propose a novel method named PCRED for ZeroRTE with Potential Candidate Relation selection and Entity boundary Detection. The model adopts a relation-first paradigm, which firstly recognizes unseen relations through candidate relation selection. By this approach, the semantics of relations are naturally infused in the context. Entities are extracted based on the context and the semantics of relations subsequently. We evaluate our model on two ZeroRTE datasets. The experiment result shows that our method consistently outperforms previous works. Besides, our model does not rely on any additional data, which boasts the advantages of simplicity and effectiveness. Our code is available at https://anonymous.4open.science/r/PCRED.
翻译:零点关系三重提取(ZeroRTE)旨在从无结构文本中提取三重关系,而培训和测试阶段的关系组则脱节。以前的先进方法利用预先培训的语言模型作为额外的培训样本来生成数据,从而增加培训成本,严重制约模型的性能,从而应对这一任务,我们从新的角度处理这项任务,并提议一种名为 " PCRED " 的新型方法,与潜在候选者进行选择和实体边界探测。模型采用了一种关系第一模式,首先通过选择候选人关系来识别隐蔽的关系。通过这种方法,关系的语义自然地在背景中被使用。实体根据背景和随后的关系语义进行提取。我们用两个零点语言数据集来评估我们的模型。实验结果显示,我们的方法一贯地超越了以前的工程。此外,我们的模型并不依赖任何额外的数据,这说明简单和有效性的好处。我们的代码可以在 https://anonimous.4open.s/r/PCRED。