With the development of pre-trained language models, many prompt-based approaches to data-efficient knowledge graph construction have been proposed and achieved impressive performance. However, existing prompt-based learning methods for knowledge graph construction are still susceptible to several potential limitations: (i) semantic gap between natural language and output structured knowledge with pre-defined schema, which means model cannot fully exploit semantic knowledge with the constrained templates; (ii) representation learning with locally individual instances limits the performance given the insufficient features, which are unable to unleash the potential analogical capability of pre-trained language models. Motivated by these observations, we propose a retrieval-augmented approach, which retrieves schema-aware Reference As Prompt (RAP), for data-efficient knowledge graph construction. It can dynamically leverage schema and knowledge inherited from human-annotated and weak-supervised data as a prompt for each sample, which is model-agnostic and can be plugged into widespread existing approaches. Experimental results demonstrate that previous methods integrated with RAP can achieve impressive performance gains in low-resource settings on five datasets of relational triple extraction and event extraction for knowledge graph construction. Code is available in https://github.com/zjunlp/RAP.
翻译:随着预训练语言模型的发展,许多基于提示的数据效率知识图谱构建方法已经被提出,并取得了令人印象深刻的性能。然而,现有的基于提示的知识图谱构建学习方法仍然容易受到几个潜在限制的影响:(i)自然语言和预定义模式结构化知识之间的语义鸿沟,这意味着模型不能充分利用受限制的模板的语义知识;(ii)基于局部单独实例的表示学习限制了性能,因为特征不足,无法释放预训练语言模型的潜在类比能力。在这些观察的基础上,我们提出了一种检索增强的方法,其检索基于schema的参考提示(RAP),用于数据效率的知识图谱构建。它可以动态地利用人类注释和弱监督数据继承的schema和知识,作为每个样本的提示,它是模型无关的,可以插入广泛的现有方法中。实验结果表明,在双重三元组提取和事件提取的五个数据集中,先前的方法集成RAP可以在低资源设置中实现令人印象深刻的性能提升。代码可以在https://github.com/zjunlp/RAP中找到。