Many NLP models gain performance by having access to a knowledge base. A lot of research has been devoted to devising and improving the way the knowledge base is accessed and incorporated into the model, resulting in a number of mechanisms and pipelines. Despite the diversity of proposed mechanisms, there are patterns in the designs of such systems. In this paper, we systematically describe the typology of artefacts (items retrieved from a knowledge base), retrieval mechanisms and the way these artefacts are fused into the model. This further allows us to uncover combinations of design decisions that had not yet been tried. Most of the focus is given to language models, though we also show how question answering, fact-checking and knowledgable dialogue models fit into this system as well. Having an abstract model which can describe the architecture of specific models also helps with transferring these architectures between multiple NLP tasks.
翻译:许多NLP模式通过进入知识库获得业绩,许多研究都致力于设计和改进知识库的获取和融入模式的方式,从而形成若干机制和管道。尽管拟议的机制多种多样,但这类系统的设计模式也各不相同。在本文件中,我们系统地描述手工艺品的类型(从知识库检索的项目)、检索机制以及这些手工艺品与模型的结合方式。这还使我们能够发现尚未尝试过的设计决定的组合。大多数重点都放在语言模型上,尽管我们也展示了如何回答问题、核实事实和了解的对话模式如何适合这一系统。有了能够描述具体模型结构的抽象模型,也有助于将这些结构在多种NLP任务之间转移。