Knowledge-grounded dialogue is a task of generating a fluent and informative response based on both conversation context and a collection of external knowledge, in which knowledge selection plays an important role and attracts more and more research interest. However, most existing models either select only one knowledge or use all knowledge for responses generation. The former may lose valuable information in discarded knowledge, while the latter may bring a lot of noise. At the same time, many approaches need to train the knowledge selector with knowledge labels that indicate ground-truth knowledge, but these labels are difficult to obtain and require a large number of manual annotations. Motivated by these issues, we propose Knoformer, a dialogue response generation model based on reinforcement learning, which can automatically select one or more related knowledge from the knowledge pool and does not need knowledge labels during training. Knoformer is evaluated on two knowledge-guided conversation datasets, and achieves state-of-the-art performance.
翻译:以知识为基础的对话是一项基于对话背景和外部知识集成的基础上产生流畅和内容丰富的反应的任务,在这种反应中,知识选择起着重要作用,吸引了越来越多的研究兴趣,然而,大多数现有模式要么只选择一种知识,要么利用所有知识来生成应对能力,前者可能失去在被抛弃的知识中的宝贵信息,而后者则可能造成大量噪音。与此同时,许多方法都需要用显示地面真实性知识的知识标签来培训知识选择者,但这些标签很难获得,需要大量手工说明。我们建议Knoeron,这是建立在强化学习基础上的对话响应生成模型,可以自动从知识库中选择一种或多种相关知识,在培训期间不需要知识标签。Knoeor用两种知识引导的对话数据集进行评估,并实现最先进的性能。