In open-domain conversational systems, it is important but challenging to leverage background knowledge. We can use the incorporation of knowledge to make the generation of dialogue controllable, and can generate more diverse sentences that contain real knowledge. In this paper, we combine the knowledge bases and pre-training model to propose a knowledge-driven conversation system. The system includes modules such as dialogue topic prediction, knowledge matching and dialogue generation. Based on this system, we study the performance factors that maybe affect the generation of knowledge-driven dialogue: topic coarse recall algorithm, number of knowledge choices, generation model choices, etc., and finally made the system reach state-of-the-art. These experimental results will provide some guiding significance for the future research of this task. As far as we know, this is the first work to study and analyze the effects of the related factors.
翻译:在开放式对话系统中,利用背景知识很重要,但具有挑战性。我们可以使用知识集成,使对话的生成具有可控性,并能够产生包含真正知识的更多样化的句子。在本文中,我们将知识基础和培训前模式结合起来,以提出知识驱动的对话系统。该系统包括对话主题预测、知识匹配和对话生成等模块。基于这个系统,我们研究可能影响知识驱动对话生成的性能因素:主题粗糙回溯算法、知识选择数量、生成模式选择等,并最终使系统达到最新水平。这些实验结果将为今后研究这项任务提供一些指导意义。据我们所知,这是研究和分析相关因素影响的首项工作。