Dialog systems have achieved significant progress and have been widely used in various scenarios. The previous researches mainly focused on designing dialog generation models in a single scenario, while comprehensive abilities are required to handle tasks under various scenarios in the real world. In this paper, we propose a general Multi-Skill Dialog Framework, namely MSDF, which can be applied in different dialog tasks (e.g. knowledge grounded dialog and persona based dialog). Specifically, we propose a transferable response generator pre-trained on diverse large-scale dialog corpora as the backbone of MSDF, consisting of BERT-based encoders and a GPT-based decoder. To select the response consistent with dialog history, we propose a consistency selector trained through negative sampling. Moreover, the flexible copy mechanism of external knowledge is also employed to enhance the utilization of multiform knowledge in various scenarios. We conduct experiments on knowledge grounded dialog, recommendation dialog, and persona based dialog tasks. The experimental results indicate that our MSDF outperforms the baseline models with a large margin. In the Multi-skill Dialog of 2021 Language and Intelligence Challenge, our general MSDF won the 3rd prize, which proves our MSDF is effective and competitive.
翻译:先前的研究主要侧重于设计单一情景下的对话生成模型,同时需要全面的能力来处理现实世界中各种情景下的任务。在本文中,我们提议一个通用的多技能对话框架,即MSDF, 可用于不同对话任务(例如知识型对话和人型对话)。具体地说,我们提议一个可转让反应生成器,在各种大型对话中预先培训成为MSDF的骨干,由基于BERT的编码器和基于GPT的解密器组成。为了选择符合对话历史的响应,我们提议一个通过负面抽样培训的一致选择器。此外,还采用灵活的外部知识复制机制,加强多种情景中多形式知识的利用。我们根据知识进行基于对话、建议对话和人型对话任务的实验。实验结果显示,我们的MSDF以巨大的空间超越了基线模型。在2021语言和智能挑战的多技能性分析中,我们通用的MSDFF赢得了第3个奖项,这证明我们的MSDF是有效的和竞争性的。