Existing dialog system models require extensive human annotations and are difficult to generalize to different tasks. The recent success of large pre-trained language models such as BERT and GPT-2 (Devlin et al., 2019; Radford et al., 2019) have suggested the effectiveness of incorporating language priors in down-stream NLP tasks. However, how much pre-trained language models can help dialog response generation is still under exploration. In this paper, we propose a simple, general, and effective framework: Alternating Roles Dialog Model (ARDM). ARDM models each speaker separately and takes advantage of the large pre-trained language model. It requires no supervision from human annotations such as belief states or dialog acts to achieve effective conversations. ARDM outperforms or is on par with state-of-the-art methods on two popular task-oriented dialog datasets: CamRest676 and MultiWOZ. Moreover, we can generalize ARDM to more challenging, non-collaborative tasks such as persuasion. In persuasion tasks, ARDM is capable of generating human-like responses to persuade people to donate to a charity.
翻译:现有对话系统模型需要广泛的人文说明,难以概括到不同的任务。如BERT和GPT-2(Devlin等人,2019年;Radford等人,2019年)等大型预先培训语言模型最近的成功表明,将语言前科纳入下游NLP任务中是有效的。然而,目前仍在探索有多少经过培训的语言模型有助于对话回应生成。在本文件中,我们提出了一个简单、一般和有效的框架:角色对口模式(ARDM)。ARDM模型每个演讲人分别使用大型预先培训的语言模型。它不需要人文说明的监督,例如信仰状态或对话行为,以实现有效的对话。ARDM在两个面向任务的流行对话数据集(CamRest676和MultiWOZ)上,超越或接近最先进的方法。此外,我们可以将ARDM用于更具挑战性、非协作性的任务,如说服。ARDM的任务是能够产生像人类一样的反应,说服人们捐赠慈善。