Task-oriented dialogue systems in industry settings need to have high conversational capability, be easily adaptable to changing situations and conform to business constraints. This paper describes a 3-step procedure to develop a conversational model that satisfies these criteria and can efficiently scale to rank a large set of response candidates. First, we provide a simple algorithm to semi-automatically create a high-coverage template set from historic conversations without any annotation. Second, we propose a neural architecture that encodes the dialogue context and applicable business constraints as profile features for ranking the next turn. Third, we describe a two-stage learning strategy with self-supervised training, followed by supervised fine-tuning on limited data collected through a human-in-the-loop platform. Finally, we describe offline experiments and present results of deploying our model with human-in-the-loop to converse with live customers online.
翻译:在行业环境中,以任务为导向的对话系统需要高度的谈话能力,容易适应不断变化的形势,并符合商业限制。本文描述了一个三步程序,以开发一个满足这些标准并能有效评分大批应答候选人的谈话模式。首先,我们为半自动地创建由历史对话组成的高覆盖模板提供了简单的算法。第二,我们提出了一个神经结构,将对话背景和适用的商业限制编码为下一轮排名的特征。第三,我们描述了一个带有自我监督培训的两阶段学习战略,随后对通过流动人员平台收集的有限数据进行监管的微调。最后,我们描述了离线实验,并介绍了利用人与现场客户进行在线互动的模型部署结果。