Recent advances in neural approaches greatly improve task-oriented dialogue (TOD) systems which assist users to accomplish their goals. However, such systems rely on costly manually labeled dialogs which are not available in practical scenarios. In this paper, we present our models for Track 2 of the SereTOD 2022 challenge, which is the first challenge of building semi-supervised and reinforced TOD systems on a large-scale real-world Chinese TOD dataset MobileCS. We build a knowledge-grounded dialog model to formulate dialog history and local KB as input and predict the system response. And we perform semi-supervised pre-training both on the labeled and unlabeled data. Our system achieves the first place both in the automatic evaluation and human interaction, especially with higher BLEU (+7.64) and Success (+13.6\%) than the second place.
翻译:神经方法的近期进步极大地改善了有助于用户实现其目标的任务导向对话系统。然而,这些系统依赖于成本昂贵的人工标签式对话,在实际情况下无法提供。在本文件中,我们介绍了2022年SereTOD挑战第2轨模式,这是在大规模现实世界中国TOD数据集移动CS上建立半监督和强化的半监督和强化的TOD系统的第一个挑战。我们建立了一个知识基础对话模式,以开发对话历史和本地KB作为输入并预测系统响应。我们还就标签式和无标签式数据进行了半监督的预培训。我们的系统在自动评估和人类互动方面都取得了第一位,特别是比第二位高的BLEU(+7.64)和成功(+13.6)。