Task-oriented dialogue systems have been plagued by the difficulties of obtaining large-scale and high-quality annotated conversations. Furthermore, most of the publicly available datasets only include written conversations, which are insufficient to reflect actual human behaviors in practical spoken dialogue systems. In this paper, we propose Task-oriented Dialogue Data Augmentation (TOD-DA), a novel model-agnostic data augmentation paradigm to boost the robustness of task-oriented dialogue modeling on spoken conversations. The TOD-DA consists of two modules: 1) Dialogue Enrichment to expand training data on task-oriented conversations for easing data sparsity and 2) Spoken Conversation Simulator to imitate oral style expressions and speech recognition errors in diverse granularities for bridging the gap between written and spoken conversations. With such designs, our approach ranked first in both tasks of DSTC10 Track2, a benchmark for task-oriented dialogue modeling on spoken conversations, demonstrating the superiority and effectiveness of our proposed TOD-DA.
翻译:以任务为导向的对话系统因难以获得大规模和高质量的附加说明的对话而受到困扰,此外,大多数公开的数据集仅包括书面对话,不足以在实际的口头对话系统中反映实际的人类行为;在本文件中,我们提议采用面向任务的对话数据增强模式(TOD-DA),这是一个新型的模范-不可知性数据增强模式,目的是增强以任务为导向的对话模式在口述对话上的稳健性。TOD-DA由两个模块组成:(1) 强化对话,以扩大关于以任务为导向的对话的培训数据,以缓解数据散居状态;(2) 口述调同声模拟器,以模拟不同颗粒体的口述式表达和语音识别错误,以缩小书面和口述对话之间的差距。有了这种设计,我们的方法在DSTC10轨道2的两项任务中名列第一,这是对口述对话进行任务性对话建模的基准,显示了我们提议的TOD-DD的优越性和有效性。