Prior work has demonstrated that data augmentation is useful for improving dialogue state tracking. However, there are many types of user utterances, while the prior method only considered the simplest one for augmentation, raising the concern about poor generalization capability. In order to better cover diverse dialogue acts and control the generation quality, this paper proposes controllable user dialogue act augmentation (CUDA-DST) to augment user utterances with diverse behaviors. With the augmented data, different state trackers gain improvement and show better robustness, achieving the state-of-the-art performance on MultiWOZ 2.1
翻译:先前的工作表明,数据扩增有助于改进对话状态的跟踪跟踪,但有多种类型的用户话语,而先前的方法仅考虑最简单的扩增方法,引起人们对普遍化能力差的关注。为了更好地涵盖多种对话行为和控制发电质量,本文件提议可控用户对话法案扩增(CUDA-DST),以增加不同行为的用户话语。随着数据扩充,不同的州跟踪器得到改进,并表现出更强的稳健性,实现了多WOZ 2. 1 的最新业绩。