Traditional end-to-end task-oriented dialog systems first convert dialog context into dialog state and action state, before generating the system response. In this paper, we first empirically investigate the relationship between dialog/action state and generated system response. The empirical exploration shows that the system response performance is significantly affected by the quality of dialog state and action state. Based on these findings, we argue that enhancing the relationship modeling between dialog context and dialog/action state is beneficial to improving the quality of the dialog state and action state, which further improves the generated response quality. Therefore, we propose Mars, an end-to-end task-oriented dialog system with semantic-aware contrastive learning strategies to model the relationship between dialog context and dialog/action state. Empirical results show our proposed Mars achieves state-of-the-art performance on the MultiWOZ 2.0, CamRest676, and CrossWOZ.
翻译:传统的端对端任务导向对话系统首先将对话环境转换为对话状态和行动状态,然后才产生系统响应。 在本文中,我们首先从经验角度调查对话/行动状态和生成系统响应之间的关系。 经验探索显示,系统响应性能受到对话状态和行动状态质量的重大影响。 根据这些调查结果,我们认为,加强对话背景与对话/行动状态之间的建模关系有利于提高对话状态和行动状态的质量,从而进一步提高生成的响应质量。 因此,我们建议火星是一个面向终端任务的对话体系,配有语义认知的对比学习战略,以模拟对话背景与对话/行动状态之间的关系。 经验研究结果显示,我们提议的火星在多WOZ2.0、CamRest676和交叉WOZ上取得了最先进的表现。