Design of dialogue systems has witnessed many advances lately, yet acquiring huge set of data remains an hindrance to their fast development for a new task or language. Besides, training interactive systems with batch data is not satisfactory. On-line learning is pursued in this paper as a convenient way to alleviate these difficulties. After the system modules are initiated, a single process handles data collection, annotation and use in training algorithms. A new challenge is to control the cost of the on-line learning borne by the user. Our work focuses on learning the semantic parsing and dialogue management modules (speech recognition and synthesis offer ready-for-use solutions). In this context we investigate several variants of simultaneous learning which are tested in user trials. In our experiments, with varying merits, they can all achieve good performance with only a few hundreds of training dialogues and overstep a handcrafted system. The analysis of these experiments gives us some insights, discussed in the paper, into the difficulty for the system's trainers to establish a coherent and constant behavioural strategy to enable a fast and good-quality training phase.
翻译:对话系统的设计最近取得了许多进展,但获得大量数据仍然是妨碍其迅速发展新任务或新语言的障碍。此外,用批量数据培训互动系统并不令人满意。本文以在线学习作为缓解这些困难的方便方法。在系统模块启动后,单一程序处理数据收集、说明和培训算法中的使用。一个新的挑战是控制用户承担的在线学习费用。我们的工作重点是学习语义分解和对话管理模块(语音识别和合成提供随时可用的解决方案)。在这方面,我们调查在用户试验中测试的同时学习的几种变式。在我们的实验中,它们都只能通过几百次培训对话取得良好的业绩,并超越一个手写系统。对这些实验的分析使我们从文件中讨论到系统培训员难以制定一致和持续的行为战略,以便能够有一个快速和高质量的培训阶段。