We study learning of a matching model for response selection in retrieval-based dialogue systems. The problem is equally important with designing the architecture of a model, but is less explored in existing literature. To learn a robust matching model from noisy training data, we propose a general co-teaching framework with three specific teaching strategies that cover both teaching with loss functions and teaching with data curriculum. Under the framework, we simultaneously learn two matching models with independent training sets. In each iteration, one model transfers the knowledge learned from its training set to the other model, and at the same time receives the guide from the other model on how to overcome noise in training. Through being both a teacher and a student, the two models learn from each other and get improved together. Evaluation results on two public data sets indicate that the proposed learning approach can generally and significantly improve the performance of existing matching models.
翻译:在检索对话系统中,我们学习一个匹配的响应选择模式。这个问题在设计一个模式的结构方面同样重要,但在现有的文献中却较少探讨。为了从吵闹的培训数据中学习一个强健的匹配模式,我们提议了一个通用的共同教学框架,其中有三个具体的教学战略,既包括教学与损失功能,也包括教学与数据课程。在这个框架内,我们同时学习两个匹配模式与独立的培训组合。在每一次循环中,一个模式将从培训中学到的知识传授给另一个模式,同时从另一个模式获得如何克服培训中的噪音的指南。通过教师和学生,两个模式相互学习,并一起改进。两个公共数据集的评价结果表明,拟议的学习方法可以普遍地大大改进现有匹配模式的绩效。