We study the learning of a matching model for dialogue response selection. Motivated by the recent finding that models trained with random negative samples are not ideal in real-world scenarios, we propose a hierarchical curriculum learning framework that trains the matching model in an "easy-to-difficult" scheme. Our learning framework consists of two complementary curricula: (1) corpus-level curriculum (CC); and (2) instance-level curriculum (IC). In CC, the model gradually increases its ability in finding the matching clues between the dialogue context and a response candidate. As for IC, it progressively strengthens the model's ability in identifying the mismatching information between the dialogue context and a response candidate. Empirical studies on three benchmark datasets with three state-of-the-art matching models demonstrate that the proposed learning framework significantly improves the model performance across various evaluation metrics.
翻译:我们研究了对话响应选择匹配模式的学习。最近发现,在现实世界情景中,随机负面抽样培训模式并不理想,我们为此提出一个等级课程学习框架,在“容易困难”计划下对匹配模式进行培训。我们的学习框架由两个互补课程组成:(1) 实体级课程(CC);(2) 实例级课程(IC ) 。在CC 中,该模式逐渐提高了其在寻找对话背景与响应候选之间匹配线索的能力。在IC 方面,该模式逐步加强了该模式在确定对话背景与响应候选之间不匹配信息的能力。关于三个基准数据集和三个最先进的匹配模型的经验性研究显示,拟议的学习框架极大地改善了各种评估指标的模型性能。