We aim to overcome the lack of diversity in responses of current dialogue systems and to develop a dialogue system that is engaging as a conversational partner. We propose a generator-evaluator model that evaluates multiple responses generated by a response generator and selects the best response by an evaluator. By generating multiple responses, we obtain diverse responses. We conduct human evaluations to compare the output of the proposed system with that of a baseline system. The results of the human evaluations showed that the proposed system's responses were often judged to be better than the baseline system's, and indicated the effectiveness of the proposed method.
翻译:我们的目标是克服当前对话系统反应中缺乏多样性的问题,并发展一个作为对话伙伴参与的对话系统。我们提议一个发电机-评价模型,评价反应发电机产生的多种反应,并选择评价员的最佳反应。我们通过产生多种反应,获得不同的反应。我们进行人类评价,将拟议系统的产出与基线系统的产出进行比较。人类评价结果显示,人们往往认为拟议系统的反应优于基线系统,并表明拟议方法的有效性。