This paper describes a personality-adaptive multimodal dialogue system developed for the Dialogue Robot Competition 2022. To realize a dialogue system that adapts the dialogue strategy to individual users, it is necessary to consider the user's nonverbal information and personality. In this competition, we built a prototype of a user-adaptive dialogue system that estimates user personality during dialogue. Pretrained DNN models are used to estimate user personalities annotated as Big Five scores. This model is embedded in a dialogue system to estimate user personality from face images during the dialogue. We proposed a method for dialogue management that changed the dialogue flow based on the estimated personality characteristics and confirmed that the system works in a real environment in the preliminary round of this competition. Furthermore, we implemented specific modules to enhance the multimodal dialogue experience of the user, including personality assessment, controlling facial expressions and movements of the android, and dialogue management to explain the attractiveness of sightseeing spots. The aim of dialogue based on personality assessment is to reduce the nervousness of users, and it acts as an ice breaker. The android's facial expressions and movements are necessary for a more natural android conversation. Since the task of this competition was to promote the appeal of sightseeing spots and to recommend an appropriate sightseeing spot, the dialogue process for how to explain the attractiveness of the spot is important. All results of the subjective evaluation by users were better than those of the baseline and other systems developed for this competition. The proposed dialogue system ranked first in both "Impression Rating" and "Effectiveness of Android Recommendations". According to the total evaluation in the competition, the proposed system was ranked first overall.
翻译:本文描述了为2022年“对话机器人竞赛”开发的个性适应型多式联运对话系统。为了实现一个使对话战略适应个人用户的对话系统,有必要考虑用户的非语言信息和个性。在这一竞争中,我们建立了一个用户适应型对话系统原型,在对话期间估计用户个性;使用预先培训的DNN模型来估计用户个性,标记为“五分大”。这一模型嵌入一个对话系统,从对话期间的面部图像中估计用户个性。我们建议了一种对话管理方法,根据估计的个性特征改变对话流,并确认该系统在本次竞争的初步回合中在真实的环境中运作。此外,我们实施了具体的模块,以加强用户的多模式对话经验,包括个性评估、控制面部位表和运动以及对话管理,以解释视觉景色点的吸引力。基于个性评估的目的是减少用户的紧张性,并起到打破头级的作用。我们提出的面部和机器人的面部和动作评估对于更自然和类比总体对话的总体对话更有必要。由于本次对话的任务,因此,“关于直位评估的准确性评估”的准确性评估是用来推至正确评估系统。关于直位的分级。关于正确性评估的分级。关于直位评估是,所有对话过程的目至分级。关于正确性评估过程的推至正确性评估过程的推至正确性评估的推至正确性评估过程的推至正确性评估的推至最重要评。关于正确性评估的推至正确性评估过程的推至正确性评估是,这是所有分。