探测人与阿凡达对话中干扰对话者:试点研究 (Detecting Interlocutor Confusion in Situated Human-Avatar Dialogue: A Pilot Study)

from arxiv, 8 figures, 10pages including 2pages reference. Conference: https://semdial2021.ling.uni-potsdam.de/, Paper link:https://semdial2021.ling.uni-potsdam.de/assets/semdial2021_potsdial_full_proceedings.pdf

In order to enhance levels of engagement with conversational systems, our long term research goal seeks to monitor the confusion state of a user and adapt dialogue policies in response to such user confusion states. To this end, in this paper, we present our initial research centred on a user-avatar dialogue scenario that we have developed to study the manifestation of confusion and in the long term its mitigation. We present a new definition of confusion that is particularly tailored to the requirements of intelligent conversational system development for task-oriented dialogue. We also present the details of our Wizard-of-Oz based data collection scenario wherein users interacted with a conversational avatar and were presented with stimuli that were in some cases designed to invoke a confused state in the user. Post study analysis of this data is also presented. Here, three pre-trained deep learning models were deployed to estimate base emotion, head pose and eye gaze. Despite a small pilot study group, our analysis demonstrates a significant relationship between these indicators and confusion states. We understand this as a useful step forward in the automated analysis of the pragmatics of dialogue.

翻译：为提高对话系统的参与程度,我们的长期研究目标力求监测用户的混乱状态,并针对用户的混乱状况调整对话政策。为此,我们在本文件中介绍我们最初的研究,其重点是我们为研究混乱的表现和长期缓解混乱而开发的用户-avatar对话设想方案。我们提出了一个新的混乱定义,特别针对为面向任务的对话而开发智能对话系统的要求。我们还介绍了我们的基于“奥兹魔法”的数据收集设想方案的细节,即用户与对话的阿凡达互动,并被介绍给Stimuli,这在某些情况下是为了在用户中援引混乱状态。还介绍了对这些数据的邮政研究分析。在这里,我们采用了三个经过事先训练的深层次学习模型来估计基本情感、头部和眼神。尽管有一个小型的试点研究小组,但我们的分析表明这些指标与混乱状态之间的重要关系。我们理解,这是对对话的实用性进行自动分析的一个有益步骤。