An integral part of seamless human-human communication is engagement, the process by which two or more participants establish, maintain, and end their perceived connection. Therefore, to develop successful human-centered human-machine interaction applications, automatic engagement inference is one of the tasks required to achieve engaging interactions between humans and machines, and to make machines attuned to their users, hence enhancing user satisfaction and technology acceptance. Several factors contribute to engagement state inference, which include the interaction context and interactants' behaviours and identity. Indeed, engagement is a multi-faceted and multi-modal construct that requires high accuracy in the analysis and interpretation of contextual, verbal and non-verbal cues. Thus, the development of an automated and intelligent system that accomplishes this task has been proven to be challenging so far. This paper presents a comprehensive survey on previous work in engagement inference for human-machine interaction, entailing interdisciplinary definition, engagement components and factors, publicly available datasets, ground truth assessment, and most commonly used features and methods, serving as a guide for the development of future human-machine interaction interfaces with reliable context-aware engagement inference capability. An in-depth review across embodied and disembodied interaction modes, and an emphasis on the interaction context of which engagement perception modules are integrated sets apart the presented survey from existing surveys.
翻译:人与人之间无缝交流的一个组成部分是参与,这是两个或两个以上参与者建立、保持和结束所认为的联系的过程,因此,开发成功的以人为中心的人与人-机器互动应用,自动参与推断是实现人与机器之间互动,使机器适应用户,从而提高用户满意度和技术接受度所必须完成的任务之一。若干因素促成了参与状态推断,其中包括互动背景和互动者的行为和身份。事实上,参与是一个多层面和多模式的构思,需要在分析和解释背景、语言和非语言提示方面高度精确。因此,开发一个能够完成这项任务的自动化和智能系统已证明具有如此巨大的挑战性。本文件对以前参与人类-机器互动推断的工作进行了全面调查,其中涉及跨学科定义、参与组成部分和因素、公开提供的数据集、地面真相评估以及最常用的特点和方法。参与是一个多方面的多模式,它需要用来指导未来人-机器互动互动互动互动互动互动互动的开发,而可靠的背景、语言和非语言感触觉参与能力则需要高度精确地分析和解释。从现有互动模式和互动调查中强调现有互动方式。