It is important for socially assistive robots to be able to recognize when a user needs and wants help. Such robots need to be able to recognize human needs in a real-time manner so that they can provide timely assistance. We propose an architecture that uses social cues to determine when a robot should provide assistance. Based on a multimodal fusion approach upon eye gaze and language modalities, our architecture is trained and evaluated on data collected in a robot-assisted Lego building task. By focusing on social cues, our architecture has minimal dependencies on the specifics of a given task, enabling it to be applied in many different contexts. Enabling a social robot to recognize a user's needs through social cues can help it to adapt to user behaviors and preferences, which in turn will lead to improved user experiences.
翻译:社会辅助机器人在用户需要和需要帮助时必须能够识别。 此类机器人需要能够实时地识别人类需求,以便他们能够及时提供援助。 我们建议一个使用社会提示来决定机器人何时提供援助的架构。 基于对眼神和语言模式的多式混合方法,我们的架构在机器人协助的乐高建筑任务中收集的数据上接受培训和评价。 通过关注社会提示,我们的架构对特定任务的具体细节依赖性极小,使其能够在许多不同的背景下应用。 使社会机器人能够通过社会提示来识别用户的需求,可以帮助它适应用户的行为和偏好,这反过来会改善用户的经验。