Attention (and distraction) recognition is a key factor in improving human-robot collaboration. We present an assembly scenario where a human operator and a cobot collaborate equally to piece together a gearbox. The setup provides multiple opportunities for the cobot to adapt its behavior depending on the operator's attention, which can improve the collaboration experience and reduce psychological strain. As a first step, we recognize the areas in the workspace that the human operator is paying attention to, and consequently, detect when the operator is distracted. We propose a novel deep-learning approach to develop an attention recognition model. First, we train a convolutional neural network to estimate the gaze direction using a publicly available image dataset. Then, we use transfer learning with a small dataset to map the gaze direction onto pre-defined areas of interest. Models trained using this approach performed very well in leave-one-subject-out evaluation on the small dataset. We performed an additional validation of our models using the video snippets collected from participants working as an operator in the presented assembly scenario. Although the recall for the Distracted class was lower in this case, the models performed well in recognizing the areas the operator paid attention to. To the best of our knowledge, this is the first work that validated an attention recognition model using data from a setting that mimics industrial human-robot collaboration. Our findings highlight the need for validation of attention recognition solutions in such full-fledged, non-guided scenarios.
翻译:注意力(和分心)识别是改善人机协作的关键因素。我们提出了一个组装场景,在这个场景中,人类操作员和协作机器人平等地拼装齿轮箱。此设置提供了多个机会,让协作机器人根据操作员的注意力调整其行为,从而提高协作体验并减少心理负担。作为第一步,我们识别人类操作员关注的工作区域,并因此检测操作员是否分心。我们提出了一种新颖的深度学习方法来开发注意力识别模型。首先,我们使用公开可用的图像数据集来训练卷积神经网络来估计注视方向。然后,我们使用小的数据集进行转移学习,将注视方向映射到预先定义的关注区域。使用这种方法训练的模型在小数据集的分组交叉验证中表现良好。我们使用从参与者在所提出的组装场景下作为操作员工作时收集的视频片段进行了额外的模型验证。虽然在这种情况下“分心”类别的召回率较低,但模型在识别操作员关注的区域方面表现良好。据我们所知,这是首次使用模拟工业人机协作的数据验证注意力识别模型的研究。我们的研究结果突出了在这种全面的、非指导性场景中验证注意力识别解决方案的必要性。