Many underwater tasks, such as cable-and-wreckage inspection, search-and-rescue, benefit from robust human-robot interaction (HRI) capabilities. With the recent advancements in vision-based underwater HRI methods, autonomous underwater vehicles (AUVs) can communicate with their human partners even during a mission. However, these interactions usually require active participation especially from humans (e.g., one must keep looking at the robot during an interaction). Therefore, an AUV must know when to start interacting with a human partner, i.e., if the human is paying attention to the AUV or not. In this paper, we present a diver attention estimation framework for AUVs to autonomously detect the attentiveness of a diver and then navigate and reorient itself, if required, with respect to the diver to initiate an interaction. The core element of the framework is a deep neural network (called DATT-Net) which exploits the geometric relation among 10 facial keypoints of the divers to determine their head orientation. Our on-the-bench experimental evaluations (using unseen data) demonstrate that the proposed DATT-Net architecture can determine the attentiveness of human divers with promising accuracy. Our real-world experiments also confirm the efficacy of DATT-Net which enables real-time inference and allows the AUV to position itself for an AUV-diver interaction.
翻译:许多水下任务,如电缆和故障检查、搜索和救援等,都受益于强大的人类机器人互动能力。随着基于视觉的水下HRI方法最近的进展,自主水下飞行器(AUVs)即使在任务期间也可以与人类伙伴进行交流;然而,这些互动通常需要特别是人类的积极参与(例如,在互动期间必须不断查看机器人),因此,AUV必须知道何时开始与人类伙伴进行互动,即,如果人类是否在关注AUV。在本文中,我们为AVs提供了一个潜水关注度估计框架,以自主地检测潜水员的注意度,然后在必要时对潜水员进行导航和调整。这个框架的核心要素是一个深层的神经网络(在互动中必须不断查看机器人的10个面部关键点之间的几何关系,以确定其方向。我们关于潜水的实验位置(使用不可见的数据)表明,AUVTS-Net网络的拟议精确度也能够使AUATS-ATS-Net的真实性能确定真实性。