We consider the problem of navigating a mobile robot towards a target in an unknown environment that is endowed with visual sensors, where neither the robot nor the sensors have access to global positioning information and only use first-person-view images. While prior work in sensor network based navigation uses explicit mapping and planning techniques, and are often aided by external positioning systems, we propose a vision-only based learning approach that leverages a Graph Neural Network (GNN) to encode and communicate relevant viewpoint information to the mobile robot. During navigation, the robot is guided by a model that we train through imitation learning to approximate optimal motion primitives, thereby predicting the effective cost-to-go (to the target). In our experiments, we first demonstrate generalizability to previously unseen environments with various sensor layouts. Our results show that by using communication between the sensors and the robot, we achieve up to $2.0\times$ improvement in SPL (Success weighted by Path Length) when compared to the communication-free baseline. This is done without requiring a global map, positioning data, nor pre-calibration of the sensor network. Second, we perform a zero-shot transfer of our model from simulation to the real world. To this end, we train a `translator' model that translates between {latent encodings of} real and simulated images so that the navigation policy (which is trained entirely in simulation) can be used directly on the real robot, without additional fine-tuning. Physical experiments demonstrate the feasibility of our approach in various cluttered environments.
翻译:我们考虑了在一个拥有视觉传感器的未知环境中将移动机器人引向目标的问题,在这种环境中,机器人和传感器都无法获取全球定位系统信息,只能使用第一人看图像。虽然以前在传感器网络上进行的导航工作使用了明确的绘图和规划技术,而且常常得到外部定位系统的帮助,但我们提议了一种基于愿景的学习方法,利用图形神经网络(GNN)将相关观点信息编码和传送到移动机器人。在导航过程中,机器人以一个模型为指导,我们通过模拟学习来训练如何接近最佳运动原始,从而预测有效的成本到目标。在我们的实验中,我们首先以各种传感器布局向先前看不见的环境展示了通用性。我们的结果显示,通过使用传感器和机器人之间的通信,我们实现了在与无通信基线相比,SPL(由路径长度加权加权的超高)的改进,达到2.0美元。这不需要全球地图、定位数据或传感器网络的预校准前校准,从而预测有效的成本(到目标目标 ) 。在我们的实验中,我们首先展示了对真实的模拟环境进行一个从真实的模拟到我们所使用的模拟的模拟。