通过高效深强化学习避免复杂障碍 (Monocular Camera-based Complex Obstacle Avoidance via Efficient Deep Reinforcement Learning)

Deep reinforcement learning has achieved great success in laser-based collision avoidance works because the laser can sense accurate depth information without too much redundant data, which can maintain the robustness of the algorithm when it is migrated from the simulation environment to the real world. However, high-cost laser devices are not only difficult to deploy for a large scale of robots but also demonstrate unsatisfactory robustness towards the complex obstacles, including irregular obstacles, e.g., tables, chairs, and shelves, as well as complex ground and special materials. In this paper, we propose a novel monocular camera-based complex obstacle avoidance framework. Particularly, we innovatively transform the captured RGB images to pseudo-laser measurements for efficient deep reinforcement learning. Compared to the traditional laser measurement captured at a certain height that only contains one-dimensional distance information away from the neighboring obstacles, our proposed pseudo-laser measurement fuses the depth and semantic information of the captured RGB image, which makes our method effective for complex obstacles. We also design a feature extraction guidance module to weight the input pseudo-laser measurement, and the agent has more reasonable attention for the current state, which is conducive to improving the accuracy and efficiency of the obstacle avoidance policy.

翻译：深层强化学习在以激光为基础的避免碰撞工作中取得了巨大成功,因为激光能够感知准确的深度信息,而没有太多的多余数据,从而能够在从模拟环境向现实世界迁移时保持算法的稳健性,然而,高成本激光装置不仅难以用于大规模机器人,而且对复杂的障碍也表现出不尽如人意的稳健性,包括不规则的障碍,例如桌子、椅子和架子,以及复杂的地面材料和特殊材料。在本文件中,我们提出了一个新的单镜相机复杂障碍避免框架。特别是,我们创新地将所捕获的 RGB 图像转换为假激光测量,以便有效地进行深层强化学习。与在某一高度所捕捉的传统激光测量相比,该测量只包含远离近邻障碍的一维距离信息,我们提议的假激光测量将所捕捉到的RGB 图像的深度和语义性信息连接起来,从而使我们的方法对复杂障碍有效。我们还设计了一个特征提取指导模块,以加权输入的假激光测量,而且代理人对当前状态有更合理的关注,这有助于提高避免政策的效率。