We need intelligent robots for mobile construction, the process of navigating in an environment and modifying its structure according to a geometric design. In this task, a major robot vision and learning challenge is how to exactly achieve the design without GPS, due to the difficulty caused by the bi-directional coupling of accurate robot localization and navigation together with strategic environment manipulation. However, many existing robot vision and learning tasks such as visual navigation and robot manipulation address only one of these two coupled aspects. To stimulate the pursuit of a generic and adaptive solution, we reasonably simplify mobile construction as a partially observable Markov decision process (POMDP) in 1/2/3D grid worlds and benchmark the performance of a handcrafted policy with basic localization and planning, and state-of-the-art deep reinforcement learning (RL) methods. Our extensive experiments show that the coupling makes this problem very challenging for those methods, and emphasize the need for novel task-specific solutions.
翻译:我们需要智能机器人进行移动建设,这是在环境中航行和根据几何设计修改其结构的过程。在这项任务中,一个主要的机器人愿景和学习挑战是如何在没有全球定位系统的情况下准确地实现设计,因为精确机器人定位和导航与战略环境操纵的双向结合造成了困难。然而,许多现有的机器人愿景和学习任务,如视觉导航和机器人操纵,只涉及这两个结合的方面之一。为了刺激寻求一种通用的适应性解决方案,我们合理地简化了移动建设,将其作为一个部分可见的马尔科夫决策程序(POMDP ), 在 1/2/3D 电网世界中,我们以基本的本地化和规划以及最先进的强化学习(RL)方法来衡量手工艺化政策的业绩。我们的广泛实验显示,这种结合使这一问题对这些方法非常具有挑战性,并强调需要新的特定任务解决方案。