Is a manipulator on a legged robot a liability or an asset for locomotion? Prior works mainly designed specific controllers to account for the added payload and inertia from a manipulator. In contrast, biological systems typically benefit from additional limbs, which can simplify postural control. For instance, cats use their tails to enhance the stability of their bodies and prevent falls under disturbances. In this work, we show that a manipulator can be an important asset for maintaining balance during locomotion. To do so, we train a sensorimotor policy using deep reinforcement learning to create a synergy between the robot's limbs. This policy enables the robot to maintain stability despite large disturbances. However, learning such a controller can be quite challenging. To account for these challenges, we propose a stage-wise training procedure to learn complex behaviors. Our proposed method decomposes this complex task into three stages and then incrementally learns these tasks to arrive at a single policy capable of solving the final control task, achieving a success rate up to 2.35 times higher than baselines in simulation. We deploy our learned policy in the real world and show stability during locomotion under strong disturbances.
翻译:在机器人的运动中,机械手臂是负担还是资产?以前的研究主要设计特定的控制器来考虑来自机械手臂的额外负载和惯性。相比之下,生物系统通常受益于额外的肢体,这可以简化姿态控制。例如,猫使用它们的尾巴来增强身体的稳定性并防止在干扰下跌倒。在这项工作中,我们展示了在运动过程中机械手臂可以是保持平衡的重要资产。为此,我们使用深度强化学习训练感觉运动策略来创建机器人四肢之间的协同作用。该策略使机器人在大干扰下保持稳定。然而,学习这样的控制器可以是相当具有挑战性的。为了解决这些挑战,我们提出了一个分阶段训练程序来学习复杂的行为。我们提出的方法将这个复杂的任务分解成三个阶段,然后逐步学习这些任务,从而实现一个能够解决最终控制任务的单一策略,在仿真中实现了高达基线的2.35倍成功率。我们将学习到的策略部署在现实世界中,显示在强烈干扰下的运动稳定性。