Jumping is essential for legged robots to traverse through difficult terrains. In this work, we propose a hierarchical framework that combines optimal control and reinforcement learning to learn continuous jumping motions for quadrupedal robots. The core of our framework is a stance controller, which combines a manually designed acceleration controller with a learned residual policy. As the acceleration controller warm starts policy for efficient training, the trained policy overcomes the limitation of the acceleration controller and improves the jumping stability. In addition, a low-level whole-body controller converts the body pose command from the stance controller to motor commands. After training in simulation, our framework can be deployed directly to the real robot, and perform versatile, continuous jumping motions, including omni-directional jumps at up to 50cm high, 60cm forward, and jump-turning at up to 90 degrees. Please visit our website for more results: https://sites.google.com/view/learning-to-jump.
翻译:跳跃对于四足机器人穿越困难的地形至关重要。在本研究中,我们提出了一种层次框架,将最优控制和强化学习相结合,为四足机器人学习连续跳跃动作。我们框架的核心是姿态控制器,将手动设计的加速控制器与学习的残差策略相结合。加速控制器通过热启动策略进行有效的训练,训练后的策略克服了加速控制器的局限性,提高了跳跃稳定性。此外,低级全身控制器将姿势控制器的身体姿态命令转换为电机命令。在模拟中训练后,我们的框架可以直接部署到真实机器人上,执行多功能连续跳跃动作,包括高达50厘米,60厘米向前的全向跳跃和高达90度的跳跃转向。请访问我们的网站以获取更多结果: https://sites.google.com/view/learning-to-jump 。