In this paper we consider a general task of jumping varying distances and heights for a quadrupedal robot in noisy environments, such as off of uneven terrain and with variable robot dynamics parameters. To accurately jump in such conditions, we propose a framework using deep reinforcement learning to leverage the complex solution of nonlinear trajectory optimization for quadrupedal jumping. While the standalone optimization limits jumping to take-off from flat ground and requires accurate assumption of robot dynamics, our proposed approach improves the robustness to allow jumping off of significantly uneven terrain with variable robot dynamical parameters. Through our method, the quadruped is able to jump distances of up to 1 m and heights of up to 0.4 m, while being robust to environment noise of foot disturbances of up to 0.1 m in height as well as with 5% variability of its body mass and inertia. This behavior is learned through just a few thousand simulated jumps in PyBullet, and we perform a sim-to-sim transfer to Gazebo. Video results can be found at https://youtu.be/jkzvL2o3g-s.
翻译:在本文中,我们考虑在噪音环境中跳出不同距离和高度的四重机器人的一般任务,例如从不均匀的地形和可变的机器人动态参数中跳出四重机器人。为了精确地跳出这样的条件。为了准确跳出,我们提议了一个框架,利用深度强化学习来利用非线性轨道优化的复杂解决方案来进行四重跳动。虽然独立优化限制从平地跳出以从平地跳出,并需要精确地假设机器人动态,但我们提议的方法提高了强度,允许以可变机器人动态参数从极不均匀的地形跳出。通过我们的方法,四重可以跳到1米和0.4米的距离,同时对高达0.1米的脚动扰动噪音及其身体质量和惯性5%的变异性保持稳健。这一行为仅通过在PyBullet的几千个模拟跳跃来学习,我们还进行了向Gazebo的Simto-sim转移。视频结果可见https://youtu.be/jkvL2g-s。