Robotic locomotion is often approached with the goal of maximizing robustness and reactivity by increasing motion control frequency. We challenge this intuitive notion by demonstrating robust and dynamic locomotion with a learned motion controller executing at as low as 8 Hz on a real ANYmal C quadruped. The robot is able to robustly and repeatably achieve a high heading velocity of 1.5 m/s, traverse uneven terrain, and resist unexpected external perturbations. We further present a comparative analysis of deep reinforcement learning (RL) based motion control policies trained and executed at frequencies ranging from 5 Hz to 200 Hz. We show that low-frequency policies are less sensitive to actuation latencies and variations in system dynamics. This is to the extent that a successful sim-to-real transfer can be performed even without any dynamics randomization or actuation modeling. We support this claim through a set of rigorous empirical evaluations. Moreover, to assist reproducibility, we provide the training and deployment code along with an extended analysis at https://ori-drs.github.io/lfmc/.
翻译:机器人的动能移动往往以增加运动控制频率来达到最大限度的稳健性和回旋性为目标。我们通过展示强健和动态的动能动能来挑战这一直觉概念,一个有学识的动作控制器在真正Anymal C四分法上执行低至8赫兹的动作控制器。机器人能够稳健和反复地达到1.5米/秒的高航向速度,横跨不均的地形,并抵制意外的外部扰动。我们进一步对在5赫兹至200赫兹的频率上培训和执行的深强化学习(RL)运动控制政策进行了比较分析。我们表明,低频政策对系统动态的动作迟缓和变化不太敏感。这就是说,即使没有任何动态随机化或动作模型,也能够成功地进行模拟到真实的转移。我们通过一套严格的实证评估来支持这一主张。此外,为了帮助重新提高透明度,我们还在https://ori-drrs.github.io/lfmc/上提供培训和部署代码,同时提供扩展分析。