Modern quadrupeds are skillful in traversing or even sprinting on uneven terrains in a remote uncontrolled environment. However, survival in the wild requires not only maneuverability, but also the ability to handle potential critical hardware failures. How to grant such ability to quadrupeds is rarely investigated. In this paper, we propose a novel methodology to train and test hardware fault-tolerant controllers for quadruped locomotion, both in the simulation and physical world. We adopt the teacher-student reinforcement learning framework to train the controller with close-to-reality joint-locking failure in the simulation, which can be zero-shot transferred to the physical robot without any fine-tuning. Extensive experiments show that our fault-tolerant controller can efficiently lead a quadruped stably when it faces joint failure during locomotion.
翻译:现代的四分法在偏远不受控制的环境中,在不均匀的地形上进行穿梭甚至冲刺的技巧。 然而,野生生存不仅需要机动性,还需要有能力处理潜在的关键硬件故障。 如何给予这种能力进行四分法的探索很少。 在本文中,我们提出一种新的方法来培训和测试硬件故障控制器,以便在模拟和物理世界中进行四分法的移动。 我们采用了师生强化学习框架,在模拟中训练控制器近乎现实的合锁故障,这种故障可以在不作任何微调的情况下被零弹转移到物理机器人。 广泛的实验表明,当我们的容错控制器在移动过程中面临联合故障时,它可以有效地导致四分法的三分法。</s>