This work developed a kernel-based residual learning framework for quadrupedal robotic locomotion. Initially, a kernel neural network is trained with data collected from an MPC controller. Alongside a frozen kernel network, a residual controller network is trained via reinforcement learning to acquire generalized locomotion skills and resilience against external perturbations. With this proposed framework, a robust quadrupedal locomotion controller is learned with high sample efficiency and controllability, providing omnidirectional locomotion at continuous velocities. Its versatility and robustness are validated on unseen terrains that the expert MPC controller fails to traverse. Furthermore, the learned kernel can produce a range of functional locomotion behaviors and can generalize to unseen gaits.
翻译:这项工作为四重机器人移动开发了以内核为基础的剩余学习框架,最初,内核神经网络用从MPC控制器收集的数据进行了培训。除了冷冻内核网络外,残余控制网络通过强化学习接受培训,以获得普遍的移动技能和抵御外部扰动的复原力。利用这一拟议框架,以高采样效率和可控性学习了强大的四重移动控制控制器,以连续速度提供全线移动。其多功能性和稳健性在专家MPC控制器无法穿越的无形地形上得到了验证。此外,学习的内核可以产生一系列功能移动行为,并可以概括为看不见的格子。