Model-free reinforcement learning (RL) for legged locomotion commonly relies on a physics simulator that can accurately predict the behaviors of every degree of freedom of the robot. In contrast, approximate reduced-order models are often sufficient for many model-based control strategies. In this work we explore how RL can be effectively used with a centroidal model to generate robust control policies for quadrupedal locomotion. Advantages over RL with a full-order model include a simple reward structure, reduced computational costs, and robust sim-to-real transfer. We further show the potential of the method by demonstrating stepping-stone locomotion, two-legged in-place balance, balance beam locomotion, and sim-to-real transfer without further adaptations. Additional Results: https://www.pair.toronto.edu/glide-quadruped/.
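To give a concrete sense of the reduced-order model class the abstract refers to, a centroidal model tracks only the robot's center of mass (CoM) and angular momentum, driven by contact forces, instead of every joint degree of freedom. The sketch below is a minimal, hypothetical illustration (not the paper's implementation); all function and variable names are assumptions:

```python
import numpy as np

GRAVITY = np.array([0.0, 0.0, -9.81])  # world-frame gravity (m/s^2)

def centroidal_step(m, c, v, L, contacts, forces, dt):
    """One explicit-Euler step of simplified centroidal dynamics.

    m        : total robot mass (kg)
    c, v     : CoM position and velocity, shape (3,)
    L        : angular momentum about the CoM, shape (3,)
    contacts : list of contact point positions p_i, each shape (3,)
    forces   : list of contact forces f_i, each shape (3,)
    dt       : integration timestep (s)
    Returns updated (c, v, L).
    """
    # Linear dynamics: m * c_ddot = sum_i f_i + m * g
    f_total = sum(forces, np.zeros(3))
    a = f_total / m + GRAVITY
    # Angular dynamics: L_dot = sum_i (p_i - c) x f_i
    L_dot = sum((np.cross(p - c, f) for p, f in zip(contacts, forces)),
                np.zeros(3))
    return c + dt * v, v + dt * a, L + dt * L_dot
```

Because the state is only 9-dimensional here (plus contact locations), rollouts of such a model are far cheaper than full rigid-body simulation, which is one source of the reduced computational cost the abstract mentions.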