Deep reinforcement learning (RL) based controllers for legged robots have demonstrated impressive robustness for walking in different environments across several robot platforms. To enable the application of RL policies to humanoid robots in real-world settings, it is crucial to build a system that can achieve robust walking in any direction, on 2D and 3D terrains, and that is controllable by user commands. In this paper, we tackle this problem by learning a policy to follow a given step sequence. The policy is trained with the help of a set of procedurally generated step sequences (also called footstep plans). We show that simply feeding the upcoming two steps to the policy is sufficient to achieve omnidirectional walking, turning in place, standing, and climbing stairs. Our method employs curriculum learning on the complexity of terrains, and circumvents the need for reference motions or pre-trained weights. We demonstrate the application of our proposed method to learn RL policies for two new robot platforms, HRP5P and JVRC-1, in the MuJoCo simulation environment. The code for training and evaluation is available online.
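The idea of conditioning the policy on the upcoming two steps can be illustrated with a minimal sketch. This is a hypothetical illustration, not the paper's actual implementation: the function name, the (x, y, z, yaw) footstep encoding, and the padding behavior when fewer than two steps remain are all assumptions.

```python
import numpy as np

def build_observation(robot_state, footstep_plan, step_index):
    """Hypothetical sketch: append the next two target footsteps,
    each encoded as (x, y, z, yaw) in the robot's frame, to the
    proprioceptive state vector fed to the policy.

    robot_state:   1-D array of proprioceptive features (assumed).
    footstep_plan: (N, 4) array of footstep targets (assumed encoding).
    step_index:    index of the next step to be taken.
    """
    upcoming = footstep_plan[step_index:step_index + 2]
    # If fewer than two steps remain (e.g. end of plan, or standing),
    # repeat the final step so the observation has a fixed size.
    while len(upcoming) < 2:
        upcoming = np.vstack([upcoming, footstep_plan[-1]])
    return np.concatenate([robot_state, upcoming.flatten()])
```

A fixed-size observation is what makes this convenient: the policy network never sees the full plan, only a two-step window that slides forward as steps are completed, which is also how turning in place and standing can be expressed as degenerate footstep plans.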