Learning natural locomotion behaviors for humanoid robots using human knowledge - Zhuanzhi Papers


February 11, 2021

Learning natural locomotion behaviors for humanoid robots using human knowledge


Chuanyu Yang,Kai Yuan,Shuai Heng,Taku Komura,Zhibin Li

from arxiv, university policy

This paper presents a new learning framework that leverages knowledge from imitation learning, deep reinforcement learning, and control theory to achieve human-style locomotion that is natural, dynamic, and robust for humanoids. We propose novel approaches to introduce human bias, i.e., motion capture data and a special Multi-Expert network structure. We use the Multi-Expert network structure to smoothly blend behavioral features, and an augmented reward design for the task and imitation rewards. Our reward design is composable, tunable, and explainable because it uses fundamental concepts from conventional humanoid control. We rigorously validated and benchmarked the learning framework, which consistently produced robust locomotion behaviors in various test scenarios. Further, we demonstrate the capability of learning robust and versatile policies in the presence of disturbances, such as terrain irregularities and external pushes.
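The two ideas named in the abstract, blending several experts and composing a reward from task and imitation terms, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the function names, the softmax gating, the choice to blend expert outputs (the abstract does not say whether blending acts on outputs or on network parameters), the exponential reward kernel, and the default weights are all hypothetical.

```python
import math


def softmax(scores):
    """Turn raw gating scores into positive weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def blend_experts(gate_scores, expert_outputs):
    """Blend expert outputs as a weighted average (hypothetical blending rule).

    gate_scores: one raw score per expert, e.g. from a gating network.
    expert_outputs: one action vector per expert, all of equal length.
    """
    weights = softmax(gate_scores)
    dim = len(expert_outputs[0])
    return [
        sum(w * out[i] for w, out in zip(weights, expert_outputs))
        for i in range(dim)
    ]


def rbf(error, scale):
    """Bounded reward term: 1 at zero error, decaying toward 0 as error grows."""
    return math.exp(-scale * error * error)


def total_reward(task_errors, imitation_errors,
                 w_task=0.5, w_imitation=0.5, scale=1.0):
    """Composable reward: weighted sum of averaged task and imitation terms.

    The weights and the kernel scale are the tunable parts; each error is a
    named physical quantity, which is what keeps the terms explainable.
    """
    task = sum(rbf(e, scale) for e in task_errors) / len(task_errors)
    imitation = sum(rbf(e, scale) for e in imitation_errors) / len(imitation_errors)
    return w_task * task + w_imitation * imitation
```

With equal gating scores, `blend_experts([0.0, 0.0], [[1.0, 0.0], [0.0, 1.0]])` returns the midpoint of the two expert outputs, and the blend moves continuously toward one expert as its score grows, which is one simple way to get smooth transitions between behaviors. Because the reward is a plain weighted sum of bounded terms, individual terms can be added, removed, or re-weighted without touching the others.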



Related content

Robustness

[Stanford University course] CS 330: Deep Multi-Task and Meta Learning, 2021

Zhuanzhi Member Service

110+ reads · March 2, 2022

[A beginner's introduction to machine learning, 443-page PDF] Machine Learning For Dummies, 2nd Edition

Zhuanzhi Member Service

71+ reads · January 26, 2021

Causal Graphs, 52-slide PPT

Zhuanzhi Member Service

253+ reads · April 19, 2020

[University of Edinburgh] Latest 2020 survey paper on meta-learning, 23-page PDF with 289 references

Zhuanzhi Member Service

225+ reads · April 17, 2020

Deep reinforcement learning policy gradient tutorial, 53-slide PPT

Zhuanzhi Member Service

184+ reads · February 1, 2020

[Stanford University] Gradient Surgery for Multi-Task Learning

Zhuanzhi Member Service

47+ reads · January 23, 2020

[Reinforcement learning resource collection] Awesome Reinforcement Learning

Zhuanzhi Member Service

97+ reads · December 23, 2019

[MIT course] MIT 6.S094: Deep Learning for Self-Driving Cars

Zhuanzhi Member Service

52+ reads · November 1, 2019

Stabilizing Transformers for Reinforcement Learning

Zhuanzhi Member Service

60+ reads · October 17, 2019

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Zhuanzhi Member Service

59+ reads · October 17, 2019

Related news

Three reinforcement learning papers on avoiding forgetting and more

CreateAMind

20+ reads · May 24, 2019

Hierarchically Structured Meta-learning

CreateAMind

27+ reads · May 22, 2019

Transferring Knowledge across Learning Processes

CreateAMind

29+ reads · May 18, 2019

Selected excerpts from 19 ICML 2019 papers

Zhuanzhi

28+ reads · April 28, 2019

Unsupervised Meta-Learning for reinforcement learning

CreateAMind

18+ reads · January 7, 2019

Unsupervised Learning via Meta-Learning

CreateAMind

43+ reads · January 3, 2019

Meta learning in 2017: MAML, SNAIL

CreateAMind

11+ reads · January 2, 2019

The RL canon

CreateAMind

5+ reads · December 28, 2018

spinningup.openai: a complete set of reinforcement learning resources

CreateAMind

6+ reads · December 17, 2018

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+ reads · May 25, 2018

Related papers

Approximate Robust NMPC using Reinforcement Learning

arXiv

0+ reads · April 6, 2021

Towards Lifelong Learning of End-to-end ASR

arXiv

0+ reads · April 4, 2021

Regularization Shortcomings for Continual Learning

arXiv

1+ reads · April 4, 2021

Scaffolded Learning of In-place Trotting Gait for a Quadruped Robot with Bayesian Optimization

arXiv

0+ reads · April 3, 2021

Learning to Filter: Siamese Relation Network for Robust Tracking

arXiv

0+ reads · April 2, 2021

Learning to Walk via Deep Reinforcement Learning

arXiv

7+ reads · December 26, 2018

Visual Reinforcement Learning with Imagined Goals

arXiv

8+ reads · July 12, 2018

Unsupervised Meta-Learning for Reinforcement Learning

arXiv

8+ reads · June 12, 2018

Do deep reinforcement learning agents model intentions?

arXiv

5+ reads · May 21, 2018

Towards a Continuous Knowledge Learning Engine for Chatbots

arXiv

6+ reads · February 24, 2018
