通过Sim-to-Real加强学习进行动态双双双双双赛跑步器 (Dynamic Bipedal Maneuvers through Sim-to-Real Reinforcement Learning) - 专知论文

会员服务 ·

0

Learning · Legged Robot · 稳健性 · 控制器 · 强化学习 ·

2022 年 7 月 16 日

Dynamic Bipedal Maneuvers through Sim-to-Real Reinforcement Learning

翻译：通过Sim-to-Real加强学习进行动态双双双双双赛跑步器

Fangzhou Yu,Ryan Batke,Jeremy Dao,Jonathan Hurst,Kevin Green,Alan Fern

from arxiv, In review for the 2022 IEEE-RAS International Conference on Humanoid Robots. 8 pages, 8 figures, 3 tables

For legged robots to match the athletic capabilities of humans and animals, they must not only produce robust periodic walking and running, but also seamlessly switch between nominal locomotion gaits and more specialized transient maneuvers. Despite recent advancements in controls of bipedal robots, there has been little focus on producing highly dynamic behaviors. Recent work utilizing reinforcement learning to produce policies for control of legged robots have demonstrated success in producing robust walking behaviors. However, these learned policies have difficulty expressing a multitude of different behaviors on a single network. Inspired by conventional optimization-based control techniques for legged robots, this work applies a recurrent policy to execute four-step, 90 degree turns trained using reference data generated from optimized single rigid body model trajectories. We present a novel training framework using epilogue terminal rewards for learning specific behaviors from pre-computed trajectory data and demonstrate a successful transfer to hardware on the bipedal robot Cassie.

翻译：脚机械人要与人类和动物的运动能力相匹配,它们不仅必须产生稳健的周期性步行和跑步,而且必须无缝地在名义移动动作和更加专业化的瞬间动作之间转换。尽管最近两肢机器人的控制有所进展,但很少注重产生高度动态的行为。最近利用强化学习来制定控制脚机械人的政策的工作在产生稳健的行走行为方面取得了成功。然而,这些学习的政策很难在单一网络上表达多种不同的行为。在对脚机械人的常规优化控制技术的启发下,这项工作运用一项经常性政策,使用优化的单体型硬体模型轨迹生成的参考数据,执行四步90度的旋转训练。我们提出了一个新的培训框架,利用上层终端奖励从预编造轨迹数据中学习特定行为,并展示两肢机器人机器人的硬件成功转移。

0

相关内容

Learning

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

2019年自然语言处理NLP亮点总结，29页pdf，NLP Year in Review — 2019 NLP highlights for the year 2019.

2019年自然语言处理NLP亮点总结，29页pdf，NLP Year in Review — 2019 NLP highlights for the year 2019.

专知会员服务

69+阅读 · 2020年1月2日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

中国图象图形学学会CSIG

0+阅读 · 2021年11月15日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

AlGaN极化场调控对内量子效率的影响

国家自然科学基金

1+阅读 · 2016年12月31日

手性畴壁对铁电薄膜电学性能调控的相场研究

国家自然科学基金

0+阅读 · 2015年12月31日

Rce1酶切Bex2调控胶质瘤生长的研究

国家自然科学基金

0+阅读 · 2013年12月31日

面向虚拟化云服务器的智能高速缓存管理

国家自然科学基金

0+阅读 · 2012年12月31日

不同垒层厚度并掺杂的GaNAs基短周期超晶格太阳能电池与MBE生长研究

国家自然科学基金

0+阅读 · 2012年12月31日

ZnSe:Mn/ZnSe/PMMA纳米复合薄膜在脉冲强磁场下的物性研究

国家自然科学基金

0+阅读 · 2012年12月31日

高k材料MOSFET沟道电子迁移率的增强研究

国家自然科学基金

0+阅读 · 2012年12月31日

镧系硅氧氮化物荧光材料的晶体结构和发光特性

国家自然科学基金

0+阅读 · 2012年12月31日

椭圆曲线密码学算法研究

国家自然科学基金

1+阅读 · 2009年12月31日

UGT基因簇进化及调控研究

国家自然科学基金

0+阅读 · 2009年12月31日

DashBot: Insight-Driven Dashboard Generation Based on Deep Reinforcement Learning

Arxiv

0+阅读 · 2022年9月13日

Meta-Reinforcement Learning via Language Instructions

Arxiv

1+阅读 · 2022年9月11日

Secure Shapley Value for Cross-Silo Federated Learning

Arxiv

0+阅读 · 2022年9月11日

Federated Reinforcement Learning for Collective Navigation of Robotic Swarms

Arxiv

0+阅读 · 2022年9月11日

Towards Understanding the Overfitting Phenomenon of Deep Click-Through Rate Prediction Models

Arxiv

0+阅读 · 2022年9月4日

Learning Latent Representations to Influence Multi-Agent Interaction

Arxiv

11+阅读 · 2020年11月12日

Adversarial Multimodal Representation Learning for Click-Through Rate Prediction

Arxiv

23+阅读 · 2020年3月7日

Q-value Path Decomposition for Deep Multiagent Reinforcement Learning

Q-value Path Decomposition for Deep Multiagent Reinforcement Learning

Arxiv

26+阅读 · 2020年2月10日

Event Extraction with Generative Adversarial Imitation Learning

Arxiv

13+阅读 · 2018年4月21日

Deep Reinforcement Learning for List-wise Recommendations

Arxiv

13+阅读 · 2018年1月5日

VIP会员

文章信息

相关主题

相关VIP内容

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

2019年自然语言处理NLP亮点总结，29页pdf，NLP Year in Review — 2019 NLP highlights for the year 2019.

2019年自然语言处理NLP亮点总结，29页pdf，NLP Year in Review — 2019 NLP highlights for the year 2019.

专知会员服务

69+阅读 · 2020年1月2日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

Apium加入红猫未来计划：推进战术无人机集群自主技术

《美陆军训练条令：反小型无人机系统（C-sUAS）炮术项目》2025最新80页

无人机如何改变战争？未来战场

《超大城市作战艺术》52页报告

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

中国图象图形学学会CSIG

0+阅读 · 2021年11月15日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

相关论文

DashBot: Insight-Driven Dashboard Generation Based on Deep Reinforcement Learning

Arxiv

0+阅读 · 2022年9月13日

Meta-Reinforcement Learning via Language Instructions

Arxiv

1+阅读 · 2022年9月11日

Secure Shapley Value for Cross-Silo Federated Learning

Arxiv

0+阅读 · 2022年9月11日

Federated Reinforcement Learning for Collective Navigation of Robotic Swarms

Arxiv

0+阅读 · 2022年9月11日

Towards Understanding the Overfitting Phenomenon of Deep Click-Through Rate Prediction Models

Arxiv

0+阅读 · 2022年9月4日

Learning Latent Representations to Influence Multi-Agent Interaction

Arxiv

11+阅读 · 2020年11月12日

Adversarial Multimodal Representation Learning for Click-Through Rate Prediction

Arxiv

23+阅读 · 2020年3月7日

Q-value Path Decomposition for Deep Multiagent Reinforcement Learning

Q-value Path Decomposition for Deep Multiagent Reinforcement Learning

Arxiv

26+阅读 · 2020年2月10日

Event Extraction with Generative Adversarial Imitation Learning

Arxiv

13+阅读 · 2018年4月21日

Deep Reinforcement Learning for List-wise Recommendations

Arxiv

13+阅读 · 2018年1月5日

相关基金

AlGaN极化场调控对内量子效率的影响

国家自然科学基金

1+阅读 · 2016年12月31日

手性畴壁对铁电薄膜电学性能调控的相场研究

国家自然科学基金

0+阅读 · 2015年12月31日

Rce1酶切Bex2调控胶质瘤生长的研究

国家自然科学基金

0+阅读 · 2013年12月31日

面向虚拟化云服务器的智能高速缓存管理

国家自然科学基金

0+阅读 · 2012年12月31日

不同垒层厚度并掺杂的GaNAs基短周期超晶格太阳能电池与MBE生长研究

国家自然科学基金

0+阅读 · 2012年12月31日

ZnSe:Mn/ZnSe/PMMA纳米复合薄膜在脉冲强磁场下的物性研究

国家自然科学基金

0+阅读 · 2012年12月31日

高k材料MOSFET沟道电子迁移率的增强研究

国家自然科学基金

0+阅读 · 2012年12月31日

镧系硅氧氮化物荧光材料的晶体结构和发光特性

国家自然科学基金

0+阅读 · 2012年12月31日

椭圆曲线密码学算法研究

国家自然科学基金

1+阅读 · 2009年12月31日

UGT基因簇进化及调控研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员