加速机器人学习联系人 - Rich 程序:课程学习研究 (Accelerating Robot Learning of Contact-Rich Manipulations: A Curriculum Learning Study) - 专知论文

会员服务 ·

0

学成 · 机器人 · Automator · INTERACT · 讲稿 ·

2022 年 4 月 28 日

Accelerating Robot Learning of Contact-Rich Manipulations: A Curriculum Learning Study

翻译：加速机器人学习联系人 - Rich 程序:课程学习研究

Cristian C. Beltran-Hernandez,Damien Petit,Ixchel G. Ramirez-Alpizar,Kensuke Harada

from arxiv, 12 pages, 11 figures, 4 tables, in journal review. Corresponding author: Cristian C. Beltran-Hernandez

The Reinforcement Learning (RL) paradigm has been an essential tool for automating robotic tasks. Despite the advances in RL, it is still not widely adopted in the industry due to the need for an expensive large amount of robot interaction with its environment. Curriculum Learning (CL) has been proposed to expedite learning. However, most research works have been only evaluated in simulated environments, from video games to robotic toy tasks. This paper presents a study for accelerating robot learning of contact-rich manipulation tasks based on Curriculum Learning combined with Domain Randomization (DR). We tackle complex industrial assembly tasks with position-controlled robots, such as insertion tasks. We compare different curricula designs and sampling approaches for DR. Based on this study, we propose a method that significantly outperforms previous work, which uses DR only (No CL is used), with less than a fifth of the training time (samples). Results also show that even when training only in simulation with toy tasks, our method can learn policies that can be transferred to the real-world robot. The learned policies achieved success rates of up to 86\% on real-world complex industrial insertion tasks (with tolerances of $\pm 0.01~mm$) not seen during the training.

翻译：强化学习模式(RL)是机器人任务自动化的基本工具。尽管在RL方面有所进步,但由于需要大量机器人与环境进行昂贵的机器人互动,该模式在工业中仍没有被广泛采用。课程学习(CL)建议加快学习。然而,大多数研究工作仅在模拟环境中进行了评价,从视频游戏到机器人玩具任务。本文介绍了根据课程学习(DR)和Domain随机化(DR)相结合,加速机器人学习接触丰富的操纵任务的研究。我们处理复杂的工业组装任务,使用定位控制机器人,例如插入任务。我们比较了不同的课程设计和DR抽样方法。根据这项研究,我们提出了一种方法,大大超过以往的工作,只使用DR(CL),培训时间不到五分之一(样本)。结果还表明,即使只进行模拟 Toy任务的培训,我们的方法也可以学习可以转移到真实世界机器人的政策。在现实世界的复杂工业插入任务中,我们所学过的政策取得了86-美元的成功率,在现实世界的复杂工业插入任务中,没有看到0.01美元的容忍度。

0

相关内容

【伯克利-Pieter Abbeel】深度强化学习基础，附slides与视频

专知会员服务

29+阅读 · 2021年8月26日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

开源书：PyTorch深度学习起步

开源书：PyTorch深度学习起步

专知会员服务

51+阅读 · 2019年10月11日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

miR-21/PDCD4/NF-κB通路在血小板抗菌促糖尿病溃疡愈合中的作用及分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

Mipu1促血管新生的机制研究：对VEGF-VASH1/SVBP负反馈通路的转录调节

国家自然科学基金

0+阅读 · 2014年12月31日

氮、钾肥料对再生水灌溉土壤重金属运移特性的影响及调控机制

国家自然科学基金

0+阅读 · 2014年12月31日

基于双指标多等级的土壤重金属生态风险评价方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

MicRNA107调控BACE1mRNA基因与阿尔茨海默病内质网应激病理机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

碳酸盐岩区硫化物尾矿中重金属赋存状态研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于VHF/UHF双频地基雷达的土壤与植被参数建模与测试方法研究

国家自然科学基金

0+阅读 · 2011年12月31日

鄱阳湖湿地香根草富集重金属的拉曼光谱快速检测方法

国家自然科学基金

0+阅读 · 2011年12月31日

α65293;硫辛酸防治2型糖尿病并发症及其线粒体修复机制研究

国家自然科学基金

0+阅读 · 2008年12月31日

Learning from Uncurated Regular Expressions

Arxiv

0+阅读 · 2022年6月14日

Analysis of Randomization Effects on Sim2Real Transfer in Reinforcement Learning for Robotic Manipulation Tasks

Arxiv

0+阅读 · 2022年6月13日

Learning Distributed and Fair Policies for Network Load Balancing as Markov Potential Game

Arxiv

0+阅读 · 2022年6月13日

Deep Reinforcement Learning with Weighted Q-Learning

Arxiv

0+阅读 · 2022年6月13日

Reinforcement Learning for Vision-based Object Manipulation with Non-parametric Policy and Action Primitives

Arxiv

0+阅读 · 2022年6月12日

Accelerating Score-based Generative Models for High-Resolution Image Synthesis

Arxiv

0+阅读 · 2022年6月10日

Mildly Conservative Q-Learning for Offline Reinforcement Learning

Arxiv

0+阅读 · 2022年6月9日

Projected State-action Balancing Weights for Offline Reinforcement Learning

Arxiv

0+阅读 · 2022年6月9日

Curriculum Learning for Reinforcement Learning Domains: A Framework and Survey

Curriculum Learning for Reinforcement Learning Domains: A Framework and Survey

Arxiv

20+阅读 · 2020年3月10日

Event Extraction with Generative Adversarial Imitation Learning

Arxiv

13+阅读 · 2018年4月21日

VIP会员

文章信息

相关主题

相关VIP内容

【伯克利-Pieter Abbeel】深度强化学习基础，附slides与视频

专知会员服务

29+阅读 · 2021年8月26日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

开源书：PyTorch深度学习起步

开源书：PyTorch深度学习起步

专知会员服务

51+阅读 · 2019年10月11日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《美空军条令出版物：战略打击》最新条令

《高能激光武器》22页slides

军事前沿模型

《面向小型无人机或无人飞行器的创新雷达探测与人工智能分类技术》263页

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

相关论文

Learning from Uncurated Regular Expressions

Arxiv

0+阅读 · 2022年6月14日

Analysis of Randomization Effects on Sim2Real Transfer in Reinforcement Learning for Robotic Manipulation Tasks

Arxiv

0+阅读 · 2022年6月13日

Learning Distributed and Fair Policies for Network Load Balancing as Markov Potential Game

Arxiv

0+阅读 · 2022年6月13日

Deep Reinforcement Learning with Weighted Q-Learning

Arxiv

0+阅读 · 2022年6月13日

Reinforcement Learning for Vision-based Object Manipulation with Non-parametric Policy and Action Primitives

Arxiv

0+阅读 · 2022年6月12日

Accelerating Score-based Generative Models for High-Resolution Image Synthesis

Arxiv

0+阅读 · 2022年6月10日

Mildly Conservative Q-Learning for Offline Reinforcement Learning

Arxiv

0+阅读 · 2022年6月9日

Projected State-action Balancing Weights for Offline Reinforcement Learning

Arxiv

0+阅读 · 2022年6月9日

Curriculum Learning for Reinforcement Learning Domains: A Framework and Survey

Curriculum Learning for Reinforcement Learning Domains: A Framework and Survey

Arxiv

20+阅读 · 2020年3月10日

Event Extraction with Generative Adversarial Imitation Learning

Arxiv

13+阅读 · 2018年4月21日

相关基金

miR-21/PDCD4/NF-κB通路在血小板抗菌促糖尿病溃疡愈合中的作用及分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

Mipu1促血管新生的机制研究：对VEGF-VASH1/SVBP负反馈通路的转录调节

国家自然科学基金

0+阅读 · 2014年12月31日

氮、钾肥料对再生水灌溉土壤重金属运移特性的影响及调控机制

国家自然科学基金

0+阅读 · 2014年12月31日

基于双指标多等级的土壤重金属生态风险评价方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

MicRNA107调控BACE1mRNA基因与阿尔茨海默病内质网应激病理机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

碳酸盐岩区硫化物尾矿中重金属赋存状态研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于VHF/UHF双频地基雷达的土壤与植被参数建模与测试方法研究

国家自然科学基金

0+阅读 · 2011年12月31日

鄱阳湖湿地香根草富集重金属的拉曼光谱快速检测方法

国家自然科学基金

0+阅读 · 2011年12月31日

α65293;硫辛酸防治2型糖尿病并发症及其线粒体修复机制研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员