利用非参数政策和行动先行措施,加强基于愿景的物体操纵的强化学习 (Reinforcement Learning for Vision-based Object Manipulation with Non-parametric Policy and Action Primitives) - 专知论文

会员服务 ·

0

Learning · 强化学习 · Performer · state-of-the-art · 翻转 ·

2022 年 6 月 12 日

Reinforcement Learning for Vision-based Object Manipulation with Non-parametric Policy and Action Primitives

翻译：利用非参数政策和行动先行措施,加强基于愿景的物体操纵的强化学习

Dongwon Son,Myungsin Kim,Jaecheol Sim,Wonsik Shin

The object manipulation is a crucial ability for a service robot, but it is hard to solve with reinforcement learning due to some reasons such as sample efficiency. In this paper, to tackle this object manipulation, we propose a novel framework, AP-NPQL (Non-Parametric Q Learning with Action Primitives), that can efficiently solve the object manipulation with visual input and sparse reward, by utilizing a non-parametric policy for reinforcement learning and appropriate behavior prior for the object manipulation. We evaluate the efficiency and the performance of the proposed AP-NPQL for four object manipulation tasks on simulation (pushing plate, stacking box, flipping cup, and picking and placing plate), and it turns out that our AP-NPQL outperforms the state-of-the-art algorithms based on parametric policy and behavior prior in terms of learning time and task success rate. We also successfully transfer and validate the learned policy of the plate pick-and-place task to the real robot in a sim-to-real manner.

翻译：物体操纵是服务机器人的关键能力,但由于样本效率等一些原因,很难通过强化学习来解决,但由于一些原因,例如样本效率等,很难解决物体操纵问题。在本文中,为了解决这种物体操纵问题,我们提出了一个新的框架,即AP-NPQL(非光学Q学习与动作精华),它可以通过视觉输入和微薄的奖励,利用非光学政策来有效解决物体操纵问题,在物体操纵之前,利用非光学政策来强化学习和适当行为。我们评估了拟议AP-NPQL在模拟(推动板、堆叠盒、翻转杯、选和放置板块)的四个物体操纵任务(推动盘、堆叠、翻转杯、以及选和放置板块)方面的效率和表现。事实证明,我们的AP-NPQL在学习时间和任务成功率方面,根据参数政策和先前的行为,超越了最先进的算法。我们还成功地以模拟方式向真正的机器人转移和验证了所学过的板选任务的政策。

0

相关内容

Learning

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

专知会员服务

23+阅读 · 2022年3月19日

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

水溶性非环状分子容器的靶向药物传递

国家自然科学基金

0+阅读 · 2014年12月31日

基于SURE/PURE准则的图像盲反卷积算法研究

国家自然科学基金

3+阅读 · 2013年12月31日

分享经典信息的量子秘密共享研究

国家自然科学基金

0+阅读 · 2013年12月31日

氯氧化铋量子点薄膜的制备及其光催化机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于小波稀疏表示的压缩感知数字全息层析技术研究

国家自然科学基金

0+阅读 · 2013年12月31日

根际促生菌Bacillus amyloliquefaciens SQR9与植物根系分泌物互作的分子机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

BMPs调控Mef2C-ECR5-SOST转录轴的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

云计算环境下数据中心的power capping关键问题研究

国家自然科学基金

0+阅读 · 2012年12月31日

核基因编码的叶绿体蛋白转运调控机理

国家自然科学基金

0+阅读 · 2011年12月31日

14-3-3蛋白与肾脏尿素转运

国家自然科学基金

0+阅读 · 2009年12月31日

Learning Prior Feature and Attention Enhanced Image Inpainting

Arxiv

0+阅读 · 2022年8月3日

Learning Skill-based Industrial Robot Tasks with User Priors

Arxiv

0+阅读 · 2022年8月2日

Graph-based Reinforcement Learning meets Mixed Integer Programs: An application to 3D robot assembly discovery

Arxiv

0+阅读 · 2022年8月2日

Relay Hindsight Experience Replay: Continual Reinforcement Learning for Robot Manipulation Tasks with Sparse Rewards

Relay Hindsight Experience Replay: Continual Reinforcement Learning for Robot Manipulation Tasks with Sparse Rewards

Arxiv

0+阅读 · 2022年8月1日

Bayesian Active Learning for Sim-to-Real Robotic Perception

Arxiv

0+阅读 · 2022年8月1日

Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning

Arxiv

0+阅读 · 2022年7月29日

Multi-Phase Multi-Objective Dexterous Manipulation with Adaptive Hierarchical Curriculum

Arxiv

0+阅读 · 2022年7月29日

MetaCURE: Meta Reinforcement Learning with Empowerment-Driven Exploration

Arxiv

12+阅读 · 2021年2月7日

Q-value Path Decomposition for Deep Multiagent Reinforcement Learning

Q-value Path Decomposition for Deep Multiagent Reinforcement Learning

Arxiv

26+阅读 · 2020年2月10日

A Multi-Objective Deep Reinforcement Learning Framework

A Multi-Objective Deep Reinforcement Learning Framework

Arxiv

16+阅读 · 2018年6月27日

VIP会员

文章信息

相关主题

state-of-the-art

相关VIP内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

专知会员服务

23+阅读 · 2022年3月19日

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

从社会学实验到行为仿真：理解基于Agent的观点动力学建模思维

中英文版《GPT-5 System Card速览》报告

ACL 2025 | 大模型结构化知识提示的泛化能力研究

【普林斯顿博士论文】大型模型的高效推理

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Learning Prior Feature and Attention Enhanced Image Inpainting

Arxiv

0+阅读 · 2022年8月3日

Learning Skill-based Industrial Robot Tasks with User Priors

Arxiv

0+阅读 · 2022年8月2日

Graph-based Reinforcement Learning meets Mixed Integer Programs: An application to 3D robot assembly discovery

Arxiv

0+阅读 · 2022年8月2日

Relay Hindsight Experience Replay: Continual Reinforcement Learning for Robot Manipulation Tasks with Sparse Rewards

Relay Hindsight Experience Replay: Continual Reinforcement Learning for Robot Manipulation Tasks with Sparse Rewards

Arxiv

0+阅读 · 2022年8月1日

Bayesian Active Learning for Sim-to-Real Robotic Perception

Arxiv

0+阅读 · 2022年8月1日

Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning

Arxiv

0+阅读 · 2022年7月29日

Multi-Phase Multi-Objective Dexterous Manipulation with Adaptive Hierarchical Curriculum

Arxiv

0+阅读 · 2022年7月29日

MetaCURE: Meta Reinforcement Learning with Empowerment-Driven Exploration

Arxiv

12+阅读 · 2021年2月7日

Q-value Path Decomposition for Deep Multiagent Reinforcement Learning

Q-value Path Decomposition for Deep Multiagent Reinforcement Learning

Arxiv

26+阅读 · 2020年2月10日

A Multi-Objective Deep Reinforcement Learning Framework

A Multi-Objective Deep Reinforcement Learning Framework

Arxiv

16+阅读 · 2018年6月27日

相关基金

水溶性非环状分子容器的靶向药物传递

国家自然科学基金

0+阅读 · 2014年12月31日

基于SURE/PURE准则的图像盲反卷积算法研究

国家自然科学基金

3+阅读 · 2013年12月31日

分享经典信息的量子秘密共享研究

国家自然科学基金

0+阅读 · 2013年12月31日

氯氧化铋量子点薄膜的制备及其光催化机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于小波稀疏表示的压缩感知数字全息层析技术研究

国家自然科学基金

0+阅读 · 2013年12月31日

根际促生菌Bacillus amyloliquefaciens SQR9与植物根系分泌物互作的分子机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

BMPs调控Mef2C-ECR5-SOST转录轴的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

云计算环境下数据中心的power capping关键问题研究

国家自然科学基金

0+阅读 · 2012年12月31日

核基因编码的叶绿体蛋白转运调控机理

国家自然科学基金

0+阅读 · 2011年12月31日

14-3-3蛋白与肾脏尿素转运

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员