注意:为基于愿景的机器人操纵提供高效率学习 (Q-attention: Enabling Efficient Learning for Vision-based Robotic Manipulation) - 专知论文

会员服务 ·

0

学成 · 值域 · ARM · 机器人 · Performer ·

2022 年 2 月 4 日

Q-attention: Enabling Efficient Learning for Vision-based Robotic Manipulation

翻译：注意:为基于愿景的机器人操纵提供高效率学习

Stephen James,Andrew J. Davison

from arxiv, IEEE Robotics and Automation Letters, 2022 (+ presentation at ICRA 2022). Videos and code found at: https://sites.google.com/view/q-attention

Despite the success of reinforcement learning methods, they have yet to have their breakthrough moment when applied to a broad range of robotic manipulation tasks. This is partly due to the fact that reinforcement learning algorithms are notoriously difficult and time consuming to train, which is exacerbated when training from images rather than full-state inputs. As humans perform manipulation tasks, our eyes closely monitor every step of the process with our gaze focusing sequentially on the objects being manipulated. With this in mind, we present our Attention-driven Robotic Manipulation (ARM) algorithm, which is a general manipulation algorithm that can be applied to a range of sparse-rewarded tasks, given only a small number of demonstrations. ARM splits the complex task of manipulation into a 3 stage pipeline: (1) a Q-attention agent extracts relevant pixel locations from RGB and point cloud inputs, (2) a next-best pose agent that accepts crops from the Q-attention agent and outputs poses, and (3) a control agent that takes the goal pose and outputs joint actions. We show that current learning algorithms fail on a range of RLBench tasks, whilst ARM is successful.

翻译：尽管强化学习方法取得了成功,但在应用到一系列广泛的机器人操纵任务时,它们还没有达到突破的阶段,这部分是由于强化学习算法十分困难,培训耗时费时,而培训从图像而不是完整的国家投入中更加困难。当人类执行操纵任务时,我们的眼睛会密切监视这一过程的每一个步骤,并按顺序关注被操纵的物体。考虑到这一点,我们展示了我们受关注驱动的机器人操纵算法,这是一种一般操纵算法,可以应用于一系列稀疏的任务,因为演示数量很少。亚美尼亚将复杂的操纵任务分成了3阶段:(1) Q目的代理商从RGB和点云输入中提取相关的像素位置,(2) 接受受控代理和输出构成的作物的下一个最佳构成物剂,以及(3) 接受目标构成和产出联合行动的控制代理。我们显示,当前学习算法在一系列RLBench任务上失败,而ARM是成功的。

0

相关内容

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

开源书：PyTorch深度学习起步

开源书：PyTorch深度学习起步

专知会员服务

51+阅读 · 2019年10月11日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

无人驾驶中4D场景实时解析算法研究

国家自然科学基金

12+阅读 · 2017年12月31日

紫薯糖基化修饰酶Ib3GGT对花青素修饰和富集的研究

国家自然科学基金

0+阅读 · 2015年12月31日

两类复杂动力学网络的建模、分析与控制

国家自然科学基金

3+阅读 · 2014年12月31日

航天器大规模集群飞行的分布式协调控制方法研究

国家自然科学基金

1+阅读 · 2012年12月31日

复杂网络系统的有限时间同步控制研究

国家自然科学基金

0+阅读 · 2011年12月31日

多模式杆式移动机构

国家自然科学基金

1+阅读 · 2011年12月31日

自治微电网多模态协调切换混杂控制研究

国家自然科学基金

0+阅读 · 2011年12月31日

人工动物可重组建模、运动控制及智能规划方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

基于肌电解码的多功能仿生假肢的实时控制研究

国家自然科学基金

0+阅读 · 2009年12月31日

鲁棒人脸特征提取方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

Vision System of Curling Robots: Thrower and Skip

Arxiv

0+阅读 · 2022年4月20日

R3M: A Universal Visual Representation for Robot Manipulation

Arxiv

0+阅读 · 2022年4月18日

Training and Evaluation of Deep Policies using Reinforcement Learning and Generative Models

Arxiv

1+阅读 · 2022年4月18日

Efficient Spatial Representation and Routing of Deformable One-Dimensional Objects for Manipulation

Arxiv

0+阅读 · 2022年4月16日

Efficient Reinforcement Learning for Unsupervised Controlled Text Generation

Arxiv

0+阅读 · 2022年4月16日

Evaluating the Effectiveness of Corrective Demonstrations and a Low-Cost Sensor for Dexterous Manipulation

Arxiv

0+阅读 · 2022年4月15日

FasterVideo: Efficient Online Joint Object Detection And Tracking

FasterVideo: Efficient Online Joint Object Detection And Tracking

Arxiv

0+阅读 · 2022年4月15日

Domain Generalization in Vision: A Survey

Arxiv

16+阅读 · 2021年7月18日

Recent Advances in Deep Learning-based Dialogue Systems

Arxiv

18+阅读 · 2021年5月10日

Sparsity in Deep Learning: Pruning and growth for efficient inference and training in neural networks

Arxiv

14+阅读 · 2021年1月31日

VIP会员

文章信息

相关主题

相关VIP内容

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

开源书：PyTorch深度学习起步

开源书：PyTorch深度学习起步

专知会员服务

51+阅读 · 2019年10月11日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

面向具身智能的多模态数据存储与检索：综述

《算法战争研究计划全景评估》35页

【CMU博士论文】水下三维视觉感知与生成

智能体战争：自主人工智能军备竞赛全景透视

相关资讯

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Vision System of Curling Robots: Thrower and Skip

Arxiv

0+阅读 · 2022年4月20日

R3M: A Universal Visual Representation for Robot Manipulation

Arxiv

0+阅读 · 2022年4月18日

Training and Evaluation of Deep Policies using Reinforcement Learning and Generative Models

Arxiv

1+阅读 · 2022年4月18日

Efficient Spatial Representation and Routing of Deformable One-Dimensional Objects for Manipulation

Arxiv

0+阅读 · 2022年4月16日

Efficient Reinforcement Learning for Unsupervised Controlled Text Generation

Arxiv

0+阅读 · 2022年4月16日

Evaluating the Effectiveness of Corrective Demonstrations and a Low-Cost Sensor for Dexterous Manipulation

Arxiv

0+阅读 · 2022年4月15日

FasterVideo: Efficient Online Joint Object Detection And Tracking

FasterVideo: Efficient Online Joint Object Detection And Tracking

Arxiv

0+阅读 · 2022年4月15日

Domain Generalization in Vision: A Survey

Arxiv

16+阅读 · 2021年7月18日

Recent Advances in Deep Learning-based Dialogue Systems

Arxiv

18+阅读 · 2021年5月10日

Sparsity in Deep Learning: Pruning and growth for efficient inference and training in neural networks

Arxiv

14+阅读 · 2021年1月31日

相关基金

无人驾驶中4D场景实时解析算法研究

国家自然科学基金

12+阅读 · 2017年12月31日

紫薯糖基化修饰酶Ib3GGT对花青素修饰和富集的研究

国家自然科学基金

0+阅读 · 2015年12月31日

两类复杂动力学网络的建模、分析与控制

国家自然科学基金

3+阅读 · 2014年12月31日

航天器大规模集群飞行的分布式协调控制方法研究

国家自然科学基金

1+阅读 · 2012年12月31日

复杂网络系统的有限时间同步控制研究

国家自然科学基金

0+阅读 · 2011年12月31日

多模式杆式移动机构

国家自然科学基金

1+阅读 · 2011年12月31日

自治微电网多模态协调切换混杂控制研究

国家自然科学基金

0+阅读 · 2011年12月31日

人工动物可重组建模、运动控制及智能规划方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

基于肌电解码的多功能仿生假肢的实时控制研究

国家自然科学基金

0+阅读 · 2009年12月31日

鲁棒人脸特征提取方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员