The field of visual representation learning has seen explosive growth in recent years, but its benefits in robotics have been surprisingly limited so far. Prior work uses generic visual representations as a basis to learn (task-specific) robot action policies (e.g. via behavior cloning). While the visual representations do accelerate learning, they are primarily used to encode visual observations. Thus, action information has to be derived purely from robot data, which is expensive to collect! In this work, we present a scalable alternative in which the visual representations help directly infer robot actions. We observe that vision encoders express relationships between image observations as distances (e.g. via embedding dot product) that can be used to efficiently plan robot behavior. We operationalize this insight and develop a simple algorithm for acquiring a distance function and dynamics predictor by fine-tuning a pre-trained representation on human-collected video sequences. The final method substantially outperforms traditional robot learning baselines (e.g. 70% success vs. 50% for behavior cloning on pick-place) on a suite of diverse real-world manipulation tasks. It can also generalize to novel objects, without using any robot demonstrations at training time. For visualizations of the learned policies, please see: https://agi-labs.github.io/manipulate-by-seeing/
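The abstract describes selecting actions by combining a learned distance function (embedding dot product) with a dynamics predictor in embedding space. Below is a minimal, hedged sketch of such a control loop; the functions `embed`, `predict_next`, and the toy action candidates are placeholders invented for illustration and are not the paper's actual trained models.

```python
import numpy as np

# Hypothetical stand-ins for the learned components described in the abstract:
# an embedding function phi(.), a one-step dynamics predictor f(z, a) -> z',
# and a distance given by the negative dot product of normalized embeddings.

def embed(image: np.ndarray) -> np.ndarray:
    """Placeholder vision encoder: flatten and L2-normalize the image."""
    z = image.reshape(-1).astype(np.float64)
    return z / (np.linalg.norm(z) + 1e-8)

def predict_next(z: np.ndarray, action: np.ndarray) -> np.ndarray:
    """Placeholder dynamics predictor in embedding space (toy linear update)."""
    z_next = z + 0.1 * np.resize(action, z.shape)
    return z_next / (np.linalg.norm(z_next) + 1e-8)

def distance(z: np.ndarray, z_goal: np.ndarray) -> float:
    """Distance as negative embedding dot product (smaller = closer to goal)."""
    return -float(np.dot(z, z_goal))

def greedy_action(obs: np.ndarray, goal: np.ndarray,
                  candidate_actions: np.ndarray) -> np.ndarray:
    """Pick the candidate whose predicted next embedding is closest to the goal."""
    z, z_goal = embed(obs), embed(goal)
    scores = [distance(predict_next(z, a), z_goal) for a in candidate_actions]
    return candidate_actions[int(np.argmin(scores))]

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    obs = rng.random((8, 8, 3))             # current camera image (toy size)
    goal = rng.random((8, 8, 3))            # goal image
    candidates = rng.normal(size=(64, 4))   # sampled end-effector deltas (toy)
    print("selected action:", greedy_action(obs, goal, candidates))
```

In practice the encoder, dynamics predictor, and candidate action set would come from the fine-tuned representation and the robot's action space; the greedy one-step search here is only one possible way to use the learned distance for planning.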