舰队-开发机组:交互式机器人舰队学习与可扩展的人类监督 (Fleet-DAgger: Interactive Robot Fleet Learning with Scalable Human Supervision) - 专知论文

会员服务 ·

0

Learning · INTERACT · 机器人 · Attention · Continuity ·

2022 年 6 月 29 日

Fleet-DAgger: Interactive Robot Fleet Learning with Scalable Human Supervision

翻译：舰队-开发机组:交互式机器人舰队学习与可扩展的人类监督

Ryan Hoque,Lawrence Yunliang Chen,Satvik Sharma,Karthik Dharmarajan,Brijen Thananjeyan,Pieter Abbeel,Ken Goldberg

Commercial and industrial deployments of robot fleets often fall back on remote human teleoperators during execution when robots are at risk or unable to make task progress. With continual learning, interventions from the remote pool of humans can also be used to improve the robot fleet control policy over time. A central question is how to effectively allocate limited human attention to individual robots. Prior work addresses this in the single-robot, single-human setting. We formalize the Interactive Fleet Learning (IFL) setting, in which multiple robots interactively query and learn from multiple human supervisors. We present a fully implemented open-source IFL benchmark suite of GPU-accelerated Isaac Gym environments for the evaluation of IFL algorithms. We propose Fleet-DAgger, a family of IFL algorithms, and compare a novel Fleet-DAgger algorithm to 4 baselines in simulation. We also perform 1000 trials of a physical block-pushing experiment with 4 ABB YuMi robot arms. Experiments suggest that the allocation of humans to robots significantly affects robot fleet performance, and that our algorithm achieves up to 8.8x higher return on human effort than baselines. See https://tinyurl.com/fleet-dagger for code, videos, and supplemental material.

翻译：在机器人面临风险或无法取得任务进展时,机器人机队的商业和工业部署往往会落在远程人类遥控器上。通过不断学习,远程人类群的干预也可以用来改进机器人机队的长期控制政策。一个中心问题是如何有效地将有限的人类注意力分配给个体机器人。先前的工作在单机器人、单人环境下解决这个问题。我们正式确定了互动式机队学习(IFL)设置,其中多个机器人交互查询并从多个人类督导员那里学习。我们提出了一个完全实施的开放源的IFL基准套GPU-加速IFL基准套件,用于评估IFL算法。我们提议了FL算法的车队-Dagger(IFL算法的家族),并将新的机队-Dagger算法与模拟中的4个基线进行比较。我们还对4个ABB Yumi机器人武器进行了1 000次物理阻击试验。实验表明,将人类分配给机器人会极大地影响机器人机队的性能,我们的算法计算方法比基线、 http://sublietal/subleal com.

0

相关内容

Learning

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

开放知识图谱

0+阅读 · 2021年9月9日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

LibRec 精选：推荐系统的论文与源码

LibRec 精选：推荐系统的论文与源码

LibRec智能推荐

14+阅读 · 2018年11月29日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

离子液体电沉积构筑纳米有序直孔/柱状结构CIGS吸收层及其光电转换性能研究

国家自然科学基金

0+阅读 · 2014年12月31日

Pt有序阵列电极氧还原反应动力学研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于天然产物Drimenal的新型杀菌剂分子设计、合成及构效关系研究

国家自然科学基金

0+阅读 · 2013年12月31日

Kronheimer-Nakajima quiver 模空间与有理曲面

国家自然科学基金

1+阅读 · 2013年12月31日

尖孢镰刀菌黄瓜专化型NPS6和CPS1基因的克隆与致病机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

新型石墨炔基氧还原电催化材料的制备及性能研究

国家自然科学基金

0+阅读 · 2013年12月31日

控制有机半导体材料分子按照face-on 方式排列的高性能薄膜晶体管的研究

国家自然科学基金

0+阅读 · 2012年12月31日

AB2O4(B=Al、Ga、In)基尖晶石型可见光催化剂结构和性能的理论与实验研究

国家自然科学基金

0+阅读 · 2011年12月31日

ZrO2+CaS辅助电极的制备、性能及机理研究

国家自然科学基金

0+阅读 · 2009年12月31日

多组分金属/半导体纳米结的合成及光催化分解水制氢的研究

国家自然科学基金

0+阅读 · 2008年12月31日

Line Coverage with Multiple Robots: Algorithms and Experiments

Arxiv

0+阅读 · 2022年8月19日

Personalized Federated Recommendation via Joint Representation Learning, User Clustering, and Model Adaptation

Personalized Federated Recommendation via Joint Representation Learning, User Clustering, and Model Adaptation

Arxiv

0+阅读 · 2022年8月19日

A scalable and fast artificial neural network syndrome decoder for surface codes

Arxiv

0+阅读 · 2022年8月19日

Scalable Multi-Agent Framework for Optimizing the Lab and Warehouse

Arxiv

0+阅读 · 2022年8月19日

Planning for Automated Vehicles with Human Trust

Arxiv

0+阅读 · 2022年8月18日

On the Privacy Effect of Data Enhancement via the Lens of Memorization

Arxiv

0+阅读 · 2022年8月17日

The Confluence of Networks, Games and Learning

Arxiv

94+阅读 · 2021年5月17日

Heterogeneous Network Representation Learning: A Unified Framework with Survey and Benchmark

Heterogeneous Network Representation Learning: A Unified Framework with Survey and Benchmark

Arxiv

19+阅读 · 2020年12月17日

Pre-training Text Representations as Meta Learning

Arxiv

13+阅读 · 2020年4月12日

Learning Heuristics over Large Graphs via Deep Reinforcement Learning

Arxiv

12+阅读 · 2019年3月8日

VIP会员

文章信息

相关主题

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【NTU博士论文】反事实推理在多模态对话生成中的应用

基于强化学习的智能体化搜索全面综述：基础、角色、优化、评估与应用

ICCV最佳论文出炉，朱俊彦团队用砖块积木摘得桂冠

面向具身操作的高效视觉–语言–动作模型：系统综述

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

开放知识图谱

0+阅读 · 2021年9月9日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

LibRec 精选：推荐系统的论文与源码

LibRec 精选：推荐系统的论文与源码

LibRec智能推荐

14+阅读 · 2018年11月29日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

Line Coverage with Multiple Robots: Algorithms and Experiments

Arxiv

0+阅读 · 2022年8月19日

Personalized Federated Recommendation via Joint Representation Learning, User Clustering, and Model Adaptation

Personalized Federated Recommendation via Joint Representation Learning, User Clustering, and Model Adaptation

Arxiv

0+阅读 · 2022年8月19日

A scalable and fast artificial neural network syndrome decoder for surface codes

Arxiv

0+阅读 · 2022年8月19日

Scalable Multi-Agent Framework for Optimizing the Lab and Warehouse

Arxiv

0+阅读 · 2022年8月19日

Planning for Automated Vehicles with Human Trust

Arxiv

0+阅读 · 2022年8月18日

On the Privacy Effect of Data Enhancement via the Lens of Memorization

Arxiv

0+阅读 · 2022年8月17日

The Confluence of Networks, Games and Learning

Arxiv

94+阅读 · 2021年5月17日

Heterogeneous Network Representation Learning: A Unified Framework with Survey and Benchmark

Heterogeneous Network Representation Learning: A Unified Framework with Survey and Benchmark

Arxiv

19+阅读 · 2020年12月17日

Pre-training Text Representations as Meta Learning

Arxiv

13+阅读 · 2020年4月12日

Learning Heuristics over Large Graphs via Deep Reinforcement Learning

Arxiv

12+阅读 · 2019年3月8日

相关基金

离子液体电沉积构筑纳米有序直孔/柱状结构CIGS吸收层及其光电转换性能研究

国家自然科学基金

0+阅读 · 2014年12月31日

Pt有序阵列电极氧还原反应动力学研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于天然产物Drimenal的新型杀菌剂分子设计、合成及构效关系研究

国家自然科学基金

0+阅读 · 2013年12月31日

Kronheimer-Nakajima quiver 模空间与有理曲面

国家自然科学基金

1+阅读 · 2013年12月31日

尖孢镰刀菌黄瓜专化型NPS6和CPS1基因的克隆与致病机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

新型石墨炔基氧还原电催化材料的制备及性能研究

国家自然科学基金

0+阅读 · 2013年12月31日

控制有机半导体材料分子按照face-on 方式排列的高性能薄膜晶体管的研究

国家自然科学基金

0+阅读 · 2012年12月31日

AB2O4(B=Al、Ga、In)基尖晶石型可见光催化剂结构和性能的理论与实验研究

国家自然科学基金

0+阅读 · 2011年12月31日

ZrO2+CaS辅助电极的制备、性能及机理研究

国家自然科学基金

0+阅读 · 2009年12月31日

多组分金属/半导体纳米结的合成及光催化分解水制氢的研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员