GAIL:人类-机器人协作学习多样化战略 (Co-GAIL: Learning Diverse Strategies for Human-Robot Collaboration) - 专知论文

会员服务 ·

0

学成 · 多样性 · INTERACT · 估计/估计量 · 机器人 ·

2021 年 8 月 13 日

Co-GAIL: Learning Diverse Strategies for Human-Robot Collaboration

翻译：GAIL:人类-机器人协作学习多样化战略

Chen Wang,Claudia Pérez-D'Arpino,Danfei Xu,Li Fei-Fei,C. Karen Liu,Silvio Savarese

We present a method for learning a human-robot collaboration policy from human-human collaboration demonstrations. An effective robot assistant must learn to handle diverse human behaviors shown in the demonstrations and be robust when the humans adjust their strategies during online task execution. Our method co-optimizes a human policy and a robot policy in an interactive learning process: the human policy learns to generate diverse and plausible collaborative behaviors from demonstrations while the robot policy learns to assist by estimating the unobserved latent strategy of its human collaborator. Across a 2D strategy game, a human-robot handover task, and a multi-step collaborative manipulation task, our method outperforms the alternatives in both simulated evaluations and when executing the tasks with a real human operator in-the-loop. Supplementary materials and videos at https://sites.google.com/view/co-gail-web/home

翻译：我们提出了一个从人类-人类合作示范中学习人类-机器人合作政策的方法。一个有效的机器人助理必须学会处理演示中显示的各种人类行为,当人类在在线任务执行期间调整其战略时,必须保持稳健。我们的方法在互动学习过程中共同优化了人类政策和机器人政策:人类政策学会从演示中产生多样化和可信的合作行为,而机器人政策则学会通过估计其人类协作者未见的潜在战略来帮助。在2D战略游戏中,人类-机器人交接任务和多步合作操作任务中,我们的方法在模拟评估中和在与现场真正的人类操作者执行任务时,都超越了替代办法。补充材料和视频见https://sites.gogle.com/view/co-gail-web/home。

0

相关内容

多伦多大学最新《机器学习导论》课程，Introduction to Machine Learning

多伦多大学最新《机器学习导论》课程，Introduction to Machine Learning

专知会员服务

25+阅读 · 2020年9月24日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【CVPR2020-英伟达】从图像集合中学习自监督视点，Self-Supervised Viewpoint Learning From Image Collections

【CVPR2020-英伟达】从图像集合中学习自监督视点，Self-Supervised Viewpoint Learning From Image Collections

专知会员服务

24+阅读 · 2020年4月4日

【综述】联邦学习的威胁，Threats to Federated Learning: A Survey

【综述】联邦学习的威胁，Threats to Federated Learning: A Survey

专知会员服务

80+阅读 · 2020年3月4日

生成式对抗网络先验贝叶斯推断，Bayesian Inference with Generative Adversarial Network Priors

生成式对抗网络先验贝叶斯推断，Bayesian Inference with Generative Adversarial Network Priors

专知会员服务

28+阅读 · 2020年2月18日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

【NLP Seminar】机器人控制和协作下的位置指令（Robot Control and Collaboration in Situated Instruction Following ），康奈尔大学助理教授| Yoav Artzi

【NLP Seminar】机器人控制和协作下的位置指令（Robot Control and Collaboration in Situated Instruction Following ），康奈尔大学助理教授| Yoav Artzi

专知会员服务

8+阅读 · 2019年12月5日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

LibRec 精选：AutoML for Contextual Bandits

LibRec 精选：AutoML for Contextual Bandits

LibRec智能推荐

7+阅读 · 2019年9月19日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

动物脑的好奇心和强化学习的好奇心

动物脑的好奇心和强化学习的好奇心

CreateAMind

10+阅读 · 2019年1月26日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Reinforcement Learning: An Introduction 2018第二版 500页

Reinforcement Learning: An Introduction 2018第二版 500页

CreateAMind

14+阅读 · 2018年4月27日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

计算机类 | 期刊专刊截稿信息9条

计算机类 | 期刊专刊截稿信息9条

Call4Papers

4+阅读 · 2018年1月26日

计算机类 | 国际会议信息7条

计算机类 | 国际会议信息7条

Call4Papers

3+阅读 · 2017年11月17日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

【今日新增】IEEE Trans.专刊截稿信息8条

【今日新增】IEEE Trans.专刊截稿信息8条

Call4Papers

7+阅读 · 2017年6月29日

Extended Tree Search for Robot Task and Motion Planning

Extended Tree Search for Robot Task and Motion Planning

Arxiv

0+阅读 · 2021年10月11日

Learning from Ambiguous Demonstrations with Self-Explanation Guided Reinforcement Learning

Arxiv

0+阅读 · 2021年10月11日

Credit Assignment Safety Learning from Human Demonstrations

Arxiv

0+阅读 · 2021年10月9日

Explaining Reward Functions to Humans for Better Human-Robot Collaboration

Explaining Reward Functions to Humans for Better Human-Robot Collaboration

Arxiv

0+阅读 · 2021年10月8日

Imitation Learning: Progress, Taxonomies and Opportunities

Arxiv

12+阅读 · 2021年6月23日

OntoZSL: Ontology-enhanced Zero-shot Learning

Arxiv

17+阅读 · 2021年2月15日

Reward learning from human preferences and demonstrations in Atari

Arxiv

8+阅读 · 2018年11月15日

Structural Consistency and Controllability for Diverse Colorization

Structural Consistency and Controllability for Diverse Colorization

Arxiv

7+阅读 · 2018年9月6日

Visual Reinforcement Learning with Imagined Goals

Arxiv

8+阅读 · 2018年7月12日

Neural Network Based Reinforcement Learning for Audio-Visual Gaze Control in Human-Robot Interaction

Arxiv

6+阅读 · 2018年4月23日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

多伦多大学最新《机器学习导论》课程，Introduction to Machine Learning

多伦多大学最新《机器学习导论》课程，Introduction to Machine Learning

专知会员服务

25+阅读 · 2020年9月24日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【CVPR2020-英伟达】从图像集合中学习自监督视点，Self-Supervised Viewpoint Learning From Image Collections

【CVPR2020-英伟达】从图像集合中学习自监督视点，Self-Supervised Viewpoint Learning From Image Collections

专知会员服务

24+阅读 · 2020年4月4日

【综述】联邦学习的威胁，Threats to Federated Learning: A Survey

【综述】联邦学习的威胁，Threats to Federated Learning: A Survey

专知会员服务

80+阅读 · 2020年3月4日

生成式对抗网络先验贝叶斯推断，Bayesian Inference with Generative Adversarial Network Priors

生成式对抗网络先验贝叶斯推断，Bayesian Inference with Generative Adversarial Network Priors

专知会员服务

28+阅读 · 2020年2月18日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

【NLP Seminar】机器人控制和协作下的位置指令（Robot Control and Collaboration in Situated Instruction Following ），康奈尔大学助理教授| Yoav Artzi

【NLP Seminar】机器人控制和协作下的位置指令（Robot Control and Collaboration in Situated Instruction Following ），康奈尔大学助理教授| Yoav Artzi

专知会员服务

8+阅读 · 2019年12月5日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

军事行动中人工智能系统目标交战的附带损伤评估模型 | 最新文献

NeurIPS 2025 | 自动化所新作速览（一）

美陆军协会（AUSA）2025 年会公布的美国十大武器与防务产品创新

NeurIPS 2025 | 自动化所新作速览（二）

相关资讯

LibRec 精选：AutoML for Contextual Bandits

LibRec 精选：AutoML for Contextual Bandits

LibRec智能推荐

7+阅读 · 2019年9月19日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

动物脑的好奇心和强化学习的好奇心

动物脑的好奇心和强化学习的好奇心

CreateAMind

10+阅读 · 2019年1月26日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Reinforcement Learning: An Introduction 2018第二版 500页

Reinforcement Learning: An Introduction 2018第二版 500页

CreateAMind

14+阅读 · 2018年4月27日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

计算机类 | 期刊专刊截稿信息9条

计算机类 | 期刊专刊截稿信息9条

Call4Papers

4+阅读 · 2018年1月26日

计算机类 | 国际会议信息7条

计算机类 | 国际会议信息7条

Call4Papers

3+阅读 · 2017年11月17日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

【今日新增】IEEE Trans.专刊截稿信息8条

【今日新增】IEEE Trans.专刊截稿信息8条

Call4Papers

7+阅读 · 2017年6月29日

相关论文

Extended Tree Search for Robot Task and Motion Planning

Extended Tree Search for Robot Task and Motion Planning

Arxiv

0+阅读 · 2021年10月11日

Learning from Ambiguous Demonstrations with Self-Explanation Guided Reinforcement Learning

Arxiv

0+阅读 · 2021年10月11日

Credit Assignment Safety Learning from Human Demonstrations

Arxiv

0+阅读 · 2021年10月9日

Explaining Reward Functions to Humans for Better Human-Robot Collaboration

Explaining Reward Functions to Humans for Better Human-Robot Collaboration

Arxiv

0+阅读 · 2021年10月8日

Imitation Learning: Progress, Taxonomies and Opportunities

Arxiv

12+阅读 · 2021年6月23日

OntoZSL: Ontology-enhanced Zero-shot Learning

Arxiv

17+阅读 · 2021年2月15日

Reward learning from human preferences and demonstrations in Atari

Arxiv

8+阅读 · 2018年11月15日

Structural Consistency and Controllability for Diverse Colorization

Structural Consistency and Controllability for Diverse Colorization

Arxiv

7+阅读 · 2018年9月6日

Visual Reinforcement Learning with Imagined Goals

Arxiv

8+阅读 · 2018年7月12日

Neural Network Based Reinforcement Learning for Audio-Visual Gaze Control in Human-Robot Interaction

Arxiv

6+阅读 · 2018年4月23日

微信扫码咨询专知VIP会员