SKILL-IL:多任务模拟学习中分离技能和知识 (SKILL-IL: Disentangling Skill and Knowledge in Multitask Imitation Learning) - 专知论文

会员服务 ·

0

知识 (knowledge) · Learning · 回合 · Agent · CASES ·

2022 年 7 月 26 日

SKILL-IL: Disentangling Skill and Knowledge in Multitask Imitation Learning

翻译：SKILL-IL:多任务模拟学习中分离技能和知识

Bian Xihan,Oscar Mendez,Simon Hadfield

from arxiv, Submitted to IROS 2022, under review

In this work, we introduce a new perspective for learning transferable content in multi-task imitation learning. Humans are able to transfer skills and knowledge. If we can cycle to work and drive to the store, we can also cycle to the store and drive to work. We take inspiration from this and hypothesize the latent memory of a policy network can be disentangled into two partitions. These contain either the knowledge of the environmental context for the task or the generalizable skill needed to solve the task. This allows improved training efficiency and better generalization over previously unseen combinations of skills in the same environment, and the same task in unseen environments. We used the proposed approach to train a disentangled agent for two different multi-task IL environments. In both cases we out-performed the SOTA by 30% in task success rate. We also demonstrated this for navigation on a real robot.

翻译：在这项工作中,我们引入了在多任务模拟学习中学习可转让内容的新视角。人类能够转让技能和知识。如果我们可以循环工作, 我们可以循环到商店, 我们可以循环到仓库, 并驱动工作。我们从中汲取灵感, 并假设政策网络的潜在记忆可以分解成两个分区。这些分区要么包含任务的环境背景知识, 要么包含解决任务所需的一般技能。这样可以提高培训效率, 更好地推广以前无法见的同一环境中的技能组合, 以及未知环境中的相同任务。我们用建议的方法为两种不同的多任务 IL 环境训练一个分解的代理。在这两种情况下,我们在任务成功率上都比SOTA高出了30%。我们还演示了这个方法, 用于在真正的机器人上导航。

0

相关内容

知识 (knowledge)

知识 (knowledge)

通过学习、实践或探索所获得的认识、判断或技能。

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

104+阅读 · 2022年2月10日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

专知会员服务

77+阅读 · 2020年2月8日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新5篇行人再识别（ReID）相关论文—迁移学习、特征集成、重排序、多通道金字塔、深层生成模型

【论文推荐】最新5篇行人再识别（ReID）相关论文—迁移学习、特征集成、重排序、多通道金字塔、深层生成模型

专知

12+阅读 · 2018年3月24日

IL-1β活化脂肪间充质干细胞对小肠缺血再灌注损伤的修复作用及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于IIM模型的城市关联基础设施系统的脆弱性与弹性评价研究

国家自然科学基金

1+阅读 · 2015年12月31日

基于BIM的建筑生命周期环境与经济评价及优化设计方法研究

国家自然科学基金

3+阅读 · 2014年12月31日

非线性Cahn-Hilliard型方程自适应高阶稳定数值方法分析

国家自然科学基金

0+阅读 · 2013年12月31日

多元整数值GARCH模型的统计分析

国家自然科学基金

0+阅读 · 2012年12月31日

兼具诱导成骨和成血管的钛材表面工程研究

国家自然科学基金

0+阅读 · 2012年12月31日

氧化铈在镍钴钨合金粉末成形及高温耐磨涂层中作用机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

约化群酉表示的branching law及其应用

国家自然科学基金

0+阅读 · 2009年12月31日

TR3相互作用新蛋白机理研究

国家自然科学基金

1+阅读 · 2008年12月31日

微通道内气液两相流及传质特性

国家自然科学基金

0+阅读 · 2008年12月31日

Measuring Interventional Robustness in Reinforcement Learning

Arxiv

0+阅读 · 2022年9月19日

Latent Plans for Task-Agnostic Offline Reinforcement Learning

Arxiv

0+阅读 · 2022年9月19日

Learn the Time to Learn: Replay Scheduling in Continual Learning

Arxiv

0+阅读 · 2022年9月18日

Self-Optimizing Feature Transformation

Arxiv

0+阅读 · 2022年9月16日

Versatile Skill Control via Self-supervised Adversarial Imitation of Unlabeled Mixed Motions

Versatile Skill Control via Self-supervised Adversarial Imitation of Unlabeled Mixed Motions

Arxiv

0+阅读 · 2022年9月16日

Private Synthetic Data for Multitask Learning and Marginal Queries

Private Synthetic Data for Multitask Learning and Marginal Queries

Arxiv

0+阅读 · 2022年9月15日

Neural-iLQR: A Learning-Aided Shooting Method for Trajectory Optimization

Neural-iLQR: A Learning-Aided Shooting Method for Trajectory Optimization

Arxiv

0+阅读 · 2022年9月15日

Imitation Learning: Progress, Taxonomies and Opportunities

Arxiv

12+阅读 · 2021年6月23日

Generative Models as a Data Source for Multiview Representation Learning

Arxiv

16+阅读 · 2021年6月9日

Event Extraction with Generative Adversarial Imitation Learning

Arxiv

13+阅读 · 2018年4月21日

VIP会员

文章信息

相关主题

知识 (knowledge)

相关VIP内容

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

104+阅读 · 2022年2月10日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

专知会员服务

77+阅读 · 2020年2月8日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

NeurIPS 2025 | 自动化所新作速览（一）

大型语言模型（LLM）赋能的知识图谱构建：综述

NeurIPS 2025 | 自动化所新作速览（二）

领域特定文本分类中的预训练语言模型新进展：系统综述

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新5篇行人再识别（ReID）相关论文—迁移学习、特征集成、重排序、多通道金字塔、深层生成模型

【论文推荐】最新5篇行人再识别（ReID）相关论文—迁移学习、特征集成、重排序、多通道金字塔、深层生成模型

专知

12+阅读 · 2018年3月24日

相关论文

Measuring Interventional Robustness in Reinforcement Learning

Arxiv

0+阅读 · 2022年9月19日

Latent Plans for Task-Agnostic Offline Reinforcement Learning

Arxiv

0+阅读 · 2022年9月19日

Learn the Time to Learn: Replay Scheduling in Continual Learning

Arxiv

0+阅读 · 2022年9月18日

Self-Optimizing Feature Transformation

Arxiv

0+阅读 · 2022年9月16日

Versatile Skill Control via Self-supervised Adversarial Imitation of Unlabeled Mixed Motions

Versatile Skill Control via Self-supervised Adversarial Imitation of Unlabeled Mixed Motions

Arxiv

0+阅读 · 2022年9月16日

Private Synthetic Data for Multitask Learning and Marginal Queries

Private Synthetic Data for Multitask Learning and Marginal Queries

Arxiv

0+阅读 · 2022年9月15日

Neural-iLQR: A Learning-Aided Shooting Method for Trajectory Optimization

Neural-iLQR: A Learning-Aided Shooting Method for Trajectory Optimization

Arxiv

0+阅读 · 2022年9月15日

Imitation Learning: Progress, Taxonomies and Opportunities

Arxiv

12+阅读 · 2021年6月23日

Generative Models as a Data Source for Multiview Representation Learning

Arxiv

16+阅读 · 2021年6月9日

Event Extraction with Generative Adversarial Imitation Learning

Arxiv

13+阅读 · 2018年4月21日

相关基金

IL-1β活化脂肪间充质干细胞对小肠缺血再灌注损伤的修复作用及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于IIM模型的城市关联基础设施系统的脆弱性与弹性评价研究

国家自然科学基金

1+阅读 · 2015年12月31日

基于BIM的建筑生命周期环境与经济评价及优化设计方法研究

国家自然科学基金

3+阅读 · 2014年12月31日

非线性Cahn-Hilliard型方程自适应高阶稳定数值方法分析

国家自然科学基金

0+阅读 · 2013年12月31日

多元整数值GARCH模型的统计分析

国家自然科学基金

0+阅读 · 2012年12月31日

兼具诱导成骨和成血管的钛材表面工程研究

国家自然科学基金

0+阅读 · 2012年12月31日

氧化铈在镍钴钨合金粉末成形及高温耐磨涂层中作用机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

约化群酉表示的branching law及其应用

国家自然科学基金

0+阅读 · 2009年12月31日

TR3相互作用新蛋白机理研究

国家自然科学基金

1+阅读 · 2008年12月31日

微通道内气液两相流及传质特性

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员