【论文】欺骗学习（Learning by Cheating） - 专知VIP

会员服务 ·

2

智能体 · 自动驾驶 · 计算机视觉 · 模仿学习 · Dian Chen ·

2020 年 1 月 3 日

【论文】欺骗学习（Learning by Cheating）

专知会员服务

专知，提供专业可信的知识分发服务，让认知协作更快更好！

题目： Learning by Cheating

摘要：

基于视觉的城市驾驶是困难的。自主系统需要学会感知世界并在其中行动。我们证明这个具有挑战性的学习问题可以通过把它分解成两个阶段来简化。我们首先训练一个可以访问特权信息的智能体。这个特权智能体通过观察环境的真实布局和所有交通参与者的位置来作弊。在第二阶段，有特权的智能体充当老师，训练一个纯粹基于视觉的感觉运动智能体。产生的感知运动智能体不能访问任何特权信息，也不会欺骗。这个两阶段的训练程序一开始是反直觉的，但是我们分析和实证证明了它有许多重要的优势。我们使用所提出的方法来训练一个基于视觉的自动驾驶系统，该系统在卡拉基准测试和最近的NoCrash基准测试上的表现远远超过现有水平。我们的方法首次实现了原始CARLA基准测试中所有任务的100%成功率，在NoCrash基准测试中创下了新记录，并将违规的频率与现有技术相比降低了一个数量级。

作者：

Dian Chen是得克萨斯大学奥斯汀分校CS专业的二年级博士生，之前在加州大学伯克利分校学习计算机科学和应用数学专业，在伯克利人工智能研究(BAIR)实验室担任研究助理。研究兴趣是机器人，计算机视觉和机器学习，包括强化学习。个人官网：http://www.cs.utexas.edu/~dchen/

成为VIP会员查看完整内容

28

相关内容

智能体

智能体，顾名思义，就是具有智能的实体，英文名是Agent。

【ICML2020】深度神经网络置信感知学习，Conﬁdence-Aware Learning for Deep Neural Networks

【ICML2020】深度神经网络置信感知学习，Conﬁdence-Aware Learning for Deep Neural Networks

专知会员服务

74+阅读 · 2020年7月6日

【牛津大学博士论文】基于强化学习的无地图机器人导航，Reinforcement Learning Based MRN

【牛津大学博士论文】基于强化学习的无地图机器人导航，Reinforcement Learning Based MRN

专知会员服务

122+阅读 · 2020年5月18日

【伯克利】最新《深度半监督学习》总述，146页ppt，Semi-Supervised Learning

【伯克利】最新《深度半监督学习》总述，146页ppt，Semi-Supervised Learning

专知会员服务

148+阅读 · 2020年4月11日

【CVPR2020-台大】透视眼：学会透过障碍物看东西，Learning to See Through Obstructions

【CVPR2020-台大】透视眼：学会透过障碍物看东西，Learning to See Through Obstructions

专知会员服务

27+阅读 · 2020年4月3日

【Uber AI新论文】持续元学习，Learning to Continually Learn

【Uber AI新论文】持续元学习，Learning to Continually Learn

专知会员服务

37+阅读 · 2020年2月27日

【论文】量子对抗机器学习，Quantum Adversarial Machine Learning

【论文】量子对抗机器学习，Quantum Adversarial Machine Learning

专知会员服务

38+阅读 · 2020年1月5日

【北京智源大会2019】贝叶斯深度学习（ Bayesian Deep Learning ），清华大学| 朱军

【北京智源大会2019】贝叶斯深度学习（ Bayesian Deep Learning ），清华大学| 朱军

专知会员服务

105+阅读 · 2019年11月22日

【电子书推荐】强化学习（Reinforcement Learning）法兰克福大学 | Cornelius Weber

【电子书推荐】强化学习（Reinforcement Learning）法兰克福大学 | Cornelius Weber

专知会员服务

45+阅读 · 2019年11月19日

【CMU】机器学习导论课程（Introduction to Machine Learning）

【CMU】机器学习导论课程（Introduction to Machine Learning）

专知会员服务

61+阅读 · 2019年8月26日

【ICML2019 tutorial】安全机器学习（Safe Machine Learning），Silvia Chiappa，Jan Leike

【ICML2019 tutorial】安全机器学习（Safe Machine Learning），Silvia Chiappa，Jan Leike

专知会员服务

23+阅读 · 2019年6月10日

【CVPR2020-台大】透视眼：学会透过障碍物看东西，Learning to See Through Obstructions

【CVPR2020-台大】透视眼：学会透过障碍物看东西，Learning to See Through Obstructions

专知

26+阅读 · 2020年4月3日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知

133+阅读 · 2020年3月18日

最新必读的8篇「小样本学习（few-shot learning）」2020顶会论文和代码

最新必读的8篇「小样本学习（few-shot learning）」2020顶会论文和代码

专知

115+阅读 · 2020年3月2日

【加州理工】什么是模仿学习(Imitation Learning（模仿学习), 这62页ppt带你了解进展，附下载

【加州理工】什么是模仿学习(Imitation Learning（模仿学习), 这62页ppt带你了解进展，附下载

专知

21+阅读 · 2019年11月14日

元学习—Meta Learning的兴起

元学习—Meta Learning的兴起

专知

44+阅读 · 2019年10月19日

近期必读的八篇【Meta-Learning（元学习）】相关论文和代码

近期必读的八篇【Meta-Learning（元学习）】相关论文和代码

专知

134+阅读 · 2019年9月15日

ICML2019《元学习》教程与必读论文列表

ICML2019《元学习》教程与必读论文列表

专知

42+阅读 · 2019年6月16日

近期必读的10篇 ICML 2019【图神经网络（GNN）】相关论文和代码

近期必读的10篇 ICML 2019【图神经网络（GNN）】相关论文和代码

专知

131+阅读 · 2019年5月28日

Reinforcement Learning: An Introduction 2018第二版 500页

Reinforcement Learning: An Introduction 2018第二版 500页

CreateAMind

14+阅读 · 2018年4月27日

Andrew NG的新书《Machine Learning Yearning》

Andrew NG的新书《Machine Learning Yearning》

我爱机器学习

11+阅读 · 2016年12月7日

Continual Unsupervised Representation Learning

Continual Unsupervised Representation Learning

Arxiv

7+阅读 · 2019年10月31日

Label Embedded Dictionary Learning for Image Classification

Label Embedded Dictionary Learning for Image Classification

Arxiv

6+阅读 · 2019年3月7日

Meta-Transfer Learning for Few-Shot Learning

Meta-Transfer Learning for Few-Shot Learning

Arxiv

8+阅读 · 2018年12月6日

Large Margin Few-Shot Learning

Arxiv

11+阅读 · 2018年7月8日

Relational Deep Reinforcement Learning

Relational Deep Reinforcement Learning

Arxiv

10+阅读 · 2018年6月28日

Learning Unsupervised Learning Rules

Arxiv

7+阅读 · 2018年5月23日

Learning Region Features for Object Detection

Arxiv

4+阅读 · 2018年3月19日

Safety-aware Adaptive Reinforcement Learning with Applications to Brushbot Navigation

Arxiv

4+阅读 · 2018年1月29日

One-shot and few-shot learning of word embeddings

Arxiv

5+阅读 · 2017年10月27日

A Unified approach for Conventional Zero-shot, Generalized Zero-shot and Few-shot Learning

Arxiv

4+阅读 · 2017年10月26日

VIP会员

相关主题

计算机视觉

相关VIP内容

【ICML2020】深度神经网络置信感知学习，Conﬁdence-Aware Learning for Deep Neural Networks

【ICML2020】深度神经网络置信感知学习，Conﬁdence-Aware Learning for Deep Neural Networks

专知会员服务

74+阅读 · 2020年7月6日

【牛津大学博士论文】基于强化学习的无地图机器人导航，Reinforcement Learning Based MRN

【牛津大学博士论文】基于强化学习的无地图机器人导航，Reinforcement Learning Based MRN

专知会员服务

122+阅读 · 2020年5月18日

【伯克利】最新《深度半监督学习》总述，146页ppt，Semi-Supervised Learning

【伯克利】最新《深度半监督学习》总述，146页ppt，Semi-Supervised Learning

专知会员服务

148+阅读 · 2020年4月11日

【CVPR2020-台大】透视眼：学会透过障碍物看东西，Learning to See Through Obstructions

【CVPR2020-台大】透视眼：学会透过障碍物看东西，Learning to See Through Obstructions

专知会员服务

27+阅读 · 2020年4月3日

【Uber AI新论文】持续元学习，Learning to Continually Learn

【Uber AI新论文】持续元学习，Learning to Continually Learn

专知会员服务

37+阅读 · 2020年2月27日

【论文】量子对抗机器学习，Quantum Adversarial Machine Learning

【论文】量子对抗机器学习，Quantum Adversarial Machine Learning

专知会员服务

38+阅读 · 2020年1月5日

【北京智源大会2019】贝叶斯深度学习（ Bayesian Deep Learning ），清华大学| 朱军

【北京智源大会2019】贝叶斯深度学习（ Bayesian Deep Learning ），清华大学| 朱军

专知会员服务

105+阅读 · 2019年11月22日

【电子书推荐】强化学习（Reinforcement Learning）法兰克福大学 | Cornelius Weber

【电子书推荐】强化学习（Reinforcement Learning）法兰克福大学 | Cornelius Weber

专知会员服务

45+阅读 · 2019年11月19日

【CMU】机器学习导论课程（Introduction to Machine Learning）

【CMU】机器学习导论课程（Introduction to Machine Learning）

专知会员服务

61+阅读 · 2019年8月26日

【ICML2019 tutorial】安全机器学习（Safe Machine Learning），Silvia Chiappa，Jan Leike

【ICML2019 tutorial】安全机器学习（Safe Machine Learning），Silvia Chiappa，Jan Leike

专知会员服务

23+阅读 · 2019年6月10日

热门VIP内容

开通专知VIP会员享更多权益服务

大语言模型智能体强化学习：全景综述

《城市滨海地区：理解复杂多变环境下的指挥控制框架》50页报告

【伯克利博士论文】从推理服务到训练：面向大规模 LLM 智能体的高效系统

美空军“顶点2025”实验：推进AI在C2、动态目标锁定与联盟集成中的应用

相关资讯

【CVPR2020-台大】透视眼：学会透过障碍物看东西，Learning to See Through Obstructions

【CVPR2020-台大】透视眼：学会透过障碍物看东西，Learning to See Through Obstructions

专知

26+阅读 · 2020年4月3日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知

133+阅读 · 2020年3月18日

最新必读的8篇「小样本学习（few-shot learning）」2020顶会论文和代码

最新必读的8篇「小样本学习（few-shot learning）」2020顶会论文和代码

专知

115+阅读 · 2020年3月2日

【加州理工】什么是模仿学习(Imitation Learning（模仿学习), 这62页ppt带你了解进展，附下载

【加州理工】什么是模仿学习(Imitation Learning（模仿学习), 这62页ppt带你了解进展，附下载

专知

21+阅读 · 2019年11月14日

元学习—Meta Learning的兴起

元学习—Meta Learning的兴起

专知

44+阅读 · 2019年10月19日

近期必读的八篇【Meta-Learning（元学习）】相关论文和代码

近期必读的八篇【Meta-Learning（元学习）】相关论文和代码

专知

134+阅读 · 2019年9月15日

ICML2019《元学习》教程与必读论文列表

ICML2019《元学习》教程与必读论文列表

专知

42+阅读 · 2019年6月16日

近期必读的10篇 ICML 2019【图神经网络（GNN）】相关论文和代码

近期必读的10篇 ICML 2019【图神经网络（GNN）】相关论文和代码

专知

131+阅读 · 2019年5月28日

Reinforcement Learning: An Introduction 2018第二版 500页

Reinforcement Learning: An Introduction 2018第二版 500页

CreateAMind

14+阅读 · 2018年4月27日

Andrew NG的新书《Machine Learning Yearning》

Andrew NG的新书《Machine Learning Yearning》

我爱机器学习

11+阅读 · 2016年12月7日

相关论文

Continual Unsupervised Representation Learning

Continual Unsupervised Representation Learning

Arxiv

7+阅读 · 2019年10月31日

Label Embedded Dictionary Learning for Image Classification

Label Embedded Dictionary Learning for Image Classification

Arxiv

6+阅读 · 2019年3月7日

Meta-Transfer Learning for Few-Shot Learning

Meta-Transfer Learning for Few-Shot Learning

Arxiv

8+阅读 · 2018年12月6日

Large Margin Few-Shot Learning

Arxiv

11+阅读 · 2018年7月8日

Relational Deep Reinforcement Learning

Relational Deep Reinforcement Learning

Arxiv

10+阅读 · 2018年6月28日

Learning Unsupervised Learning Rules

Arxiv

7+阅读 · 2018年5月23日

Learning Region Features for Object Detection

Arxiv

4+阅读 · 2018年3月19日

Safety-aware Adaptive Reinforcement Learning with Applications to Brushbot Navigation

Arxiv

4+阅读 · 2018年1月29日

One-shot and few-shot learning of word embeddings

Arxiv

5+阅读 · 2017年10月27日

A Unified approach for Conventional Zero-shot, Generalized Zero-shot and Few-shot Learning

Arxiv

4+阅读 · 2017年10月26日

微信扫码咨询专知VIP会员