Vision-based autonomous urban driving in dense traffic is highly challenging due to the complexity of urban environments and the dynamics of driving behaviors. Widely applied methods either rely heavily on hand-crafted rules or learn from limited human experience, which makes it hard for them to generalize to rare but critical scenarios. In this paper, we present CADRE, a novel CAscade Deep REinforcement learning framework for model-free, vision-based autonomous urban driving. In CADRE, to derive representative latent features from raw observations, we first train offline a Co-attention Perception Module (CoPM) that leverages the co-attention mechanism to learn the inter-relationships between visual and control information from a pre-collected driving dataset. Cascaded with the frozen CoPM, we then present an efficient distributed proximal policy optimization framework that learns the driving policy online under the guidance of particularly designed reward functions. We conduct a comprehensive empirical study on the CARLA NoCrash benchmark as well as on specific obstacle-avoidance scenarios in autonomous urban driving tasks. The experimental results demonstrate the effectiveness of CADRE and its superiority over the state of the art by a wide margin.