In recent years, imitation learning (IL) has been widely used in industry as the core of autonomous vehicle (AV) planning modules. However, previous IL works suffer from sample inefficiency and poor generalisation in safety-critical scenarios, on which they are rarely tested. As a result, IL planners can reach a performance plateau where adding more training data ceases to improve the learnt policy. First, our work presents an IL model that uses a spline coefficient parameterisation and offline expert queries to enhance safety and training efficiency. Then, we expose the weaknesses of the learnt IL policy by synthetically generating critical scenarios through optimisation of the parameters of the driver's risk field (DRF), a parametric human driving behaviour model implemented in a multi-agent traffic simulator based on the Lyft Prediction Dataset. To continuously improve the learnt policy, we retrain the IL model with the augmented data. Thanks to the expressivity and interpretability of the DRF, the desired driving behaviours can be encoded and aggregated into the original training data. Our work constitutes a full development cycle that can efficiently and continuously improve the learnt IL policies in closed loop. Finally, we show that our IL planner, developed with fewer training resources, still outperforms the previous state of the art.
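To make the spline coefficient parameterisation concrete, the following is a minimal sketch, not the paper's implementation: it assumes the planner regresses a fixed-size vector of polynomial coefficients per trajectory instead of a long sequence of raw waypoints, and the function names, cubic degree, and toy expert path below are all illustrative.

```python
import numpy as np

def trajectory_to_coeffs(waypoints: np.ndarray, degree: int = 3) -> np.ndarray:
    """Fit x(t) and y(t) polynomials to expert waypoints and return the
    stacked coefficients as a compact regression target for the IL model."""
    t = np.linspace(0.0, 1.0, len(waypoints))
    cx = np.polyfit(t, waypoints[:, 0], degree)  # x-coefficients, highest degree first
    cy = np.polyfit(t, waypoints[:, 1], degree)  # y-coefficients
    return np.concatenate([cx, cy])

def coeffs_to_trajectory(coeffs: np.ndarray, n_points: int = 20) -> np.ndarray:
    """Decode predicted coefficients back into a dense, smooth trajectory."""
    degree = len(coeffs) // 2 - 1
    cx, cy = coeffs[: degree + 1], coeffs[degree + 1 :]
    t = np.linspace(0.0, 1.0, n_points)
    return np.stack([np.polyval(cx, t), np.polyval(cy, t)], axis=1)

# Usage: a curved toy expert path (25 waypoints) compresses to 8 coefficients
# and decodes back into a smooth trajectory of any desired resolution.
expert = np.stack([np.linspace(0, 50, 25),
                   0.02 * np.linspace(0, 50, 25) ** 2], axis=1)
coeffs = trajectory_to_coeffs(expert)
decoded = coeffs_to_trajectory(coeffs)
print(coeffs.shape, decoded.shape)  # (8,) (20, 2)
```

The appeal of such a parameterisation is that the output dimensionality stays fixed and small regardless of the planning horizon, and the decoded trajectory is smooth by construction, which is one way to improve training efficiency as the abstract claims.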