PILOT: 通过模拟学习和优化安全自主驾驶的高效规划 (PILOT: Efficient Planning by Imitation Learning and Optimisation for Safe Autonomous Driving) - 专知论文

会员服务 ·

0

学成 · Networking · Neural Networks · state-of-the-art · 层 ·

2021 年 7 月 30 日

PILOT: Efficient Planning by Imitation Learning and Optimisation for Safe Autonomous Driving

翻译：PILOT: 通过模拟学习和优化安全自主驾驶的高效规划

Henry Pulver,Francisco Eiras,Ludovico Carozza,Majd Hawasly,Stefano V. Albrecht,Subramanian Ramamoorthy

from arxiv, IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2021. 8 pages, 7 figures

Achieving a proper balance between planning quality, safety and efficiency is a major challenge for autonomous driving. Optimisation-based motion planners are capable of producing safe, smooth and comfortable plans, but often at the cost of runtime efficiency. On the other hand, naively deploying trajectories produced by efficient-to-run deep imitation learning approaches might risk compromising safety. In this paper, we present PILOT -- a planning framework that comprises an imitation neural network followed by an efficient optimiser that actively rectifies the network's plan, guaranteeing fulfilment of safety and comfort requirements. The objective of the efficient optimiser is the same as the objective of an expensive-to-run optimisation-based planning system that the neural network is trained offline to imitate. This efficient optimiser provides a key layer of online protection from learning failures or deficiency in out-of-distribution situations that might compromise safety or comfort. Using a state-of-the-art, runtime-intensive optimisation-based method as the expert, we demonstrate in simulated autonomous driving experiments in CARLA that PILOT achieves a seven-fold reduction in runtime when compared to the expert it imitates without sacrificing planning quality.

翻译：实现规划质量、安全和效率之间的适当平衡是自主驾驶的一大挑战。优化型运动规划者能够制定安全、顺畅和舒适的计划,但往往以运行效率为代价。另一方面,天真地部署高效到运行深度模仿学习方法产生的轨迹可能会危及安全。在本文件中,我们介绍PILOT -- -- 一个规划框架,由模仿神经网络组成,并辅之以一种高效的优化,积极修正网络计划,保证安全和舒适要求得到满足。高效的节能软件的目标与一个昂贵到运行的优化型规划系统的目标相同,即对神经网络进行离线培训,以便模仿。这一高效的节能软件提供了关键的在线保护层,防止在可能损害安全或舒适的不平等情况下学习失败或缺陷。我们以专家的身份使用一种最先进的、时间密集的节能方法,在CARLA的模拟自主驾驶实验中展示,PILOT在不进行质量规划的情况下,在不进行自我复制的情况下,不进行7倍的质量削减。

0

相关内容

机器学习组合优化

机器学习组合优化

专知会员服务

110+阅读 · 2021年2月16日

达摩院基于元学习的对话系统

达摩院基于元学习的对话系统

专知会员服务

25+阅读 · 2021年1月1日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

知识图谱推理，50页ppt，Salesforce首席科学家Richard Socher

知识图谱推理，50页ppt，Salesforce首席科学家Richard Socher

专知会员服务

111+阅读 · 2020年6月10日

【华盛顿大学】用于视觉和语言导航的多视图学习，Multi-View Learning for Vision-and-Language Navigation

【华盛顿大学】用于视觉和语言导航的多视图学习，Multi-View Learning for Vision-and-Language Navigation

专知会员服务

31+阅读 · 2020年3月11日

【斯坦福大学】Gradient Surgery for Multi-Task Learning

【斯坦福大学】Gradient Surgery for Multi-Task Learning

专知会员服务

47+阅读 · 2020年1月23日

【ICCV 2019】基于元学习的自动化神经网络通道 MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning

【ICCV 2019】基于元学习的自动化神经网络通道 MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning

专知会员服务

17+阅读 · 2019年11月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

MIT新书《强化学习与最优控制》

MIT新书《强化学习与最优控制》

专知会员服务

281+阅读 · 2019年10月9日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

RL 真经

CreateAMind

5+阅读 · 2018年12月28日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

carla 学习笔记

carla 学习笔记

CreateAMind

9+阅读 · 2018年2月7日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

Physical Gradients for Deep Learning

Arxiv

0+阅读 · 2021年10月1日

Safe, Deterministic Trajectory Planning for Unstructured and Partially Occluded Environments

Arxiv

0+阅读 · 2021年9月30日

Linear Differential Games for Cooperative Behavior Planning of Autonomous Vehicles Using Mixed-Integer Programming

Arxiv

0+阅读 · 2021年9月30日

A discrete optimisation approach for target path planning whilst evading sensors

Arxiv

0+阅读 · 2021年9月30日

Guaranteed Rejection-free Sampling Method Using Past Behaviours for Motion Planning of Autonomous Systems

Arxiv

0+阅读 · 2021年9月29日

Improving Safety in Deep Reinforcement Learning using Unsupervised Action Planning

Arxiv

0+阅读 · 2021年9月29日

A Communication Security Game on Switched Systems for Autonomous Vehicle Platoons

Arxiv

0+阅读 · 2021年9月29日

SafetyNet: Safe planning for real-world self-driving vehicles using machine-learned policies

Arxiv

0+阅读 · 2021年9月28日

Adaptive Informative Path Planning Using Deep Reinforcement Learning for UAV-based Active Sensing

Arxiv

0+阅读 · 2021年9月28日

Runtime Safety Assurance for Learning-enabled Control of Autonomous Driving Vehicles

Arxiv

0+阅读 · 2021年9月28日

VIP会员

文章信息

相关主题

Neural Networks

state-of-the-art

相关VIP内容

机器学习组合优化

机器学习组合优化

专知会员服务

110+阅读 · 2021年2月16日

达摩院基于元学习的对话系统

达摩院基于元学习的对话系统

专知会员服务

25+阅读 · 2021年1月1日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

知识图谱推理，50页ppt，Salesforce首席科学家Richard Socher

知识图谱推理，50页ppt，Salesforce首席科学家Richard Socher

专知会员服务

111+阅读 · 2020年6月10日

【华盛顿大学】用于视觉和语言导航的多视图学习，Multi-View Learning for Vision-and-Language Navigation

【华盛顿大学】用于视觉和语言导航的多视图学习，Multi-View Learning for Vision-and-Language Navigation

专知会员服务

31+阅读 · 2020年3月11日

【斯坦福大学】Gradient Surgery for Multi-Task Learning

【斯坦福大学】Gradient Surgery for Multi-Task Learning

专知会员服务

47+阅读 · 2020年1月23日

【ICCV 2019】基于元学习的自动化神经网络通道 MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning

【ICCV 2019】基于元学习的自动化神经网络通道 MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning

专知会员服务

17+阅读 · 2019年11月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

MIT新书《强化学习与最优控制》

MIT新书《强化学习与最优控制》

专知会员服务

281+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

自动驾驶轨迹规划中的基础模型：进展综述与开放挑战

《用于提升多域战备的大型语言模型辅助场景生成器》报告

【斯坦福博士论文】为人类使用优化 AI 模型

国防领域人工智能规模化应用的理论与实践

相关资讯

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

RL 真经

CreateAMind

5+阅读 · 2018年12月28日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

carla 学习笔记

carla 学习笔记

CreateAMind

9+阅读 · 2018年2月7日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

相关论文

Physical Gradients for Deep Learning

Arxiv

0+阅读 · 2021年10月1日

Safe, Deterministic Trajectory Planning for Unstructured and Partially Occluded Environments

Arxiv

0+阅读 · 2021年9月30日

Linear Differential Games for Cooperative Behavior Planning of Autonomous Vehicles Using Mixed-Integer Programming

Arxiv

0+阅读 · 2021年9月30日

A discrete optimisation approach for target path planning whilst evading sensors

Arxiv

0+阅读 · 2021年9月30日

Guaranteed Rejection-free Sampling Method Using Past Behaviours for Motion Planning of Autonomous Systems

Arxiv

0+阅读 · 2021年9月29日

Improving Safety in Deep Reinforcement Learning using Unsupervised Action Planning

Arxiv

0+阅读 · 2021年9月29日

A Communication Security Game on Switched Systems for Autonomous Vehicle Platoons

Arxiv

0+阅读 · 2021年9月29日

SafetyNet: Safe planning for real-world self-driving vehicles using machine-learned policies

Arxiv

0+阅读 · 2021年9月28日

Adaptive Informative Path Planning Using Deep Reinforcement Learning for UAV-based Active Sensing

Arxiv

0+阅读 · 2021年9月28日

Runtime Safety Assurance for Learning-enabled Control of Autonomous Driving Vehicles

Arxiv

0+阅读 · 2021年9月28日

微信扫码咨询专知VIP会员