安全轨迹规划,利用加强学习自驾驶驾驶的安全轨迹规划 (Safe Trajectory Planning Using Reinforcement Learning for Self Driving) - 专知论文

会员服务 ·

0

学成 · 强化学习 · 优化器 · 无人驾驶 · 回合 ·

2020 年 11 月 9 日

Safe Trajectory Planning Using Reinforcement Learning for Self Driving

翻译：安全轨迹规划,利用加强学习自驾驶驾驶的安全轨迹规划

Josiah Coad,Zhiqian Qiao,John M. Dolan

from arxiv, 7 pages, 5 figures

Self-driving vehicles must be able to act intelligently in diverse and difficult environments, marked by high-dimensional state spaces, a myriad of optimization objectives and complex behaviors. Traditionally, classical optimization and search techniques have been applied to the problem of self-driving; but they do not fully address operations in environments with high-dimensional states and complex behaviors. Recently, imitation learning has been proposed for the task of self-driving; but it is labor-intensive to obtain enough training data. Reinforcement learning has been proposed as a way to directly control the car, but this has safety and comfort concerns. We propose using model-free reinforcement learning for the trajectory planning stage of self-driving and show that this approach allows us to operate the car in a more safe, general and comfortable manner, required for the task of self driving.

翻译：自驾车辆必须能够在多样化和困难的环境中明智地行动,其特点是高度的状态空间、各种优化目标和复杂的行为。传统上,典型的优化和搜索技术已经应用到自驾车问题上;但是它们并没有完全解决高度状态和复杂行为环境中的操作问题。最近,为自行驾驶的任务提出了仿造学习建议;但获得足够的培训数据需要花费大量人力。强化学习已被提议为直接控制汽车的一种方法,但有安全和舒适的担忧。我们提议在自行驾驶的轨迹规划阶段使用无模型强化学习,并表明这种方法允许我们以更安全、普遍和舒适的方式驾驶汽车,这是自行驾驶任务所需要的。

0

相关内容

【UIUC】最新《自监督学习》教程，51页ppt，Self-supervised learning

【UIUC】最新《自监督学习》教程，51页ppt，Self-supervised learning

专知会员服务

84+阅读 · 2020年11月25日

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

专知会员服务

84+阅读 · 2020年2月18日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

【斯坦福大学】Gradient Surgery for Multi-Task Learning

【斯坦福大学】Gradient Surgery for Multi-Task Learning

专知会员服务

47+阅读 · 2020年1月23日

【AAAI 2019 Tutorial】城市交通控制的规划与调度方法（Planning and Scheduling Approaches for Urban Traffic Control），Scott Sanner，Mauro Vallati，Stephen F. Smith

【AAAI 2019 Tutorial】城市交通控制的规划与调度方法（Planning and Scheduling Approaches for Urban Traffic Control），Scott Sanner，Mauro Vallati，Stephen F. Smith

专知会员服务

8+阅读 · 2019年11月18日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

MIT新书《强化学习与最优控制》

MIT新书《强化学习与最优控制》

专知会员服务

281+阅读 · 2019年10月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

RL 真经

CreateAMind

5+阅读 · 2018年12月28日

spinningup.openai 强化学习资源完整

spinningup.openai 强化学习资源完整

CreateAMind

6+阅读 · 2018年12月17日

carla 学习笔记

carla 学习笔记

CreateAMind

9+阅读 · 2018年2月7日

carla无人驾驶模拟中文项目 carla_simulator_Chinese

carla无人驾驶模拟中文项目 carla_simulator_Chinese

CreateAMind

3+阅读 · 2018年1月30日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

A Survey of Deep RL and IL for Autonomous Driving Policy Learning

Arxiv

0+阅读 · 2021年1月6日

Dynamic Prioritization for Conflict-Free Path Planning of Multi-Robot Systems

Arxiv

0+阅读 · 2021年1月6日

Robot Learning with Crash Constraints

Arxiv

0+阅读 · 2021年1月5日

Discovering Reinforcement Learning Algorithms

Arxiv

0+阅读 · 2021年1月5日

CARL: Controllable Agent with Reinforcement Learning for Quadruped Locomotion

Arxiv

0+阅读 · 2021年1月5日

Improving Training Result of Partially Observable Markov Decision Process by Filtering Beliefs

Arxiv

0+阅读 · 2021年1月5日

A Hybrid Learner for Simultaneous Localization and Mapping

A Hybrid Learner for Simultaneous Localization and Mapping

Arxiv

0+阅读 · 2021年1月4日

Crowd-Driven Mapping, Localization and Planning

Arxiv

0+阅读 · 2021年1月3日

Online 3D Bin Packing with Constrained Deep Reinforcement Learning

Arxiv

0+阅读 · 2021年1月2日

Inverse reinforcement learning for autonomous navigation via differentiable semantic mapping and planning

Arxiv

0+阅读 · 2021年1月1日

VIP会员

文章信息

相关主题

相关VIP内容

【UIUC】最新《自监督学习》教程，51页ppt，Self-supervised learning

【UIUC】最新《自监督学习》教程，51页ppt，Self-supervised learning

专知会员服务

84+阅读 · 2020年11月25日

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

专知会员服务

84+阅读 · 2020年2月18日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

【斯坦福大学】Gradient Surgery for Multi-Task Learning

【斯坦福大学】Gradient Surgery for Multi-Task Learning

专知会员服务

47+阅读 · 2020年1月23日

【AAAI 2019 Tutorial】城市交通控制的规划与调度方法（Planning and Scheduling Approaches for Urban Traffic Control），Scott Sanner，Mauro Vallati，Stephen F. Smith

【AAAI 2019 Tutorial】城市交通控制的规划与调度方法（Planning and Scheduling Approaches for Urban Traffic Control），Scott Sanner，Mauro Vallati，Stephen F. Smith

专知会员服务

8+阅读 · 2019年11月18日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

MIT新书《强化学习与最优控制》

MIT新书《强化学习与最优控制》

专知会员服务

281+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

美军“泰坦（TITAN）地面站目标系统”：是颠覆还是一场可预见的军事进步？

美空军指挥参谋学院 · 联合空中作战规划课程介绍（2025年） | 22页

一种基于视觉算法生成三维场景重建的多任务系统 | 2025最新200页

北约第十七届（2025年）网络冲突国际会议论文集 | 272页

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

RL 真经

CreateAMind

5+阅读 · 2018年12月28日

spinningup.openai 强化学习资源完整

spinningup.openai 强化学习资源完整

CreateAMind

6+阅读 · 2018年12月17日

carla 学习笔记

carla 学习笔记

CreateAMind

9+阅读 · 2018年2月7日

carla无人驾驶模拟中文项目 carla_simulator_Chinese

carla无人驾驶模拟中文项目 carla_simulator_Chinese

CreateAMind

3+阅读 · 2018年1月30日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

相关论文

A Survey of Deep RL and IL for Autonomous Driving Policy Learning

Arxiv

0+阅读 · 2021年1月6日

Dynamic Prioritization for Conflict-Free Path Planning of Multi-Robot Systems

Arxiv

0+阅读 · 2021年1月6日

Robot Learning with Crash Constraints

Arxiv

0+阅读 · 2021年1月5日

Discovering Reinforcement Learning Algorithms

Arxiv

0+阅读 · 2021年1月5日

CARL: Controllable Agent with Reinforcement Learning for Quadruped Locomotion

Arxiv

0+阅读 · 2021年1月5日

Improving Training Result of Partially Observable Markov Decision Process by Filtering Beliefs

Arxiv

0+阅读 · 2021年1月5日

A Hybrid Learner for Simultaneous Localization and Mapping

A Hybrid Learner for Simultaneous Localization and Mapping

Arxiv

0+阅读 · 2021年1月4日

Crowd-Driven Mapping, Localization and Planning

Arxiv

0+阅读 · 2021年1月3日

Online 3D Bin Packing with Constrained Deep Reinforcement Learning

Arxiv

0+阅读 · 2021年1月2日

Inverse reinforcement learning for autonomous navigation via differentiable semantic mapping and planning

Arxiv

0+阅读 · 2021年1月1日

微信扫码咨询专知VIP会员