Reinforcement learning (RL) has recently been applied to challenging decision-making problems in automated driving. However, a main drawback of such RL-based policies is the lack of safety guarantees: they strive to reduce the expected number of collisions but still tolerate them. In this paper, we propose an efficient RL-based decision-making pipeline for safe and cooperative automated driving in merging scenarios. The RL agent is able to assess the current situation and provide high-level decisions, specifying the operation mode of the low-level planner, which is responsible for safety. In order to learn a more generic policy, we propose a scalable RL architecture for the merging scenario that is insensitive to changes in the environment configuration. According to our experiments, the proposed RL agent can efficiently identify cooperative drivers from their vehicle-state histories and generate interactive maneuvers, resulting in faster and more comfortable automated driving. At the same time, thanks to the safety constraints inside the planner, all maneuvers remain collision-free.
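The pipeline described above can be summarized as a hierarchy: a learned policy proposes a high-level operation mode from observed vehicle-state histories, and a safety-constrained low-level planner decides whether to execute it. The following is a minimal sketch of that structure, not the authors' implementation; the class names, the two operation modes, the heuristic stand-in for the learned policy, and the gap-based safety check are all illustrative assumptions.

```python
from dataclasses import dataclass
from enum import Enum, auto
from typing import Sequence


class Mode(Enum):
    MERGE_NOW = auto()  # exploit a cooperative driver and merge early
    YIELD = auto()      # conservative fallback: wait for a larger gap


@dataclass
class VehicleState:
    position: float  # longitudinal position along the merge lane [m]
    speed: float     # [m/s]


def rl_high_level_decision(history: Sequence[VehicleState]) -> Mode:
    """Stand-in for the learned policy: in the paper a trained agent infers
    cooperativeness from the state history; here a trivial heuristic
    (decelerating driver => cooperative) serves as a placeholder."""
    if len(history) >= 2 and history[-1].speed < history[0].speed:
        return Mode.MERGE_NOW
    return Mode.YIELD


def safe_gap(ego: VehicleState, other: VehicleState,
             min_gap: float = 10.0, headway: float = 1.5) -> bool:
    """Hypothetical hard safety constraint inside the low-level planner:
    require a minimum gap plus a speed-dependent headway to the other car."""
    return abs(other.position - ego.position) >= min_gap + headway * ego.speed


def plan(ego: VehicleState, other: VehicleState, mode: Mode) -> str:
    """Low-level planner: honors the RL mode only when the safety check
    passes, so every executed maneuver stays collision-free."""
    if mode is Mode.MERGE_NOW and safe_gap(ego, other):
        return "merge"
    return "keep_lane"  # safe fallback overrides the RL proposal


if __name__ == "__main__":
    # Other vehicle is slowing down, so the policy proposes an early merge.
    history = [VehicleState(0.0, 20.0), VehicleState(18.0, 17.0)]
    mode = rl_high_level_decision(history)
    action = plan(ego=VehicleState(5.0, 15.0), other=history[-1], mode=mode)
    print(mode.name, "->", action)  # the gap is still too small: "keep_lane"
```

Note how the division of labor mirrors the abstract's claim: the RL output only selects the operation mode, while the planner's safety constraint has the final say, so a wrong prediction degrades comfort or speed but never safety.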