实时协作车辆协同管理非信号道路交叉口 (Real-time Cooperative Vehicle Coordination at Unsignalized Road Intersections)

Cooperative coordination at unsignalized road intersections, which aims to improve the driving safety and traffic throughput for connected and automated vehicles, has attracted increasing interests in recent years. However, most existing investigations either suffer from computational complexity or cannot harness the full potential of the road infrastructure. To this end, we first present a dedicated intersection coordination framework, where the involved vehicles hand over their control authorities and follow instructions from a centralized coordinator. Then a unified cooperative trajectory optimization problem will be formulated to maximize the traffic throughput while ensuring the driving safety and long-term stability of the coordination system. To address the key computational challenges in the real-world deployment, we reformulate this non-convex sequential decision problem into a model-free Markov Decision Process (MDP) and tackle it by devising a Twin Delayed Deep Deterministic Policy Gradient (TD3)-based strategy in the deep reinforcement learning (DRL) framework. Simulation and practical experiments show that the proposed strategy could achieve near-optimal performance in sub-static coordination scenarios and significantly improve the traffic throughput in the realistic continuous traffic flow. The most remarkable advantage is that our strategy could reduce the time complexity of computation to milliseconds, and is shown scalable when the road lanes increase.

翻译：针对非信号道路交叉口实现协同管理能够提高连接和自动驾驶汽车的驾驶安全和交通流量，近年来引起了越来越多的关注。然而，现有的大部分研究在计算上或不能充分利用道路基础设施方面存在问题。为此，我们首先提出了一个专门的路口协调框架，其中涉及的车辆交出控制权，并遵循来自集中式协调员的指令。然后，我们制定了一个统一的协同轨迹优化问题，以最大化交通吞吐量，同时确保协调系统的驾驶安全和长期稳定性。为了解决实际部署中的关键计算挑战，我们将这个非凸顺序决策问题重新转化为一个无模型马尔可夫决策过程（MDP），并在深度强化学习（DRL）框架下设计了一个双重延迟深度确定性策略梯度（TD3）的策略。模拟和实际实验表明，所提出的策略在几乎静态协调场景下可以实现接近最优的性能，并且在现实的连续交通流中显著提高交通吞吐量。最显著的优点是，我们的策略可以将计算的时间复杂度降低到毫秒级，并且在道路车道增加时具有可伸缩性。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

【硬核书】规划算法 (Planning Algorithm)，1023页pdf，Steven M. Illinois大学

专知会员服务

165+阅读 · 2022年4月10日

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

专知会员服务

23+阅读 · 2022年3月19日

维多利亚运输政策研究所“Autonomous Vehicle Implementation Predictions：Implications for Transport Planning”（自动驾驶汽车实施预测:对交通规划的影响）

专知会员服务

17+阅读 · 2022年2月16日

【硬核课】机器人学习课程，UT Austin朱玉可博士讲述自主机器人的人工智能与机器学习机器学习算法

专知会员服务

40+阅读 · 2020年9月21日