COOR-PLT:基于深强化学习,在无信号十字路口协调接连和自主车辆的适应性排的等级控制模式 (COOR-PLT: A hierarchical control model for coordinating adaptive platoons of connected and autonomous vehicles at signal-free intersections based on deep reinforcement learning)

控制器 · Learning · CAV · MoDELS · 深度强化学习 ·

2022 年 7 月 1 日

COOR-PLT: A hierarchical control model for coordinating adaptive platoons of connected and autonomous vehicles at signal-free intersections based on deep reinforcement learning

翻译：COOR-PLT:基于深强化学习,在无信号十字路口协调接连和自主车辆的适应性排的等级控制模式

Duowei Li,Jianping Wu,Feng Zhu,Tianyi Chen,Yiik Diew Wong

from arxiv, This paper has been submitted to Transportation Research Part C: Emerging Technologies and is currently under review

Platooning and coordination are two implementation strategies that are frequently proposed for traffic control of connected and autonomous vehicles (CAVs) at signal-free intersections instead of using conventional traffic signals. However, few studies have attempted to integrate both strategies to better facilitate the CAV control at signal-free intersections. To this end, this study proposes a hierarchical control model, named COOR-PLT, to coordinate adaptive CAV platoons at a signal-free intersection based on deep reinforcement learning (DRL). COOR-PLT has a two-layer framework. The first layer uses a centralized control strategy to form adaptive platoons. The optimal size of each platoon is determined by considering multiple objectives (i.e., efficiency, fairness and energy saving). The second layer employs a decentralized control strategy to coordinate multiple platoons passing through the intersection. Each platoon is labeled with coordinated status or independent status, upon which its passing priority is determined. As an efficient DRL algorithm, Deep Q-network (DQN) is adopted to determine platoon sizes and passing priorities respectively in the two layers. The model is validated and examined on the simulator Simulation of Urban Mobility (SUMO). The simulation results demonstrate that the model is able to: (1) achieve satisfactory convergence performances; (2) adaptively determine platoon size in response to varying traffic conditions; and (3) completely avoid deadlocks at the intersection. By comparison with other control methods, the model manifests its superiority of adopting adaptive platooning and DRL-based coordination strategies. Also, the model outperforms several state-of-the-art methods on reducing travel time and fuel consumption in different traffic conditions.

翻译：平流和协调是经常提出的两个执行战略,用于在无信号十字路口而不是使用常规交通信号对连接和自主车辆(CAVs)进行交通控制,而不是在无信号十字路口使用常规交通信号,然而,很少有研究试图将这两种战略结合起来,以便更好地便利无信号十字路口的CAV控制。为此,本研究报告提议了一个等级控制模式,名为COOR-PLT,在深度强化学习(DRL)的基础上,在无信号十字路口协调适应的CAV排。COOR-PLT有一个双层框架。第一层使用集中控制战略来形成适应性排。每个排的最佳规模是通过考虑多个目标(即效率、公平性和节能性)来确定的最佳规模。第二层使用分散控制战略来协调通过无信号十字路口的多排。每个排都有协调的地位或独立地位标志,并据此决定其过往优先事项。随着一个高效的DRL算法、深QL网络(DQN)被采用来决定排规模大小和通过两个层次的优先事项。该模型经过验证和检查每个排的最佳规模,每个排的最佳规模是通过考虑多个目标(即考虑多个目标、效率、效率、效率) 模拟交通稳定度比重度比重度比重度比重、燃料速度,在城市行的平比重度比重、模拟、模拟、模拟、稳定、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟到稳定速度、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、模拟、