Many studies confirmed that a proper traffic state representation is more important than complex algorithms for the classical traffic signal control (TSC) problem. In this paper, we (1) present a novel, flexible and efficient method, namely advanced max pressure (Advanced-MP), taking both running and queuing vehicles into consideration to decide whether to change current signal phase; (2) inventively design the traffic movement representation with the efficient pressure and effective running vehicles from Advanced-MP, namely advanced traffic state (ATS); and (3) develop a reinforcement learning (RL) based algorithm template, called Advanced-XLight, by combining ATS with the latest RL approaches, and generate two RL algorithms, namely "Advanced-MPLight" and "Advanced-CoLight" from Advanced-XLight. Comprehensive experiments on multiple real-world datasets show that: (1) the Advanced-MP outperforms baseline methods, and it is also efficient and reliable for deployment; and (2) Advanced-MPLight and Advanced-CoLight can achieve the state-of-the-art
翻译:许多研究确认,适当的交通州代表比典型交通信号控制(TSC)问题的复杂算法更为重要。在本文中,我们(1) 提出了一个新型、灵活和有效的方法,即先进的最大压力(高级-MP),将运行和排队的车辆都考虑在内,以决定是否改变目前的信号阶段;(2) 以来自高级-MP(高级交通州)的高效压力和有效运行的车辆(即高级交通州),即高级交通信号控制(ATS)的高效压力和有效运行车辆,以发明一个基于强化学习(RL)的算法模板,称为高级-XLight(高级-XLight),将苯丙胺类兴奋剂与最新的RL方法相结合,并产生两种RL算法,即“高级-MPLight”(高级-XLight)和“高级-Coight(高级-XLight)”,关于多个真实世界数据集的综合实验表明:(1) 高级MP-MP(高级-Conight)超越基线方法,而且对于部署也是有效和可靠的;(2) 高级-MP-Light和高级-Coight(高级-Cight)能够实现最新状态。