Many studies confirmed that a proper traffic state representation is more important than complex algorithms for the classical traffic signal control (TSC) problem. In this paper, we (1) present a novel, flexible and efficient method, namely advanced max pressure (Advanced-MP), taking both running and queuing vehicles into consideration to decide whether to change current signal phase; (2) inventively design the traffic movement representation with the efficient pressure and effective running vehicles from Advanced-MP, namely advanced traffic state (ATS); and (3) develop a reinforcement learning (RL) based algorithm template, called Advanced-XLight, by combining ATS with the latest RL approaches, and generate two RL algorithms, namely "Advanced-MPLight" and "Advanced-CoLight" from Advanced-XLight. Comprehensive experiments on multiple real-world datasets show that: (1) the Advanced-MP outperforms baseline methods, and it is also efficient and reliable for deployment; and (2) Advanced-MPLight and Advanced-CoLight can achieve the state-of-the-art.
翻译:许多研究证实,适当的交通州代表比典型交通信号控制(TSC)问题的复杂算法更为重要。在本文中,我们(1)提出一种新的、灵活和有效的方法,即先进的最大压力(高级-MP),将运行和排队的车辆都考虑在内,以决定是否改变目前的信号阶段;(2)以来自高级-MP(即高级交通州)的高效压力和有效运行车辆(即高级交通州),以创新的方式设计交通流动代表;(3)开发一个基于强化学习(RL)的算法模板,称为高级-XLight,将苯丙胺类兴奋剂与最新的RL方法相结合,并产生两种RL算法,即“高级-MPLight”(高级-XLight)和“高级-COLight”(高级-Conloight)(高级-XLight)和“高级-XLight(高级-XLight)(高级-Xight)的“高级-Cacced-COLight)算法。关于多个真实世界数据集的综合实验表明:(1)高级-MP(高级-Conight)超越基线方法,而且对于部署来说也是有效和可靠的;(2)高级-MPLight和高级-Coight(Cight)能够实现最新的状态。