Since conventional approaches could not adapt to dynamic traffic conditions, reinforcement learning (RL) has attracted more attention to help solve the traffic signal control (TSC) problem. However, existing RL-based methods are rarely deployed considering that they are neither cost-effective in terms of computing resources nor more robust than traditional approaches, which raises a critical research question: how to construct an adaptive controller for TSC with less training and reduced complexity based on RL-based approach? To address this question, in this paper, we (1) innovatively specify the traffic movement representation as a simple but efficient pressure of vehicle queues in a traffic network, namely efficient pressure (EP); (2) build a traffic signal settings protocol, including phase duration, signal phase number and EP for TSC; (3) design a TSC approach based on the traditional max pressure (MP) approach, namely efficient max pressure (Efficient-MP) using the EP to capture the traffic state; and (4) develop a general RL-based TSC algorithm template: efficient Xlight (Efficient-XLight) under EP. Through comprehensive experiments on multiple real-world datasets in our traffic signal settings' protocol for TSC, we demonstrate that efficient pressure is complementary to traditional and RL-based modeling to design better TSC methods. Our code is released on Github.
翻译:由于传统方法无法适应动态交通条件,强化学习(RL)吸引了更多的注意力,帮助解决交通信号控制问题,然而,现有基于RL的方法很少被采用,因为就计算资源而言,这些方法既不符合成本效益,也比传统方法更为健全,因此引起了一个关键的研究问题:如何在基于RL的方法的基础上,以较少培训和复杂性降低的方式,为TSC建立一个适应性控制器?为了解决这一问题,我们(1) 创新地将交通流动代表作为交通网络中车辆排队的简单而有效的压力,即有效压力(EP);(2) 建立交通信号设置协议,包括阶段持续时间、信号阶段编号和TSC的EP;(3) 设计基于传统最大压力(MP)方法的TSC方法,即利用EP高效率的最大压力(Efficent-MP)来捕捉交通状态;(4) 在EP下开发一个基于RL的通用的TSC算法模板:高效Xlight(Efficent-XLight) 。