With the emerging connected-vehicle technologies and smart roads, the need for intelligent adaptive traffic signal controls is more than ever before. This paper proposes a novel Economic-driven Adaptive Traffic Signal Control (eATSC) model with a hyper control variable - interest rate defined in economics for traffic signal control at signalized intersections. The eATSC uses a continuous compounding function that captures both the total number of vehicles and the accumulated waiting time of each vehicle to compute penalties for different directions. The computed penalties grow with waiting time and is used for signal control decisions. Each intersection is assigned two intelligent agents adjusting interest rate and signal length for different directions according to the traffic patterns, respectively. The problem is formulated as a Markov Decision Process (MDP) problem to reduce congestions, and a two-agent Double Dueling Deep Q Network (DDDQN) is utilized to solve the problem. Under the optimal policy, the agents can select the optimal interest rates and signal time to minimize the likelihood of traffic congestion. To evaluate the superiority of our method, a VISSIM simulation model with classic four-leg signalized intersections is developed. The results indicate that the proposed model is adequately able to maintain healthy traffic flow at the intersection.
翻译:由于新兴的联运车辆技术和智能道路,智能适应性交通信号控制比以往更加需要智能的适应性交通信号控制。本文件提出一个新的经济驱动的适应性交通信号控制模式(eATSC),在信号十字路口的交通信号控制经济学中定义了超控制变量-利率。eATSC使用一种连续的复合功能,既包括车辆总数,也包括每辆车辆累积的等待时间,以计算不同方向的罚款。计算罚款随着等待时间的增加而增加,并用于信号控制决定。每个交叉点都分别指定了两个智能剂,根据交通模式调整利率和信号长度。问题被表述为用于减少交通拥堵的马克夫决策程序(MDP)问题,并使用一个双剂双倍分辨深Q网络(DDDQN)解决问题。根据最佳政策,代理器可以选择最佳利率和信号时间,以尽量减少交通拥堵的可能性。为了评估我们的方法的优越性,将VISSIM模拟模型配有典型的四腿信号交叉点。结果显示,拟议的模式能够保持健康的交叉点。