Traffic signal control is of critical importance for the effective use of transportation infrastructures. The rapid increase of vehicle traffic and changes in traffic patterns make traffic signal control more and more challenging. Reinforcement Learning (RL)-based algorithms have demonstrated their potential in dealing with traffic signal control. However, most existing solutions require a large amount of training data, which is unacceptable for many real-world scenarios. This paper proposes a novel model-based meta-reinforcement learning framework (ModelLight) for traffic signal control. Within ModelLight, an ensemble of models for road intersections and the optimization-based meta-learning method are used to improve the data efficiency of an RL-based traffic light control method. Experiments on real-world datasets demonstrate that ModelLight can outperform state-of-the-art traffic light control algorithms while substantially reducing the number of required interactions with the real-world environment.
翻译:车辆交通量的迅速增加和交通模式的改变使得交通信号的控制越来越具有挑战性。基于强化学习的算法证明了其在处理交通信号控制方面的潜力。然而,大多数现有解决办法需要大量的培训数据,这对许多现实世界的情景来说都是不可接受的。本文件建议为交通信号控制建立一个新型基于模型的元加强学习框架(ModelLight)。在模型光线范围内,使用一套道路交叉模式和优化的元学习方法来提高基于RL的交通灯控制方法的数据效率。对现实世界数据集的实验表明,模型光能可以超越最新交通灯控制算法,同时大大减少与现实世界环境的必要互动。