Inefficient traffic control may cause numerous problems such as traffic congestion and energy waste. This paper proposes a novel multi-agent reinforcement learning method, named KS-DDPG (Knowledge Sharing Deep Deterministic Policy Gradient) to achieve optimal control by enhancing the cooperation between traffic signals. By introducing the knowledge-sharing enabled communication protocol, each agent can access to the collective representation of the traffic environment collected by all agents. The proposed method is evaluated through two experiments respectively using synthetic and real-world datasets. The comparison with state-of-the-art reinforcement learning-based and conventional transportation methods demonstrate the proposed KS-DDPG has significant efficiency in controlling large-scale transportation networks and coping with fluctuations in traffic flow. In addition, the introduced communication mechanism has also been proven to speed up the convergence of the model without significantly increasing the computational burden.
翻译:低效率的交通控制可能会造成交通堵塞和能源浪费等许多问题。本文件建议采用名为KS-DDPG(知识分享深确定政策梯度)的新型多剂强化学习方法,通过增强交通信号之间的合作实现最佳控制。通过引入知识共享促进通信协议,每个代理都能够利用所有代理收集的交通环境的集体代表性。建议的方法分别通过合成和现实世界数据集的两项实验进行评估。与最新先进的强化学习和常规运输方法进行比较表明,拟议的KS-DDPG在控制大型运输网络和应对交通流量波动方面效率很高。此外,还证明引入的通信机制可以加快模式的趋同,而不会大大增加计算负担。