Sub-optimal control policies in intersection traffic signal controllers (TSC) contribute to congestion and lead to negative effects on human health and the environment. Reinforcement learning (RL) for traffic signal control is a promising approach to design better control policies and has attracted considerable research interest in recent years. However, most work done in this area used simplified simulation environments of traffic scenarios to train RL-based TSC. To deploy RL in real-world traffic systems, the gap between simplified simulation environments and real-world applications has to be closed. Therefore, we propose LemgoRL, a benchmark tool to train RL agents as TSC in a realistic simulation environment of Lemgo, a medium-sized town in Germany. In addition to the realistic simulation model, LemgoRL encompasses a traffic signal logic unit that ensures compliance with all regulatory and safety requirements. LemgoRL offers the same interface as the well-known OpenAI gym toolkit to enable easy deployment in existing research work. Our benchmark tool drives the development of RL algorithms towards real-world applications. We provide LemgoRL as an open-source tool at https://github.com/rl-ina/lemgorl.
翻译:交叉交通信号控制器(TSC)的亚最佳控制政策导致拥挤,并导致对人类健康和环境产生消极影响。交通信号控制强化学习(RL)是设计更好的控制政策的一个很有希望的方法,近年来引起了相当大的研究兴趣。然而,这一领域的大部分工作使用交通情景的简化模拟环境来培训基于RL的TSC。在现实世界交通系统中部署RL时,必须消除简化模拟环境与现实世界应用之间的差距。因此,我们提议LemgoRL(LemgoRL)是一个基准工具,用于在德国中等规模城镇Lemgo的现实模拟环境中将RLA代理器培训为TSC。除了现实的模拟模型外,LemgoRL包括一个交通信号逻辑单位,以确保遵守所有监管和安全要求。LemgoRL提供与众所周知的OpenAI健身工具相同的界面,以便于现有研究工作的部署。我们的基准工具将RLA算法的发展推向现实世界应用。我们提供LemgoRL(L)作为在https://gthub.com/rina/leml/legor的开放源工具。