向现实世界部署交通信号控制强化学习 (Towards Real-World Deployment of Reinforcement Learning for Traffic Signal Control)

Arthur Müller,Vishal Rangras,Georg Schnittker,Michael Waldmann,Maxim Friesen,Tobias Ferfers,Lukas Schreckenberg,Florian Hufen,Jürgen Jasperneite,Marco Wiering

from arxiv, Paper was accepted by ICMLA 2021 (20th IEEE International Conference on Machine Learning and Applications). Code available under https://github.com/RL-INA/LemgoRL

Sub-optimal control policies in intersection traffic signal controllers (TSC) contribute to congestion and lead to negative effects on human health and the environment. Reinforcement learning (RL) for traffic signal control is a promising approach to design better control policies and has attracted considerable research interest in recent years. However, most work done in this area used simplified simulation environments of traffic scenarios to train RL-based TSC. To deploy RL in real-world traffic systems, the gap between simplified simulation environments and real-world applications has to be closed. Therefore, we propose LemgoRL, a benchmark tool to train RL agents as TSC in a realistic simulation environment of Lemgo, a medium-sized town in Germany. In addition to the realistic simulation model, LemgoRL encompasses a traffic signal logic unit that ensures compliance with all regulatory and safety requirements. LemgoRL offers the same interface as the wellknown OpenAI gym toolkit to enable easy deployment in existing research work. To demonstrate the functionality and applicability of LemgoRL, we train a state-of-the-art Deep RL algorithm on a CPU cluster utilizing a framework for distributed and parallel RL and compare its performance with other methods. Our benchmark tool drives the development of RL algorithms towards real-world applications.

翻译：交叉交通信号控制器(TSC)的亚最佳控制政策导致拥挤,并导致对人类健康和环境造成负面影响。交通信号控制强化学习(RL)是设计更好的控制政策的一个很有希望的方法,近年来引起了相当大的研究兴趣。然而,这一领域的大部分工作使用交通情景的简化模拟环境来培训基于RL的TSC。在现实世界交通系统中部署RL,必须消除简化模拟环境与现实世界应用软件之间的差距。因此,我们提议LemgoRLL,这是一个基准工具,用于在德国一个中等规模城镇Lemgo的现实模拟环境中将RLA代理器培训为TSC。除了现实的模拟模型外,LemgoRL包含一个交通信号逻辑单位,以确保遵守所有监管和安全要求。LemgoRL提供与众所周知的OpenAI健身工具相同的界面,以便方便现有研究工作的部署。为了展示LemgoRL的功能和适用性,我们建议用一个最先进的深RL算法对一个CPU集群进行训练,利用一个框架,用于真实的分布和平行的RL工具,并将其性与我们的发展工具进行比较。

相关内容

TSC

关注 0

服务范围涵盖服务创新研发的所有计算和软件科学技术方面。IEEE服务计算事务强调算法、数学、统计和计算方法，这些方法是服务计算的核心，是面向服务的体系结构、Web服务、业务流程集成、解决方案性能管理、服务操作和管理的新兴领域。官网地址：http://dblp.uni-trier.de/db/journals/tsc/

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日