Reducing travel time alone is insufficient to support the development of future smart transportation systems. To align with the United Nations Sustainable Development Goals (UN-SDG), further reductions in fuel consumption and emissions, improvements in traffic safety, and ease of infrastructure deployment and maintenance should also be considered. Unlike existing work that optimizes the control of either traffic light signals (to improve intersection throughput) or vehicle speed (to stabilize traffic), this paper presents a multi-agent deep reinforcement learning (DRL) system called CoTV, which Cooperatively controls both Traffic light signals and connected autonomous Vehicles (CAV). CoTV can therefore balance the reduction of travel time, fuel consumption, and emissions. At the same time, CoTV is easy to deploy, since each traffic light controller cooperates with only the one CAV nearest to it on each incoming road. This enables more efficient coordination between traffic light controllers and CAVs, which in turn allows the training of CoTV to converge in large-scale multi-agent scenarios where convergence is traditionally difficult to achieve. We present the detailed system design of CoTV and demonstrate its effectiveness in a simulation study using SUMO on various grid maps and realistic urban scenarios with mixed-autonomy traffic.
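To make the "one nearest CAV per incoming road" coordination concrete, the following is a minimal sketch of how such a selection could be done in SUMO via the TraCI API. It is an illustrative assumption, not the CoTV implementation: the function name select_nearest_cavs and the vehicle type id "cav" are hypothetical, and it assumes an active TraCI connection to a running SUMO simulation.

```python
# Hypothetical sketch: for each incoming road of a traffic light, pick the CAV
# closest to the stop line. Names (select_nearest_cavs, CAV_TYPE) are assumed,
# not taken from the CoTV source code.
import traci

CAV_TYPE = "cav"  # assumed vehicle type id marking connected autonomous vehicles


def select_nearest_cavs(tls_id):
    """Return {edge_id: vehicle_id} of the CAV nearest to the intersection
    on each incoming road controlled by traffic light `tls_id`."""
    nearest = {}  # edge_id -> (vehicle_id, distance to stop line)
    for lane_id in set(traci.trafficlight.getControlledLanes(tls_id)):
        edge_id = traci.lane.getEdgeID(lane_id)
        lane_len = traci.lane.getLength(lane_id)
        for veh_id in traci.lane.getLastStepVehicleIDs(lane_id):
            if traci.vehicle.getTypeID(veh_id) != CAV_TYPE:
                continue  # skip human-driven vehicles in mixed-autonomy traffic
            # Remaining distance from the vehicle to the end of the lane,
            # i.e. roughly its distance to the intersection stop line.
            dist = lane_len - traci.vehicle.getLanePosition(veh_id)
            if edge_id not in nearest or dist < nearest[edge_id][1]:
                nearest[edge_id] = (veh_id, dist)
    return {edge: veh for edge, (veh, _) in nearest.items()}
```

Under this kind of scheme, each traffic light agent would exchange messages with at most one CAV per incoming road at any time, which is what keeps the number of interacting agents, and hence the training problem, tractable at scale.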