The rapid development of autonomous vehicles (AVs) holds vast potential for transportation systems through improved safety, efficiency, and access to mobility. However, the progression of these impacts, as AVs are adopted, is not well understood. Numerous technical challenges arise from the goal of analyzing the partial adoption of autonomy: partial control and observation, multi-vehicle interactions, and the sheer variety of scenarios represented by real-world networks. To shed light into near-term AV impacts, this article studies the suitability of deep reinforcement learning (RL) for overcoming these challenges in a low AV-adoption regime. A modular learning framework is presented, which leverages deep RL to address complex traffic dynamics. Modules are composed to capture common traffic phenomena (stop-and-go traffic jams, lane changing, intersections). Learned control laws are found to improve upon human driving performance, in terms of system-level velocity, by up to 57% with only 4-7% adoption of AVs. Furthermore, in single-lane traffic, a small neural network control law with only local observation is found to eliminate stop-and-go traffic - surpassing all known model-based controllers to achieve near-optimal performance - and generalize to out-of-distribution traffic densities.
翻译:自主车辆(AVs)的迅速发展通过提高安全、效率和机动性,为运输系统提供了巨大的潜力,但是,由于AVs的采用,这些影响的进展没有被人们完全理解。分析部分采用自主性的目标,产生了许多技术挑战:部分控制和观察、多车辆相互作用以及现实世界网络代表的多种多样的情景。为了揭示AV的近期影响,本条款研究了深强化学习(RL)在低AV适应制度下克服这些挑战的适宜性。介绍了一个模块化学习框架,利用深度RL处理复杂的交通动态。模块组成以捕捉共同的交通现象(中转交通阻塞、车道变化、交汇等)。据发现,在系统级速度方面,控制法可以提高人的驾驶性能,高达57%,只有4-7%的AVs被采用。此外,在单线交通中,发现一项小型神经网络控制法仅具有局部观察功能,可以消除中转流量——超过所有已知的交通模式性向外升级。