In most modern cities, traffic congestion is one of the most salient societal challenges. Past research has shown that inserting a limited number of autonomous vehicles (AVs) within the traffic flow, with driving policies learned specifically for the purpose of reducing congestion, can significantly improve traffic conditions. However, to date these AV policies have generally been evaluated under the same limited conditions under which they were trained. On the other hand, to be considered for practical deployment, they must be robust to a wide variety of traffic conditions. This paper establishes for the first time that a multiagent driving policy can be trained in such a way that it generalizes to different traffic flows, AV penetration, and road geometries, including on multi-lane roads.
翻译:在大多数现代城市,交通拥堵是社会面临的最突出挑战之一,过去的研究表明,在交通流量中插入数量有限的自治车辆(AV),加上专门为减少交通拥堵而学习的驾驶政策,可以大大改善交通条件,然而,迄今为止,这些AV政策一般是在培训他们时的同样有限条件下进行评估的,另一方面,为了实际部署,必须在各种交通条件下予以有力部署。本文首次规定,多试剂驾驶政策可以培训,使其概括不同的交通流量、AV渗透和道路地形,包括多线路道路。