Urban intersections are prone to delays and inefficiencies due to static precedence rules and occlusions limiting the view on prioritized traffic. Existing approaches to improve traffic flow, widely known as automatic intersection management systems, are mostly based on non-learning reservation schemes or optimization algorithms. Machine learning-based techniques show promising results in planning for a single ego vehicle. This work proposes to leverage machine learning algorithms to optimize traffic flow at urban intersections by jointly planning for multiple vehicles. Learning-based behavior planning poses several challenges, demanding for a suited input and output representation as well as large amounts of ground-truth data. We address the former issue by using a flexible graph-based input representation accompanied by a graph neural network. This allows to efficiently encode the scene and inherently provide individual outputs for all involved vehicles. To learn a sensible policy, without relying on the imitation of expert demonstrations, the cooperative planning task is phrased as a reinforcement learning problem. We train and evaluate the proposed method in an open-source simulation environment for decision making in automated driving. Compared to a first-in-first-out scheme and traffic governed by static priority rules, the learned planner shows a significant gain in flow rate, while reducing the number of induced stops. In addition to synthetic simulations, the approach is also evaluated based on real-world traffic data taken from the publicly available inD dataset.
翻译:现有改进交通流量的方法(广为人知的自动交叉管理系统)主要基于非学习预订计划或优化算法; 机械学习技术在规划单一自利车辆方面显示出有希望的成果; 这项工作提议利用机器学习算法,通过联合规划多辆车,优化城市交叉路口的交通流量; 基于学习的行为规划带来了若干挑战,要求有合适的投入和产出代表以及大量的地面实况数据。我们通过使用灵活的图表输入代表制和由静态优先规则管理的流动数据,来应对前一个问题。这样可以有效地对场景进行编码,并必然为所有所涉车辆提供个别产出。 要学习明智的政策,而不依赖专家示范的仿照,合作规划任务的措辞就是一个强化学习问题。 我们从开放源模拟环境中对自动化驾驶决策的拟议方法进行培训和评价。 与先出方案和由静态优先规则监管的交通相比,学习过的计划也显示从流动率中大幅增长。