Urban intersections are prone to delays and inefficiencies due to static precedence rules and occlusions limiting the view on prioritized traffic. Existing approaches to improve traffic flow, widely known as automatic intersection management systems, are mostly based on non-learning reservation schemes or optimization algorithms. Machine learning-based techniques show promising results in planning for a single ego vehicle. This work proposes to leverage machine learning algorithms to optimize traffic flow at urban intersections by jointly planning for multiple vehicles. Learning-based behavior planning poses several challenges, demanding for a suited input and output representation as well as large amounts of ground-truth data. We address the former issue by using a flexible graph-based input representation accompanied by a graph neural network. This allows to efficiently encode the scene and inherently provide individual outputs for all involved vehicles. To learn a sensible policy, without relying on the imitation of expert demonstrations, the cooperative planning task is considered as a reinforcement learning problem. We train and evaluate the proposed method in an open-source simulation environment for decision making in automated driving. Compared to a first-in-first-out scheme and traffic governed by static priority rules, the learned planner shows a significant gain in flow rate, while reducing the number of induced stops. In addition to synthetic simulations, the approach is also evaluated based on real-world traffic data taken from the publicly available inD dataset.
翻译:现有改进交通流量的方法(广为人知的自动交叉管理系统)主要基于非学习性预订计划或优化算法; 机械学习技术在规划单一自利车辆方面显示出有希望的成果; 这项工作提议利用机器学习算法,通过联合规划多辆车辆,优化城市交叉路口的交通流量; 基于学习的行为规划带来了若干挑战,要求有合适的投入和产出代表以及大量的地面实况数据。我们通过使用灵活的图表化输入代表,并辅之以一个图形神经网络来解决前一个问题。这样可以有效地对场景进行编码,并必然为所有所涉车辆提供个别产出。 要学习明智的政策,而不依赖专家示范的仿照,合作规划任务被视为一个强化学习问题。 我们在开放源模拟环境中为自动驾驶决策而培训和评价拟议的方法。 与先出方案和由静态优先规则规范的交通相比,所学计划显示的交通流动率显著提高,同时从可获取的合成世界数据降低实际数字。