An efficient and reliable multi-agent decision-making system is highly demanded for the safe and efficient operation of connected autonomous vehicles in intelligent transportation systems. Current researches mainly focus on the Deep Reinforcement Learning (DRL) methods. However, utilizing DRL methods in interactive traffic scenarios is hard to represent the mutual effects between different vehicles and model the dynamic traffic environments due to the lack of interactive information in the representation of the environments, which results in low accuracy of cooperative decisions generation. To tackle these difficulties, this research proposes a framework to enable different Graph Reinforcement Learning (GRL) methods for decision-making, and compares their performance in interactive driving scenarios. GRL methods combinate the Graph Neural Network (GNN) and DRL to achieve the better decisions generation in interactive scenarios of autonomous vehicles, where the features of interactive scenarios are extracted by the GNN, and cooperative behaviors are generated by DRL framework. Several GRL approaches are summarized and implemented in the proposed framework. To evaluate the performance of the proposed GRL methods, an interactive driving scenarios on highway with two ramps is constructed, and simulated experiment in the SUMO platform is carried out to evaluate the performance of different GRL approaches. Finally, results are analyzed in multiple perspectives and dimensions to compare the characteristic of different GRL approaches in intelligent transportation scenarios. Results show that the implementation of GNN can well represents the interaction between vehicles, and the combination of GNN and DRL is able to improve the performance of the generation of lane-change behaviors. The source code of our work can be found at https://github.com/Jacklinkk/TorchGRL.
翻译:为了在智能运输系统中安全、高效地运行连接的自主车辆,需要有一个高效和可靠的多剂决策系统。目前的研究主要侧重于深强化学习(DRL)方法。然而,在交互式交通情景中,使用DRL方法很难代表不同车辆之间的相互影响,并模拟动态交通环境,原因是在环境代表中缺乏互动信息,导致合作决策生成的准确性低。为解决这些困难,本研究提出了一个框架,以便能够使用不同的GRL系统强化学习(GRL)方法来进行决策,并比较其在交互式驾驶情景中的性能。GRL系统将GNNNO系统(GNNNN)和DRL系统组合方法组合起来,以更好地生成自动生成自动自动机动车辆的交互式情景。GNR系统在GNR系统/NUR系统版本中,通过模拟实验,最终将GRL系统版本的稳定性分析结果。