Discretionary lane-change is one of the critical challenges for autonomous vehicle (AV) design due to its significant impact on traffic efficiency. Existing intelligent lane-change solutions have primarily focused on optimizing the performance of the ego-vehicle, thereby suffering from limited generalization performance. Recent research has seen an increased interest in multi-agent reinforcement learning (MARL)-based approaches to address the limitation of the ego vehicle-based solutions through close coordination of multiple agents. Although MARL-based approaches have shown promising results, the potential impact of lane-change decisions on the overall traffic flow of a road segment has not been fully considered. In this paper, we present a novel hybrid MARL-based intelligent lane-change system for AVs designed to jointly optimize the local performance for the ego vehicle, along with the global performance focused on the overall traffic flow of a given road segment. With a careful review of the relevant transportation literature, a novel state space is designed to integrate both the critical local traffic information pertaining to the surrounding vehicles of the ego vehicle, as well as the global traffic information obtained from a road-side unit (RSU) responsible for managing a road segment. We create a reward function to ensure that the agents make effective lane-change decisions by considering the performance of the ego vehicle and the overall improvement of traffic flow. A multi-agent deep Q-network (DQN) algorithm is designed to determine the optimal policy for each agent to effectively cooperate in performing lane-change maneuvers. LCS-TF's performance was evaluated through extensive simulations in comparison with state-of-the-art MARL models. In all aspects of traffic efficiency, driving safety, and driver comfort, the results indicate that LCS-TF exhibits superior performance.
翻译:现有智能车道更换办法主要侧重于优化自驾驶车的性能,从而受到有限的概括性表现的影响。最近的研究显示,人们越来越关注多剂强化学习(MARL)办法,通过多物剂的密切协调,解决自驾驶车辆解决办法的局限性。虽然基于MARL的办法已经显示出有希望的结果,但车道改变决定对公路段整个交通流量的潜在影响尚未得到充分考虑。在本文件中,我们为自动驾驶车提出了一个新型混合的以MARL为基础的智能更换车道系统,目的是共同优化自驾驶车的当地性能,同时注重特定路段的总体交通流量。经过对相关运输文献的仔细审查,一个新的州空间旨在整合与自驾驶车辆周围车辆有关的当地交通信息,以及从路边单位获得的全球交通信息,负责管理公路段。我们为自驾驶车道的深度改变机动车道更换新车道,我们有效地将自驾驶车道改进机动车道的性业绩调整功能,通过使用自驾驶车道改进机动车道的每条路段进行最佳性表现评估。</s>