As an emerging technology, Connected Autonomous Vehicles are believed to pass intersections with greater efficiency, and related researches have been conducted for decades. However, compared to rule-based or model-based scheduling approaches, the distributed reinforcement learning has only begun to be applied in the field of autonomous intersection management since the past three years. The distributed learning-based method help vehicle make decisions more independently, and the adaptive adjustment of dynamic strategy in varied condition makes this method more suitable for the AIM at hybrid intersection. We propose a hierarchical decision framework with various receptive scope, action step length, and feedback period of reward, making use of information from different perspectives within different time spans and forming driving trends for vehicle to co-determine the action confrontationally at each step. The proposed model is proven effective in the experiment undertaken in a complicated intersection, and show better performance compared with baselines.
翻译:作为一种新兴技术,互联自主车辆被认为经过交叉,效率更高,而且相关研究已经进行了数十年,然而,与基于规则或基于模式的时间安排方法相比,自过去三年以来,分布式强化学习只开始应用于自主交叉管理领域,分散式学习型方法有助于车辆更独立地作出决定,动态战略的适应性调整使这种方法更适合混合交叉点的AIM。我们提议了一个等级决策框架,具有不同接受范围、行动步骤长度和反馈奖赏期,利用不同时间段的不同观点的信息,形成车辆在每一步步上共同决定行动的趋势。在复杂的交叉点进行的试验中,拟议模式证明是有效的,与基线相比,表现更好。