In this paper, to reduce the congestion rate at the city center and increase the quality of experience (QoE) of each user, the framework of long-range autonomous valet parking (LAVP) is presented, where an Autonomous Vehicle (AV) is deployed in the city, which can pick up, drop off users at their required spots, and then drive to the car park out of city center autonomously. In this framework, we aim to minimize the overall distance of the AV, while guarantee all users are served, i.e., picking up, and dropping off users at their required spots through optimizing the path planning of the AV and number of serving time slots. To this end, we first propose a learning based algorithm, which is named as Double-Layer Ant Colony Optimization (DL-ACO) algorithm to solve the above problem in an iterative way. Then, to make the real-time decision, while consider the dynamic environment (i.e., the AV may pick up and drop off users from different locations), we further present a deep reinforcement learning (DRL) based algorithm, which is known as deep Q network (DQN). The experimental results show that the DL-ACO and DQN-based algorithms both achieve the considerable performance.
翻译:在本文中,为了降低市中心的拥堵率,提高每个用户的经验质量(QoE),提出了长距离自动泊车(LAVP)框架(LAVP),在市内部署一个自动泊车(AV),可以接收、在所需地点让用户下车,然后开车到市中心的停车场。在此框架内,我们的目标是尽量减少AV的总体距离,同时通过优化AV和许多服务时间档的路径规划,保证所有用户都能在他们需要的地点接驳和丢弃用户。为此,我们首先提出基于学习的算法,称为双轨自动泊车(DL-ACO)算法,以迭接方式解决上述问题。然后,我们进行实时决策,同时考虑动态环境(即AV可以从不同地点接驳和丢弃用户),我们进一步介绍基于深度强化学习(DRL)的DL算法(DRN),该算法被称为双深级运算(DQ)的实验性能网络。