Mobile parcel lockers have been recently proposed by logistics operators as a technology that could help reduce traffic congestion and operational costs in urban freight distribution. Given their ability to relocate throughout their area of deployment, they hold the potential to improve customer accessibility and convenience. In this study, we formulate the Mobile Parcel Locker Problem (MPLP) , a special case of the Location-Routing Problem (LRP) which determines the optimal stopover location for MPLs throughout the day and plans corresponding delivery routes. A Hybrid Q Learning Network based Method (HQM) is developed to resolve the computational complexity of the resulting large problem instances while escaping local optima. In addition, the HQM is integrated with global and local search mechanisms to resolve the dilemma of exploration and exploitation faced by classic reinforcement learning methods. We examine the performance of HQM under different problem sizes (up to 200 nodes) and benchmarked it against the exact approach and Genetic Algorithm (GA). Our results indicate that HQM achieves better optimisation performance with shorter computation time than the exact approach solved by the Gurobi solver in large problem instances. Additionally, the average reward obtained by HQM is 1.96 times greater than GA, which demonstrates that HQM has a better optimisation ability. Further, we identify critical factors that contribute to fleet size requirements, travel distances, and service delays. Our findings outline that the efficiency of MPLs is mainly contingent on the length of time windows and the deployment of MPL stopovers. Finally, we highlight managerial implications based on parametric analysis to provide guidance for logistics operators in the context of efficient last-mile distribution operations.
翻译:物流运营商最近提议将移动包裹储物柜作为一种有助于减少交通拥堵和城市货运分配业务费用的技术,作为物流运营商最近提出的一项技术,可以帮助减少城市货运分配中的交通拥堵和运营成本。鉴于他们有能力在整个部署地区迁移,他们具有改善客户无障碍和方便性的潜力。在本研究中,我们制定了移动包裹仓储问题(MPLP),这是定位箱问题(LRP)的一个特例,它决定了全天期间MPL的最佳中途停留地点和相应的交付路线计划。我们开发了一个基于混合Q学习网络的方法(HQM),以解决由此产生的大问题案例的计算复杂性。此外,HQM与全球和地方搜索机制相结合,以解决典型强化学习方法所面临的勘探和开发困境。我们检查了不同问题规模(高达200个节点)下的HQM的绩效,根据准确的方法和遗传Algorith Algoriththmtm(GAGA),我们获得的优化计算速度比Gurobi解决问题者所解决的准确方法要短得多。此外,HM在大规模部署过程中,我们获得的运行成本分析的运行成本成本成本成本成本,我们的平均比例比比我们更精确。