With the rise of e-commerce and increasing customer requirements, logistics service providers face a new complexity in their daily planning, mainly due to efficiently handling same day deliveries. Existing multi-stage stochastic optimization approaches that allow to solve the underlying dynamic vehicle routing problem are either computationally too expensive for an application in online settings, or -- in the case of reinforcement learning -- struggle to perform well on high-dimensional combinatorial problems. To mitigate these drawbacks, we propose a novel machine learning pipeline that incorporates a combinatorial optimization layer. We apply this general pipeline to a dynamic vehicle routing problem with dispatching waves, which was recently promoted in the EURO Meets NeurIPS Vehicle Routing Competition at NeurIPS 2022. Our methodology ranked first in this competition, outperforming all other approaches in solving the proposed dynamic vehicle routing problem. With this work, we provide a comprehensive numerical study that further highlights the efficacy and benefits of the proposed pipeline beyond the results achieved in the competition, e.g., by showcasing the robustness of the encoded policy against unseen instances and scenarios.
翻译:随着电子商务的兴起和客户需求的增加,物流服务提供商在日常计划中面临着新的复杂性,主要原因是如何有效处理同日送货。现有的多阶段随机优化方法允许解决潜在的动态车辆路径问题,但要么在在线环境中计算成本过高,要么在强化学习的情况下难以在高维组合问题上表现良好。为了缓解这些缺点,我们提出了一种新颖的机器学习流程,其中包含一个组合优化层。我们将这个通用的管道应用到一个带派遣波动的动态车辆路径问题上,这个问题最近在 NeurIPS 2022 的“欧洲与 NeurIPS 车辆路径规划竞赛”中获得了推广。我们的方法在这次比赛中排名第一,在解决所提出的动态车辆路径问题方面优于所有其他方法。通过这项工作,我们提供了一项全面的数值研究,进一步展示了所提出的管道在比赛中取得的结果之外的有效性和优势,例如展示了所编码策略对未见过的实例和场景的鲁棒性。