托盘排排排、分配和库存补充问题单调近似动态方案编制方法:对无人机和电动车辆电池交换站的应用 (A Monotone Approximate Dynamic Programming Approach for the Stochastic Scheduling, Allocation, and Inventory Replenishment Problem: Applications to Drone and Electric Vehicle Battery Swap Stations)

2021 年 5 月 14 日

A Monotone Approximate Dynamic Programming Approach for the Stochastic Scheduling, Allocation, and Inventory Replenishment Problem: Applications to Drone and Electric Vehicle Battery Swap Stations

翻译：托盘排排排、分配和库存补充问题单调近似动态方案编制方法:对无人机和电动车辆电池交换站的应用

Amin Asadi,Sarah Nurre Pinkley

There is a growing interest in using electric vehicles (EVs) and drones for many applications. However, battery-oriented issues, including range anxiety and battery degradation, impede adoption. Battery swap stations are one alternative to reduce these concerns that allow the swap of depleted for full batteries in minutes. We consider the problem of deriving actions at a battery swap station when explicitly considering the uncertain arrival of swap demand, battery degradation, and replacement. We model the operations at a battery swap station using a finite horizon Markov Decision Process model for the stochastic scheduling, allocation, and inventory replenishment problem (SAIRP), which determines when and how many batteries are charged, discharged, and replaced over time. We present theoretical proofs for the monotonicity of the value function and monotone structure of an optimal policy for special SAIRP cases. Due to the curses of dimensionality, we develop a new monotone approximate dynamic programming (ADP) method, which intelligently initializes a value function approximation using regression. In computational tests, we demonstrate the superior performance of the new regression-based monotone ADP method as compared to exact methods and other monotone ADP methods. Further, with the tests, we deduce policy insights for drone swap stations.

翻译：使用电动车辆(EVs)和无人驾驶飞机在许多应用中引起了越来越多的兴趣。然而,以电池为导向的问题,包括范围焦虑和电池退化,阻碍了采用。电池互换站是减少这些关切的一种替代方法,这些关切允许用耗竭电池换成几分钟全电池。当明确考虑互换需求、电池退化和替换的不确定到来时,我们考虑在电池互换站采取行动的问题。我们用一个电池互换站的固定地平线Markov 决策程序模型模拟在电池互换站的操作,该模型用于随机测序、分配和库存补充问题(SAIRP),确定何时和多少电池被充电、卸电和替换。我们为特殊SAIRP案件中的价值函数和最佳政策的单项结构提供了理论证明。由于维度的诅咒,我们开发了一个新的单项近似动态编程程序(ADP)方法,该方法通过回归,明智地开始一种价值函数近似。在计算测试中,我们展示了新的基于回归的单项单项ADP方法的优异性性性性性性,与精确的变换政策、ADP方法的推算。