Nowadays, fast delivery services have created the need for high-density warehouses. The puzzle-based storage system is a practical way to enhance the storage density, however, facing difficulties in the retrieval process. In this work, a deep reinforcement learning algorithm, specifically the Double&Dueling Deep Q Network, is developed to solve the multi-item retrieval problem in the system with general settings, where multiple desired items, escorts, and I/O points are placed randomly. Additionally, we propose a general compact integer programming model to evaluate the solution quality. Extensive numerical experiments demonstrate that the reinforcement learning approach can yield high-quality solutions and outperforms three related state-of-the-art heuristic algorithms. Furthermore, a conversion algorithm and a decomposition framework are proposed to handle simultaneous movement and large-scale instances respectively, thus improving the applicability of the PBS system.
翻译:目前,快速交付服务已经产生了对高密度仓库的需求。 以谜题为基础的存储系统是提高存储密度的实用方法,然而,在检索过程中遇到了困难。 在这项工作中,开发了一种深强化学习算法,特别是双驱动深Q网络,以解决系统中的多项目检索问题,一般设置为多个需要的物品、护送和I/O点随机放置。此外,我们提议了一个通用的常规整数编程模型来评估解决方案的质量。 广泛的数字实验表明,强化学习方法可以产生高质量的解决方案,并优于三种相关的最新超值算法。 此外,还提出了转换算法和分解框架,分别处理同时移动和大规模事件,从而改进了PBS系统的适用性。