A Robotic Mobile Fulfillment System is a robotised parts-to-picker system that is particularly well-suited for e-commerce warehousing. One distinguishing feature of this type of warehouse is its high storage modularity. Numerous robots are moving shelves simultaneously, and the shelves can be returned to any open location after the picking operation is completed. This work focuses on the real-time storage allocation problem to minimise the travel time of the robots. An efficient -- but computationally costly -- Monte Carlo Tree Search method is used offline to generate high-quality experience. This experience can be learned by a neural network with a proper coordinates-based features representation. The obtained neural network is used as an action predictor in several new storage policies, either as-is or in rollout and supervised tree search strategies. Resulting performance levels depend on the computing time available at a decision step and are consistently better compared to real-time decision rules from the literature.
翻译:机器人移动化填充系统是一个机器人化的部件到拾拾器系统,特别适合电子商务仓储。这类仓库的一个显著特征是储存模块性高。许多机器人同时移动架子,在采摘操作完成后,可以将架子送回任何开放地点。这项工作侧重于实时储存分配问题,以最大限度地减少机器人的旅行时间。一种高效的 -- -- 但计算成本很高的 -- -- 蒙特卡洛树搜索方法在网上使用,以产生高质量的经验。这个经验可以由具有适当坐标特征的神经网络学习。获得的神经网络在几个新的储存政策中用作行动预测器,要么作为行动预测器,要么在推出,要么在监督的树木搜索战略中。实现业绩水平取决于决策阶段可用的计算时间,并且与文献中的实时决策规则相比始终更好。