Deep Reinforcement Learning (DRL) has achieved remarkable success in scenarios such as games and has emerged as a potential solution for control tasks. That is due to its ability to leverage scalability and handle complex dynamics. However, few works have targeted environments grounded in real-world settings. Indeed, real-world scenarios can be challenging, especially when faced with the high dimensionality of the state space and unknown reward function. We release a testbed consisting of an environment simulator and demonstrations of human operation concerning pump scheduling of a real-world water distribution facility to facilitate research. The pump scheduling problem can be viewed as a decision process to decide when to operate pumps to supply water while limiting electricity consumption and meeting system constraints. To provide a starting point, we release a well-documented codebase, present an overview of some challenges that can be addressed and provide a baseline representation of the problem. The code and dataset are available at https://gitlab.com/hdonancio/pumpscheduling.
翻译:深强化学习(DRL)在游戏等情景中取得了显著成功,并已成为控制任务的潜在解决方案,这是因为它有能力利用可缩放性和处理复杂动态,然而,很少有工程针对现实世界环境中的环境,事实上,现实世界情景可能具有挑战性,特别是在国家空间的高度维度和未知的奖赏功能面前。我们发布了一个测试台,由环境模拟器和人类操作演示组成,涉及实际世界水分配设施的泵排期,以便利研究。泵排期问题可被视为决定何时操作水泵以供应水,同时限制电力消耗和会议系统限制的决定程序。为了提供一个起点,我们发布了一个有详细记录的代码库,概述一些可以应对的挑战,并提供问题的基线说明。代码和数据集可在https://gitlab.com/hdonancio/pumpschedutinguling查阅。