This paper is a technical overview of DeepMind and Google's recent work on reinforcement learning for controlling commercial cooling systems. Building on expertise that began with cooling Google's data centers more efficiently, we recently conducted live experiments on two real-world facilities in partnership with Trane Technologies, a building management system provider. These live experiments had a variety of challenges in areas such as evaluation, learning from offline data, and constraint satisfaction. Our paper describes these challenges in the hope that awareness of them will benefit future applied RL work. We also describe the way we adapted our RL system to deal with these challenges, resulting in energy savings of approximately 9% and 13% respectively at the two live experiment sites.
翻译:本文对DeepMind和Google最近关于强化学习以控制商业冷却系统的工作进行了技术回顾。 我们最近与建筑管理系统提供商Trane Technologies合作,对两个真实世界的设施进行了现场实验。 这些现场实验在评估、从离线数据中学习和限制满意度等领域面临各种挑战。我们的文件描述了这些挑战,希望对这些挑战的认识将有益于未来应用RL的工作。我们还描述了我们如何调整RL系统以应对这些挑战,从而在两个现场实验地点分别节省了大约9%和13%的能源。