Heating in private households is a major contributor to the emissions generated today. Heat pumps are a promising alternative for heat generation and are a key technology in achieving our goals of the German energy transformation and to become less dependent on fossil fuels. Today, the majority of heat pumps in the field are controlled by a simple heating curve, which is a naive mapping of the current outdoor temperature to a control action. A more advanced control approach is model predictive control (MPC) which was applied in multiple research works to heat pump control. However, MPC is heavily dependent on the building model, which has several disadvantages. Motivated by this and by recent breakthroughs in the field, this work applies deep reinforcement learning (DRL) to heat pump control in a simulated environment. Through a comparison to MPC, it could be shown that it is possible to apply DRL in a model-free manner to achieve MPC-like performance. This work extends other works which have already applied DRL to building heating operation by performing an in-depth analysis of the learned control strategies and by giving a detailed comparison of the two state-of-the-art control methods.
翻译:热泵是热能发电的一个很有希望的替代方法,也是实现德国能源转型目标的关键技术,并且越来越不依赖化石燃料。今天,实地大多数热泵都由简单的热量曲线控制,这是对当前室外温度的天真图象,可以进行控制行动。一种更先进的控制方法是模型预测控制(MPC),在多个研究项目中用于热泵控制。然而,热泵控制在很大程度上依赖于建筑模型,这有几个缺点。受这一模型和最近实地突破的驱动,这项工作将深度强化学习(DRL)应用于模拟环境中的热泵控制。通过与MPC比较,可以证明可以以无模式的方式应用DRL实现像MPC一样的性能。这项工作扩展了已经将DRL应用于建设供暖操作的其他工程,对学习过的控制战略进行了深入分析,并对两种州级控制方法进行了详细比较。