Future power systems will rely heavily on micro grids with a high share of decentralised renewable energy sources and energy storage systems. The high complexity and uncertainty in this context might make conventional power dispatch strategies infeasible. Reinforcement-learning based (RL) controllers can address this challenge, however, cannot themselves provide safety guarantees, preventing their deployment in practice. To overcome this limitation, we propose a formally validated RL controller for economic dispatch. We extend conventional constraints by a time-dependent constraint encoding the islanding contingency. The contingency constraint is computed using set-based backwards reachability analysis and actions of the RL agent are verified through a safety layer. Unsafe actions are projected into the safe action space while leveraging constrained zonotope set representations for computational efficiency. The developed approach is demonstrated on a residential use case using real-world measurements.
翻译:未来电力系统将严重依赖分散式可再生能源和能源储存系统比例很高的微型电网; 这方面的高度复杂性和不确定性可能使常规的电力发送战略变得不可行; 以强化学习为基础的控制器本身无法提供安全保障,从而无法在实际中防止部署这些系统; 为了克服这一限制,我们提议为经济发送建立一个正式有效的RL控制器。 我们通过对岛屿应急情况进行基于时间的限制来扩大常规限制。 应急限制是通过基于固定的后向可达性分析来计算的,通过安全层来核查RL代理器的行动。 预测安全行动空间将采取不安全行动,同时利用受限制的zononoope为计算效率设定代表器。 在使用现实世界测量的住宅使用案例中,演示了发达的方法。