Volt-var control (VVC) is the problem of operating power distribution systems within healthy regimes by controlling actuators in power systems. Existing works have mostly adopted the conventional routine of representing the power systems (a graph with tree topology) as vectors to train deep reinforcement learning (RL) policies. We propose a framework that combines RL with graph neural networks and study the benefits and limitations of graph-based policy in the VVC setting. Our results show that graph-based policies converge to the same rewards asymptotically however at a slower rate when compared to vector representation counterpart. We conduct further analysis on the impact of both observations and actions: on the observation end, we examine the robustness of graph-based policy on two typical data acquisition errors in power systems, namely sensor communication failure and measurement misalignment. On the action end, we show that actuators have various impacts on the system, thus using a graph representation induced by power systems topology may not be the optimal choice. In the end, we conduct a case study to demonstrate that the choice of readout function architecture and graph augmentation can further improve training performance and robustness.
翻译:沃尔特瓦尔控制(VVC)是通过控制电源系统中的驱动器在健康制度内操作动力分配系统的问题; 现有的工程大多采用代表动力系统的常规常规常规(用树本图绘制)作为载体来训练深强化学习(RL)政策; 我们提议了一个框架,将RL与图形神经网络结合起来,并研究VVC设置中基于图形的政策的好处和局限性; 我们的结果表明,以图形为基础的政策与矢量代表对应方相比,在同样程度上会以同样的方式获得回报,但速度却较慢。 我们进一步分析观测和行动的影响:在观测结束时,我们检查基于图形的政策对动力系统中两个典型的数据获取错误(即传感器通信故障和测量不匹配)的稳健性。 在行动结束时,我们表明,以图表为基础的政策对系统有不同的影响,因此,使用动力系统表层学引出的图形表示方式可能不是最佳选择。 最后,我们进行一项案例研究,以证明选择读出功能结构和图形增强能力可以进一步改善培训的绩效和稳健性。