The electric grid is a key enabling infrastructure for the ambitious transition towards carbon neutrality as we grapple with climate change. With deepening penetration of renewable energy resources and electrified transportation, the reliable and secure operation of the electric grid becomes increasingly challenging. In this paper, we present PSML, a first-of-its-kind open-access multi-scale time-series dataset, to aid in the development of data-driven machine learning (ML) based approaches towards reliable operation of future electric grids. The dataset is generated through a novel transmission + distribution (T+D) co-simulation designed to capture the increasingly important interactions and uncertainties of the grid dynamics, containing electric load, renewable generation, weather, voltage and current measurements at multiple spatio-temporal scales. Using PSML, we provide state-of-the-art ML baselines on three challenging use cases of critical importance to achieve: (i) early detection, accurate classification and localization of dynamic disturbance events; (ii) robust hierarchical forecasting of load and renewable energy with the presence of uncertainties and extreme events; and (iii) realistic synthetic generation of physical-law-constrained measurement time series. We envision that this dataset will enable advances for ML in dynamic systems, while simultaneously allowing ML researchers to contribute towards carbon-neutral electricity and mobility.
翻译:随着可再生能源资源和电气化运输的深入渗透,电网的可靠和安全运行变得日益具有挑战性。本文介绍PSML,这是首创的开放进入型多尺度时间序列数据集,协助开发数据驱动机器学习(ML)方法,以实现未来电网的可靠运行。数据集是通过新型传输+分配(T+D)联合模拟生成的,旨在捕捉电网动态中日益重要的互动和不确定性,包括电荷、可再生能源发电、气象、电压和多个时空尺度的当前测量。我们利用PSML,为三个具有关键重要性的具有挑战性的使用案例提供最先进的ML基线,以便实现:(一) 及早发现、准确分类和确定动态扰动事件的地方化;(二) 以存在不确定性和极端事件的方式对负荷和可再生能源进行稳健的等级预测;(三) 以现实合成方式生成包含电网动态负荷、可再生能源、可再生能源发电、可再生能源发电、气象、电流、电压和电流测量系统,同时为碳中性测量系统提供便利。