The stochastic nature of photovoltaic (PV) power has led both academia and industry to a large amount of research work aiming at the development of accurate PV power forecasting models. However, most of those models are based on machine learning algorithms and are considered as black boxes which do not provide any insight or explanation about their predictions. Therefore, their direct implementation in environments, where transparency is required, and the trust associated with their predictions may be questioned. To this end, we propose a two stage probabilistic forecasting framework able to generate highly accurate, reliable, and sharp forecasts yet offering full transparency on both the point forecasts and the prediction intervals (PIs). In the first stage, we exploit natural gradient boosting (NGBoost) for yielding probabilistic forecasts while in the second stage, we calculate the Shapley additive explanation (SHAP) values in order to fully understand why a prediction was made. To highlight the performance and the applicability of the proposed framework, real data from two PV parks located in Southern Germany are employed. Initially, the natural gradient boosting is thoroughly compared with two state-of-the-art algorithms, namely Gaussian process and lower upper bound estimation, in a wide range of forecasting metrics. Secondly, a detailed analysis of the model's complex nonlinear relationships and interaction effects between the various features is presented. The latter allows us to interpret the model, identify some learned physical properties, explain individual predictions, reduce the computational requirements for the training without jeopardizing the model accuracy, detect possible bugs, and gain trust in the model. Finally, we conclude that the model was able to develop nonlinear relationships following human logic and intuition based on learned physical properties.
翻译:光电(PV)动力的随机性质导致学术界和产业界进行大量旨在开发准确的光电预测模型的研究工作,但这些模型大多以机器学习算法为基础,被视为没有对其预测提供任何洞察力或解释的黑盒。因此,光电(PV)动力在环境中的直接实施,需要透明度,与其预测相关的信任可能受到质疑。为此,我们提议一个两个阶段的概率预测框架,能够产生高度准确、可靠和尖锐的预测,但能为点预测和预测间隔(PIs)带来充分的透明度。在第一阶段,我们利用自然梯度加速(NGBoost)来产生概率预测,而在第二阶段,我们计算这些模型在环境直接实施,因为环境环境需要透明度,而与预测有关的信任。为此,我们提议一个阶段的概率预测框架的性能和适用性能预测框架,来自位于南德国的两个光电园的真实模型,其真实数据能够产生高度准确性,但能的预测间隔期和预测间隔期(PIPs)都提供了完全的透明度。在第一阶段,我们利用自然梯度提升(NGBoost)的加速加速(NGBoost)推算法(NGeal)来产生概率预测预估测非概率特征特征特征特征特征,在第二阶段的物理测算结果中,最终分析中,从而可以得出各种的轨测算结果。