This article presents a novel hybrid approach using statistics and machine learning to forecast the national demand of electricity. As investment and operation of future energy systems require long-term electricity demand forecasts with hourly resolution, our mathematical model fills a gap in energy forecasting. The proposed methodology was constructed using hourly data from Ukraine's electricity consumption ranging from 2013 to 2020. To this end, we analysed the underlying structure of the hourly, daily and yearly time series of electricity consumption. The long-term yearly trend is evaluated using macroeconomic regression analysis. The mid-term model integrates temperature and calendar regressors to describe the underlying structure, and combines ARIMA and LSTM ``black-box'' pattern-based approaches to describe the error term. The short-term model captures the hourly seasonality through calendar regressors and multiple ARMA models for the residual. Results show that the best forecasting model is composed by combining multiple regression models and a LSTM hybrid model for residual prediction. Our hybrid model is very effective at forecasting long-term electricity consumption on an hourly resolution. In two years of out-of-sample forecasts with 17520 timesteps, it is shown to be within 96.83 \% accuracy.
翻译:本文提出了一种新颖的混合方法,利用统计和机器学习来预测国家电力需求。由于未来能源系统的投资和运营需要具有小时分辨率的长期电力需求预测,因此我们的数学模型填补了能源预测的空白。该方法是利用乌克兰从2013年到2020年的电力消耗的小时数据构建的。为此,我们分析了电力消耗的小时、日和年时间序列的基本结构。通过宏观经济回归分析评估了长期年趋势。中期模型通过结合温度和日历回归器来描述基本结构,并结合ARIMA和LSTM "黑箱"基于模式的方法来描述误差项。短期模型通过日历回归器和多个ARMA模型捕捉小时季节性。结果表明,最佳预测模型由多个回归模型和LSTM混合模型组成,用于残余量预测。我们的混合模型在小时分辨率下预测长期电力消耗非常有效。在17520个时间步长的两年的样本外预测中,准确率达到了96.83%。