Solar forecasting from ground-based sky images using deep learning models has shown great promise in reducing the uncertainty in solar power generation. One of the biggest challenges for training deep learning models is the availability of labeled datasets. With more and more sky image datasets open sourced in recent years, the development of accurate and reliable solar forecasting methods has seen a huge growth in potential. In this study, we explore three different training strategies for deep-learning-based solar forecasting models by leveraging three heterogeneous datasets collected around the world with drastically different climate patterns. Specifically, we compare the performance of models trained individually based on local datasets (local models) and models trained jointly based on the fusion of multiple datasets from different locations (global models), and we further examine the knowledge transfer from pre-trained solar forecasting models to a new dataset of interest (transfer learning models). The results suggest that the local models work well when deployed locally, but significant errors are observed for the scale of the prediction when applied offsite. The global model can adapt well to individual locations, while the possible increase in training efforts need to be taken into account. Pre-training models on a large and diversified source dataset and transferring to a local target dataset generally achieves superior performance over the other two training strategies. Transfer learning brings the most benefits when there are limited local data. With 80% less training data, it can achieve 1% improvement over the local baseline model trained using the entire dataset. Therefore, we call on the efforts from the solar forecasting community to contribute to a global dataset containing a massive amount of imagery and displaying diversified samples with a range of sky conditions.
翻译:利用深层学习模型对地基天空图像的太阳预报显示,在减少太阳能发电的不确定性方面大有希望。培训深层学习模型的最大挑战之一是提供标签化数据集。近年来,随着越来越多的天空图像数据集的公开来源,准确和可靠的太阳预报方法的开发潜力有了巨大的增长。在这项研究中,我们探索了三个不同的培训战略,即利用世界各地收集的气候模式差异很大的三套不同数据集,为深层学习型太阳预报模型提供不同的培训战略。具体地说,我们比较了根据当地数据集(当地模型)和根据不同地点(全球模型)多个数据集合并而共同培训的模型的绩效。我们进一步审查了从事先培训过的太阳预测模型向新的感兴趣数据集(转移学习模型)的知识转移。结果显示,当地模型在本地部署时效果良好,但观测到的信号有重大错误。全球模型可以适应各个不同的地点,同时需要将培训条件的可能增加纳入考虑。在大规模和多样化的天平面模型中,在大规模和多样化的天平面上,在大规模传输数据时,通过经过培训的模型,在大规模和最多样化的源值数据上,我们进行了有限的数据学习,可以实现更高的数据。