Solar forecasting from ground-based sky images has shown great promise in reducing the uncertainty in solar power generation. With more and more sky image datasets open sourced in recent years, the development of accurate and reliable deep learning-based solar forecasting methods has seen a huge growth in potential. In this study, we explore three different training strategies for solar forecasting models by leveraging three heterogeneous datasets collected globally with different climate patterns. Specifically, we compare the performance of local models trained individually based on single datasets and global models trained jointly based on the fusion of multiple datasets, and further examine the knowledge transfer from pre-trained solar forecasting models to a new dataset of interest. The results suggest that the local models work well when deployed locally, but significant errors are observed when applied offsite. The global model can adapt well to individual locations at the cost of a potential increase in training efforts. Pre-training models on a large and diversified source dataset and transferring to a target dataset generally achieves superior performance over the other two strategies. With 80% less training data, it can achieve comparable performance as the local baseline trained using the entire dataset.
翻译:从地基天空图像进行的太阳预报在减少太阳能发电的不确定性方面显示了巨大的希望。近年来,随着越来越多的天空图像数据集的公开来源,准确和可靠的深层学习太阳预报方法的发展潜力有了巨大的增长。在本研究中,我们探索了三种不同的太阳能预报模型培训战略,利用全球收集的、气候模式不同的三个不同数据集。具体地说,我们比较了以单个数据集为基础单独培训的当地模型的性能和根据多个数据集的融合联合培训的全球模型的性能,并进一步研究了从预先训练的太阳预报模型向新的感兴趣数据集转移知识的情况。结果显示,当地模型在当地部署时效果良好,但在非现场应用时则观察到重大错误。全球模型可以很好地适应各个地点,代价是培训工作可能增加的费用。关于大型和多样化源数据集的预培训模型,以及向目标数据集的转让通常取得优异于其他两个战略的性能。培训数据减少了80%,它可以取得与使用整个数据集培训的当地基线的类似性性能。