Sky-image-based solar forecasting using deep learning has been recognized as a promising approach in reducing the uncertainty in solar power generation. However, one of the biggest challenges is the lack of massive and diversified sky image samples. In this study, we present a comprehensive survey of open-source ground-based sky image datasets for very short-term solar forecasting (i.e., forecasting horizon less than 30 minutes), as well as related research areas which can potentially help improve solar forecasting methods, including cloud segmentation, cloud classification and cloud motion prediction. We first identify 72 open-source sky image datasets that satisfy the needs of machine/deep learning. Then a database of information about various aspects of the identified datasets is constructed. To evaluate each surveyed datasets, we further develop a multi-criteria ranking system based on 8 dimensions of the datasets which could have important impacts on usage of the data. Finally, we provide insights on the usage of these datasets for different applications. We hope this paper can provide an overview for researchers who are looking for datasets for very short-term solar forecasting and related areas.
翻译:利用深层学习进行以天空图像为基础的太阳预报被公认为是减少太阳能发电不确定性的一个很有希望的方法,然而,最大的挑战之一是缺乏大规模和多样化的天空图像样本。在本研究中,我们提出对开放源地基地面图像数据集的全面调查,用于非常短期的太阳预报(即预测地平线不到30分钟),以及可能有助于改进太阳预报方法的相关研究领域,包括云分化、云分分化和云流预测。我们首先确定72个满足机器/深层学习需要的开放源天空图像数据集。然后建立一个关于所查明数据集各个方面的信息数据库。为了评估每个接受调查的数据集,我们进一步根据数据集的8个层面制定多标准排序系统,这可能对数据的使用产生重要影响。最后,我们深入了解这些数据集在不同应用中的使用情况。我们希望这份文件能够为正在寻找非常短期太阳预报和相关领域的数据集的研究人员提供一个概览。