The challenges in applications of solar energy lies in its intermittency and dependency on meteorological parameters such as; solar radiation, ambient temperature, rainfall, wind-speed etc., and many other physical parameters like dust accumulation etc. Hence, it is important to estimate the amount of solar photovoltaic (PV) power generation for a specific geographical location. Machine learning (ML) models have gained importance and are widely used for prediction of solar power plant performance. In this paper, the impact of weather parameters on solar PV power generation is estimated by several Ensemble ML (EML) models like Bagging, Boosting, Stacking, and Voting for the first time. The performance of chosen ML algorithms is validated by field dataset of a 10kWp solar PV power plant in Eastern India region. Furthermore, a complete test-bed framework has been designed for data mining as well as to select appropriate learning models. It also supports feature selection and reduction for dataset to reduce space and time complexity of the learning models. The results demonstrate greater prediction accuracy of around 96% for Stacking and Voting EML models. The proposed work is a generalized one and can be very useful for predicting the performance of large-scale solar PV power plants also.
翻译:太阳能应用的挑战在于其间歇性和依赖气象参数,如太阳辐射、环境温度、降雨、风速等,以及灰尘积累等许多其他物理参数。 因此,必须估计某一具体地理位置的太阳能光伏发电量。机器学习模型已变得日益重要,并广泛用于预测太阳能发电厂的性能。在本文件中,天气参数对太阳能光伏发电的影响是由若干综合ML(EML)模型估计的,例如拉动、推动、粉碎、粉碎和首次投票等。选定的ML算法的性能由印度东部地区10kWp太阳能光伏发电站的实地数据集验证。此外,为数据挖掘和选择适当的学习模型设计了一个完整的试验床框架。它还支持为数据集选择和减少特征,以减少学习模型的空间和时间复杂性。结果显示,在粉刷和投票的EML模型中,大约96%的预测准确性更高。拟议的工作也是对大规模光电转换模型的大规模性能的预测。