Concentrated solar power (CSP) is one of the growing technologies that is leading the process of changing from fossil fuels to renewable energies. The sophistication and size of the systems require an increase in maintenance tasks to ensure reliability, availability, maintainability and safety. Currently, automatic fault detection in CSP plants using Parabolic Trough Collector systems evidences two main drawbacks: 1) the devices in use needs to be manually placed near the receiver tube, 2) the Machine Learning-based solutions are not tested in real plants. We address both gaps by combining the data extracted with the use of an Unmaned Aerial Vehicle, and the data provided by sensors placed within 7 real plants. The resulting dataset is the first one of this type and can help to standardize research activities for the problem of fault detection in this type of plants. Our work proposes supervised machine-learning algorithms for detecting broken envelopes of the absorber tubes in CSP plants. The proposed solution takes the class imbalance problem into account, boosting the accuracy of the algorithms for the minority class without harming the overall performance of the models. For a Deep Residual Network, we solve an imbalance and a balance problem at the same time, which increases by 5% the Recall of the minority class with no harm to the F1-score. Additionally, the Random Under Sampling technique boost the performance of traditional Machine Learning models, being the Histogram Gradient Boost Classifier the algorithm with the highest increase (3%) in the F1-Score. To the best of our knowledge, this paper is the first providing an automated solution to this problem using data from operating plants.
翻译:集中太阳能发电(CSP)是引导从化石燃料向可再生能源转变过程的不断发展的技术之一。系统的复杂性和规模要求增加维护任务,以确保可靠性、可用性、可维持性和安全性。目前,使用Parabolic Trough收集器系统的CSP工厂自动发现故障证明两个主要缺点:1)使用中的装置需要人工放在接收管附近;2)机器学习解决方案没有在实际工厂中测试。我们通过将从自动化机车中提取的数据与7个实际工厂内传感器提供的数据结合起来,来解决这两个差距。由此产生的数据集是这类工厂中的第一个,有助于使这类工厂中发现故障问题的研究活动标准化。我们的工作提出了监督的机器学习算法,用于检测CSP工厂中吸收器管的破碎信封;2)机器学习解决方案没有在实际工厂中测试。拟议解决方案将班级失衡问题考虑在内,提高少数群体类的算法的准确性,而不会损害模型的整体性能。对于深固化的服务器来说,我们用一种传统方法解决了这种类型数据型数据集的不平衡性能,而采用最高级的Frilli程则提高。