Modern utility-scale wind turbines are equipped with a Supervisory Control And Data Acquisition (SCADA) system gathering vast amounts of operational data that can be used for analysis to improve operation and maintenance of turbines. We analyze high frequency SCADA-data from the Thanet offshore wind farm in the UK and evaluate Pearson correlation matrices for a variety of observables with a moving time window. This renders possible an assessment of non-stationarity in mutual dependencies of different types of data. Drawing from our experience in other complex systems, such as financial markets and traffic, we show this by employing a hierarchical $k$-means clustering algorithm on the correlation matrices. The different clusters exhibit distinct typical correlation structures to which we refer as states. Looking first at only one and later at multiple turbines, the main dependence of these states is shown to be on wind speed. In accordance, we identify them as operational states arising from different settings in the turbine control system based on the available wind speed. We model the boundary wind speeds separating the states based on the clustering solution. This allows the usage of our methodology as a pre-processing for analysis, e.g. failure detection or prediction, by sorting new data based on wind speed and comparing it to the respective operational state, thereby taking the non-stationarity into account.
翻译:现代公用事业型风力涡轮机配备了监督控制和数据采集系统(SCADA),该系统收集了大量业务数据,可用于分析改进涡轮机的运行和维护。我们分析英国Thanet离岸风力农场的高频SCADA数据,并用移动时间窗口评估各种可观测的Pearson相关矩阵。这样就可以评估不同类型数据的相互依存性不常态性。我们根据在金融市场和交通等其他复杂系统中的经验,通过在相关矩阵上使用一个等级的美元手段组合算法来显示这一点。不同的组群显示了不同典型的典型相关结构,我们称其为状态。首先在多个涡轮机上只看一个或以后,这些状态的主要依赖是风速。根据现有风速,我们将这些状态确定为涡轮机控制系统不同环境产生的运行状态。我们根据集群解决方案对边界风速进行区分,这样就可以使用我们的方法作为分析的预处理前方法,例如故障探测或运行速度预测,从而将新数据进行对比,从而将速度和运行状态纳入不同的状态。