This paper proposes a machine learning method based on the Extra Trees (ET) algorithm for forecasting Significant Wave Heights in oceanic waters. To derive multiple features from the CDIP buoys, which make point measurements, we first nowcast various parameters and then forecast them at 30-min intervals. The proposed algorithm has Scatter Index (SI), Bias, Correlation Coefficient, Root Mean Squared Error (RMSE) of 0.130, -0.002, 0.97, and 0.14, respectively, for one day ahead prediction and 0.110, -0.001, 0.98, and 0.122, respectively, for 14-day ahead prediction on the testing dataset. While other state-of-the-art methods can only forecast up to 120 hours ahead, we extend it further to 14 days. Our proposed setup includes spectral features, hv-block cross-validation, and stringent QC criteria. The proposed algorithm performs significantly better than the state-of-the-art methods commonly used for significant wave height forecasting for one-day ahead prediction. Moreover, the improved performance of the proposed machine learning method compared to the numerical methods shows that this performance can be extended to even longer periods allowing for early prediction of significant wave heights in oceanic waters.
翻译:本文建议了一种基于预测大洋水域重大波高的“额外树”算法的机器学习方法。为了从CCDIP浮标中得出多种特征,进行点测量,我们先是先播各种参数,然后每隔30分钟预报。提议的算法包括散射指数(SI)、比亚斯、相交系数、根正正正方错误(RMSE)分别为0.130、0.002、0.97和0.14,分别用于提前一天预测,以及用于提前14天预测测试数据集的0.110、0.001、0.98和0.122。其他最先进的方法只能提前120小时预测,但我们将它进一步扩展至14天。我们提议的算法包括光谱特征、hv-区交叉校准和严格的QC标准。拟议的算法比用于提前一天预测重大波高预报的通常采用的最新方法要好得多。此外,与数字方法相比,拟议的机器学习方法的性能可以改进,表明这种性能可以延长到甚至更长的时期,从而允许在海洋中早期预测重要海浪的早期。