Groundwater level prediction is an applied time series forecasting task with important social impacts to optimize water management as well as preventing some natural disasters: for instance, floods or severe droughts. Machine learning methods have been reported in the literature to achieve this task, but they are only focused on the forecast of the groundwater level at a single location. A global forecasting method aims at exploiting the groundwater level time series from a wide range of locations to produce predictions at a single place or at several places at a time. Given the recent success of global forecasting methods in prestigious competitions, it is meaningful to assess them on groundwater level prediction and see how they are compared to local methods. In this work, we created a dataset of 1026 groundwater level time series. Each time series is made of daily measurements of groundwater levels and two exogenous variables, rainfall and evapotranspiration. This dataset is made available to the communities for reproducibility and further evaluation. To identify the best configuration to effectively predict groundwater level for the complete set of time series, we compared different predictors including local and global time series forecasting methods. We assessed the impact of exogenous variables. Our result analysis shows that the best predictions are obtained by training a global method on past groundwater levels and rainfall data.
翻译:地下水水平预测是一种应用的时间序列预测任务,具有重要的社会影响,以优化水管理并防止某些自然灾害:例如洪水或严重干旱。文献中报告了实现这项任务的机械学习方法,但这种方法仅侧重于单一地点地下水水平的预测。全球预测方法的目的是从各地点广泛利用地下水水平时间序列,以便在同一地点或某一时间的几个地点作出预测。鉴于全球预测方法最近在著名的竞赛中取得成功,因此,在地下水水平预测中评估这些方法并查看它们如何与当地方法进行比较是有意义的。在这项工作中,我们创建了一个1026地下水水平时间序列的数据集。每个时间序列都是对地下水水平的日常测量和两个外源变量、降雨量和蒸发量进行的时间序列。这一数据集提供给各社区,以便进行再现和进一步评估。为了确定整个时间序列中有效预测地下水水平的最佳配置,我们比较了不同的预测器,包括地方和全球时间序列的预测方法。我们评估了外源变量的影响。我们的成果分析表明,通过培训的降雨量和过去的数据,得到了最佳的预测。