A large fraction of major waterways have dams influencing streamflow, which must be accounted for in large-scale hydrologic modeling. However, daily streamflow prediction for basins with dams is challenging for various modeling approaches, especially at large scales. Here we took a divide-and-conquer approach to examine which types of basins could be well represented by a long short-term memory (LSTM) deep learning model using only readily-available information. We analyzed data from 3557 basins (83% dammed) over the contiguous United States and noted strong impacts of reservoir purposes, capacity-to-runoff ratio (dor), and diversion on streamflow on streamflow modeling. Surprisingly, while the LSTM model trained on a widely-used reference-basin dataset performed poorly for more non-reference basins, the model trained on the whole dataset presented a median test Nash-Sutcliffe efficiency coefficient (NSE) of 0.74, reaching benchmark-level performance. The zero-dor, small-dor, and large-dor basins were found to have distinct behaviors, so migrating models between categories yielded catastrophic results. However, training with pooled data from different sets yielded optimal median NSEs of 0.73, 0.78, and 0.71 for these groups, respectively, showing noticeable advantages over existing models. These results support a coherent, mixed modeling strategy where smaller dams are modeled as part of rainfall-runoff processes, but dammed basins must not be treated as reference ones and must be included in the training set; then, large-dor reservoirs can be represented explicitly and future work should examine modeling reservoirs for fire protection and irrigation, followed by those for hydroelectric power generation, and flood control, etc.
翻译:大部分主要水道都有影响流流的水坝,而流流则必须在大型水力模型中加以解释。然而,对有水坝流域的每日流流预测对于各种建模方法,特别是大尺度的建模方法来说具有挑战性。在这里,我们采取了一种分而治之的方法,以研究哪些类型的流域可以用长期短期内存(LSTM)的深层次学习模型来很好地代表,仅使用现成的信息。我们分析了来自毗连的美国3557个流域的数据(83%浸泡在水流中),并注意到储油量目的、能力对流比率(温度)和流流流流流量模型转移转移的强烈影响。令人惊讶的是,在广泛使用的参考流域数据集数据集中,经过培训的LSTM模型对哪些类型的流域没有很好,在整个数据集中,Nash-Sutcliffe效率系数(NSE)为0.74,达到基准水平绩效。经过处理的储油量模型、小型和大型储油层盆地必须发现有不同的行为模式,因此,在流流流流流流流流流流的类别之间转移模型中,因此产生灾难性的结果。 然而,由Gemememememememembil 和Greal 培训产生,这些模型构成的模型组成,这些模型组成,这些模型和混合数据组分别显示为最优。