Stochastic precipitation generators (SPGs) are a class of statistical models which generate synthetic data that can simulate dry and wet rainfall stretches for long durations. Generated precipitation time series data are used in climate projections, impact assessment of extreme weather events, and water resource and agricultural management. We construct an SPG for daily precipitation data that is specified as a semi-continuous distribution at every location, with a point mass at zero for no precipitation and a mixture of two exponential distributions for positive precipitation. Our generators are obtained as hidden Markov models (HMMs) where the underlying climate conditions form the states. We fit a 3-state HMM to daily precipitation data for the Chesapeake Bay watershed in the Eastern coast of the USA for the wet season months of July to September from 2000--2019. Data is obtained from the GPM-IMERG remote sensing dataset, and existing work on variational HMMs is extended to incorporate semi-continuous emission distributions. In light of the high spatial dimension of the data, a stochastic optimization implementation allows for computational speedup. The most likely sequence of underlying states is estimated using the Viterbi algorithm, and we identify the differences in the weather regimes associated with the states of the proposed model. Synthetic data generated from the HMM can reproduce monthly precipitation statistics as well as spatial dependency present in the historical GPM-IMERG data.
翻译:在气候预测、极端天气事件影响评估、水资源和农业管理方面,我们使用生成的降水时间序列数据。我们为每日降水数据建造了一个用于每个地点半连续分布的SPG数据,每个地点的降水量为零点质量,不降水量和两种指数分布的混合,正降水量。我们的发电机是作为隐藏的Markov模型(HMMS)获得的,其基本气候条件形成各州。我们为2000年至2019年7月至9月湿季美国东海岸Chesapeake湾流域的每日降水量数据配备了三州HMM,2000年7月至2019年7月至9月的湿季数月,我们为日降水量序列数据进行了应用。数据来自GPM-IMERG遥感数据集的半连续分布,而关于变压的当前工作则扩展为包含半连续排放分布。根据这些数据的高空间层面,我们进行了随机优化实施,以便进行计算速度。我们估计了目前全球海滨海岸地区Chesapeake Basimal(SyM)系统所生成的历史数据序列。