In traditional extreme value analysis, the bulk of the data is ignored, and only the tails of the distribution are used for inference. Extreme observations are specified as values that exceed a threshold or as maximum values over distinct blocks of time, and subsequent estimation procedures are motivated by asymptotic theory for extremes of random processes. For environmental data, nonstationary behavior in the bulk of the distribution, such as seasonality or climate change, will also be observed in the tails. To accurately model such nonstationarity, it seems natural to use the entire dataset rather than just the most extreme values. It is also common to observe different types of nonstationarity in each tail of a distribution. Most work on extremes only focuses on one tail of a distribution, but for temperature, both tails are of interest. This paper builds on a recently proposed parametric model for the entire probability distribution that has flexible behavior in both tails. We apply an extension of this model to historical records of daily mean temperature at several locations across the United States with different climates and local conditions. We highlight the ability of the method to quantify changes in the bulk and tails across the year over the past decades and under different geographic and climatic conditions. The proposed model shows good performance when compared to several benchmark models that are typically used in extreme value analysis of temperature.
翻译:在传统的极端价值分析中,大部分数据被忽略,而且只有分布的尾巴被用来进行推断。极端观测被指定为超过临界值或不同时间段的最大值,随后的估算程序被随机过程极端现象的无症状理论所驱动。对于环境数据,大部分分布中的非静止行为,例如季节性或气候变化,也将在尾部中观察到。为了准确模拟这种不常态性,使用整个数据集而非最极端的值似乎是自然的。在分布的每个尾部观察不同类别的非静止值也是常见的。大多数关于极端的工作只侧重于分布的尾部,但对于温度而言,两者都是有意义的。本文件以最近提出的一个参数模型为基础,对两种尾部都具有灵活行为的整个概率分布进行模拟。我们将这一模型推广到美国不同气候和当地条件的若干地点的日平均温度历史记录中。我们强调在分布的每一个尾部和尾部的不同尾部中观察不同类型非常态性值是常见的。关于极端性的工作仅侧重于分布的尾部,但对于温度而言,两者都是有兴趣的。本文件以最近提出的一个参数模型为基础,用以量化模型的变化,而在过去几十年里使用的是不同的气候条件下的模型。