Verifying probabilistic forecasts for extreme events is a highly active research area because popular media and public opinions are naturally focused on extreme events, and biased conclusions are readily made. In this context, classical verification methods tailored for extreme events, such as thresholded and weighted scoring rules, have undesirable properties that cannot be mitigated, and the well-known continuous ranked probability score (CRPS) is no exception. In this paper, we define a formal framework for assessing the behavior of forecast evaluation procedures with respect to extreme events, which we use to demonstrate that assessment based on the expectation of a proper score is not suitable for extremes. Alternatively, we propose studying the properties of the CRPS as a random variable by using extreme value theory to address extreme event verification. An index is introduced to compare calibrated forecasts, which summarizes the ability of probabilistic forecasts for predicting extremes. The strengths and limitations of this method are discussed using both theoretical arguments and simulations.
翻译:核实极端事件预测概率是一个非常积极的研究领域,因为大众传媒和公众舆论自然会把注意力集中在极端事件上,而且很容易得出偏颇的结论。在这方面,针对极端事件而专门设计的典型核查方法,如临界值和加权评分规则,具有无法减轻的不良性质,众所周知的连续概率分数(CRPS)也不例外。在本文件中,我们界定了评估极端事件预测评价程序行为的正式框架,我们用这个框架来证明,基于预期得分适当的评估不适合极端事件。或者,我们建议利用极端价值理论来研究CRPS作为随机变量的特性,以处理极端事件核查问题。我们采用了一种指数来比较经过校准的预测,该指数总结预测极端现象的概率预测能力。我们用理论论点和模拟来讨论这一方法的优点和局限性。