The use of tiered warnings and multicategorical forecasts are ubiquitous in meteorological operations. Here, a flexible family of scoring functions is presented for evaluating the performance of ordered multicategorical forecasts. Each score has a risk parameter $\alpha$, selected for the specific use case, so that it is consistent with a forecast directive based on the fixed threshold probability $1-\alpha$ (equivalently, a fixed $\alpha$-quantile mapping). Each score also has use-case specific weights so that forecasters who accurately discriminate between categorical thresholds are rewarded in proportion to the weight for that threshold. A variation is presented where the penalty assigned to near misses or close false alarms is discounted, which again is consistent with directives based on fixed risk measures. The scores presented provide an alternative to many performance measures currently in use, whose optimal threshold probabilities for forecasting an event typically vary with each forecast case, and in the case of equitable scores are based around sample base rates rather than risk measures suitable for users.
翻译:分级警告和多类别预报的使用在气象作业中是普遍存在的。 这里展示了一个灵活的评分功能组合, 用于评价定单多类别预报的性能。 每个评分都有风险参数 $\ alpha$, 为特定用途案例选定, 以便与基于固定阈值1 - alpha$的预测指令( 等值, 固定alpha$- quantile 映射 ) 相一致。 每个评分也都有使用的具体权重, 以便准确区分绝对阈值的预测人员按该阈值的重量比例得到奖励。 当指定近失事或关闭假警报的处罚被打折扣时, 显示一种差异, 这又符合基于固定风险计量的指令 。 所显示的评分为目前使用的许多绩效计量提供了替代方法, 其预测事件的最佳阈值概率通常随每个预测案例而不同, 在公平评分时, 以抽样基准率为基础, 而不是适合用户的风险评估措施为基础。