A Model Intercomparison Project (MIP) consists of teams who each estimate the same underlying quantity (e.g., temperature projections to the year 2070), and the spread of the estimates indicates their uncertainty. It recognizes that a community of scientists will not agree completely but that there is value in looking for a consensus and information in the range of disagreement. A simple average of the teams' outputs gives a consensus estimate, but it does not recognize that some outputs are more variable than others. Statistical analysis of variance (ANOVA) models offer a way to obtain a weighted consensus estimate of outputs with a variance that is the smallest possible and hence the tightest possible 'one-sigma' and 'two-sigma' intervals. Modulo dependence between MIP outputs, the ANOVA approach weights a team's output inversely proportional to its variation. When external verification data are available for evaluating the fidelity of each MIP output, ANOVA weights can also provide a prior distribution for Bayesian Model Averaging to yield a consensus estimate. We use a MIP of carbon dioxide flux inversions to illustrate the ANOVA-based weighting and subsequent consensus inferences.
翻译:模型相互比较项目(MIP)由各小组组成,每个小组分别估计相同的基本数量(如2070年的温度预测),而估计数的分布表明其不确定性。它承认科学家群体不会完全达成一致,但寻找共识和各种不同意见的信息是有价值的。小组产出的简单平均数提供了协商一致的估计,但并不承认某些产出比其他产出更具有变量。对差异的统计分析(ANOVA)模型提供了一种途径,以获得对产出的加权一致估计,其差异最小,因此也是最接近的“一格玛”和“二格玛”间隔。Modulo在MIP产出之间的依赖性,ANOVA对团队产出的加权与其差异成反比。当有外部核查数据用于评价每项MIP产出的准确性时,ANOVA重量也可以提供Bayesian模型的事先分发,以得出协商一致的估计。我们使用二氧化碳通量的MIP,用于说明基于ANOVA的加权和随后的协商一致。