Climate models play a crucial role in understanding the effect of environmental and man-made changes on climate to help mitigate climate risks and inform governmental decisions. Large global climate models such as the Community Earth System Model (CESM), developed by the National Center for Atmospheric Research, are very complex with millions of lines of code describing interactions of the atmosphere, land, oceans, and ice, among other components. As development of the CESM is constantly ongoing, simulation outputs need to be continuously controlled for quality. To be able to distinguish a "climate-changing" modification of the code base from a true climate-changing physical process or intervention, there needs to be a principled way of assessing statistical reproducibility that can handle both spatial and temporal high-dimensional simulation outputs. Our proposed work uses probabilistic classifiers like tree-based algorithms and deep neural networks to perform a statistically rigorous goodness-of-fit test of high-dimensional spatio-temporal data.
翻译:气候模型在理解环境和人为变化对气候的影响,帮助减轻气候风险和为政府决策提供信息方面发挥着关键作用。大型全球气候模型,如国家大气研究中心开发的社区地球系统模型(CESM)非常复杂,有数百万条描述大气、陆地、海洋和冰层相互作用的代码线。随着CESM的不断发展,模拟产出需要按质量不断加以控制。为了能够将代码基础的“气候变化”修改与真正的气候变化物理过程或干预区分开来,需要有一个原则性的方法来评估统计再复制能力,既能处理空间和时空高度模拟产出。我们提议的工作利用基于树木的算法和深神经网络等概率分类者来进行严格统计的、完善的高维空时数据测试。