With advances in scientific computing and mathematical modeling, complex phenomena can now be reliably simulated. Such simulations can however be very time-intensive, requiring millions of CPU hours to perform. One solution is multi-fidelity emulation, which uses data of varying accuracies (or fidelities) to train an efficient predictive model (or emulator) for the expensive simulator. In complex problems, simulation data with different fidelities are often connected scientifically via a directed acyclic graph (DAG), which cannot be integrated within existing multi-fidelity emulator models. We thus propose a new Graphical Multi-fidelity Gaussian process (GMGP) model, which embeds this scientific DAG information within a Gaussian process framework. We show that the GMGP has desirable modeling traits via two Markov properties, and admits a scalable formulation for recursively computing the posterior predictive distribution along sub-graphs. We also present an experimental design framework over the DAG given a computational budget, and propose a nonlinear extension of the GMGP model via deep Gaussian processes. The advantages of the GMGP model over existing methods are then demonstrated via a suite of numerical experiments and an application to emulation of heavy-ion collisions, which can be used to study the conditions of matter in the Universe shortly after the Big Bang.
翻译:随着科学计算和数学建模的进步,现在可以可靠地模拟复杂现象。但是,这种模拟可能非常耗时,需要数百万个CPU小时才能完成。一个解决办法是多纤维模擬,它使用不同理解(或忠实)的数据,为昂贵模拟器培训高效的预测模型(或模拟器),在复杂的问题中,不同忠诚的模拟数据往往通过定向循环图(DAG)在科学上连接,该图无法融入现有的多纤维模拟模型。因此,我们提出了一个新的图形化多纤维化高斯进程(GMGP)模型,该模型将这一科学的DAGG信息嵌入高斯进程框架。我们表明,GMGP具有通过两个Markov属性进行理想的模型特征,并承认一种可缩放的配方配方,在计算预算的情况下,对DAGAG提出一个实验框架,并提议通过深层的GGGGP模型的模型和非线性扩展。在GGGG模型应用后,通过现有的重的GGP模型应用方法,在GGGGGGGM的模型中,可以展示其现有的重度实验优势。