加速复制交流的随机梯度Langevin扩散增强Bayesian DeepONet,以解决噪音超强的参数PDE (Accelerated replica exchange stochastic gradient Langevin diffusion enhanced Bayesian DeepONet for solving noisy parametric PDEs)

The Deep Operator Networks~(DeepONet) is a fundamentally different class of neural networks that we train to approximate nonlinear operators, including the solution operator of parametric partial differential equations (PDE). DeepONets have shown remarkable approximation and generalization capabilities even when trained with relatively small datasets. However, the performance of DeepONets deteriorates when the training data is polluted with noise, a scenario that occurs very often in practice. To enable DeepONets training with noisy data, we propose using the Bayesian framework of replica-exchange Langevin diffusion. Such a framework uses two particles, one for exploring and another for exploiting the loss function landscape of DeepONets. We show that the proposed framework's exploration and exploitation capabilities enable (1) improved training convergence for DeepONets in noisy scenarios and (2) attaching an uncertainty estimate for the predicted solutions of parametric PDEs. In addition, we show that replica-exchange Langeving Diffusion (remarkably) also improves the DeepONet's mean prediction accuracy in noisy scenarios compared with vanilla DeepONets trained with state-of-the-art gradient-based optimization algorithms (e.g. Adam). To reduce the potentially high computational cost of replica, in this work, we propose an accelerated training framework for replica-exchange Langevin diffusion that exploits the neural network architecture of DeepONets to reduce its computational cost up to 25% without compromising the proposed framework's performance. Finally, we illustrate the effectiveness of the proposed Bayesian framework using a series of experiments on four parametric PDE problems.

翻译：深操作员网络 ~ (DeepONet) 是一条完全不同的神经网络, 我们向近似非线性操作员, 包括模拟部分差异方程式(PDE) 的解决方案操作员提供培训。 DeepONets 显示出惊人的近似和概括能力, 即使经过相对小的数据集培训。但是, DeepONets 的性能在培训数据被噪音污染时会恶化, 这种情景在实践中经常发生。为了让DeepONets培训能够用噪音数据进行DeepONets培训, 我们提议使用贝叶尔斯框架复制交换朗氏数据。这样一个框架使用两个粒子来探索和另一个粒子来探索DeepONets的损失功能景观。我们显示,拟议框架的探索和开发能力能够(1) 改善DeepONets在噪音假设情景下的培训一致性和概括能力,(2) 对参数PDES的预测解决方案进行不确定性估计。此外, 我们显示, 复制Langeving Difulation(可明显地) 框架意味着, 将噪音框架的准确性预测, 与经州- Deeponets培训后, 利用州级网络的加速的加速成本模型分析, 将降低成本分析, 我们提议的升级的模型进行。