Neural operators have gained significant attention recently due to their ability to approximate high-dimensional parametric maps between function spaces. At present, only parametric function approximation has been addressed in the neural operator literature. In this work we investigate incorporating parametric derivative information in neural operator training; this information can improve function approximations, additionally it can be used to improve the approximation of the derivative with respect to the parameter, which is often the key to scalable solution of high-dimensional outer-loop problems (e.g. Bayesian inverse problems). Parametric Jacobian information is formally intractable to incorporate due to its high-dimensionality, to address this concern we propose strategies based on reduced SVD, randomized sketching and the use of reduced basis surrogates. All of these strategies only require only $O(r)$ Jacobian actions to construct sample Jacobian data, and allow us to reduce the linear algebra and memory costs associated with the Jacobian training from the product of the input and output dimensions down to $O(r^2)$, where $r$ is the dimensionality associated with the dimension reduction technique. Numerical results for parametric PDE problems demonstrate that the addition of derivative information to the training problem can significantly improve the parametric map approximation, particularly given few data. When Jacobian actions are inexpensive compared to the parametric map, this information can be economically substituted for parametric map data. Additionally we show that Jacobian error approximations improve significantly with the introduction of Jacobian training data. This result opens the door to the use of derivative-informed neural operators (DINOs) in outer-loop algorithms where they can amortize the additional training data cost via repeated evaluations.
翻译:神经操作员最近由于有能力在功能空间之间近似高维参数地图而得到极大关注。 目前,神经操作员文献只涉及参数功能近似值。 我们调查了将参数衍生物信息纳入神经操作员培训的参数培训中,这种信息可以改进函数近似值,另外,还可用于改进衍生物相对于参数的近近似值,而参数往往是高维外环问题可伸缩解决方案的关键(例如巴耶西亚反向问题)。 参数Jacobian信息由于高度的高度,正式难以纳入,以解决我们提出的基于降低SVD、随机绘图和使用降低基代谢值的策略。 所有这些战略仅要求用美元(r)提高衍生物对参数的近似近似值,并使我们能够降低与高度外环球问题相关的直径调和记忆成本(例如巴耶西亚反向反向问题 ) 。 值 Jacobian 培训产品中的直径直径直值成本值成本可以大幅提高, 以美元作为与降低维度技术相关的维值评估。 数字操作员将数据比的直径直径直径比数据的结果, 当我们的直径直径比数据能数据能数据可以明显地分析结果显示, 。 直径直径比数据可以显著数据 。