Supervised operator learning is an emerging machine learning paradigm with applications to modeling the evolution of spatio-temporal dynamical systems and approximating general black-box relationships between functional data. We propose a novel operator learning method, LOCA (Learning Operators with Coupled Attention), motivated from the recent success of the attention mechanism. In our architecture, the input functions are mapped to a finite set of features which are then averaged with attention weights that depend on the output query locations. By coupling these attention weights together with an integral transform, LOCA is able to explicitly learn correlations in the target output functions, enabling us to approximate nonlinear operators even when the number of output function in the training set measurements is very small. Our formulation is accompanied by rigorous approximation theoretic guarantees on the universal expressiveness of the proposed model. Empirically, we evaluate the performance of LOCA on several operator learning scenarios involving systems governed by ordinary and partial differential equations, as well as a black-box climate prediction problem. Through these scenarios we demonstrate state of the art accuracy, robustness with respect to noisy input data, and a consistently small spread of errors over testing data sets, even for out-of-distribution prediction tasks.
翻译:监督操作员学习是一种新兴的机器学习模式,其应用是模拟时空空间动态系统的演进,以及功能数据之间接近一般黑盒关系。我们提议了一种新型操作员学习方法,即LOCA(学习操作员加注意),其动力来自关注机制最近的成功。在我们的结构中,输入功能被映射为一套有限的特征,这些特征先以关注权加权值平均,然后根据产出查询地点加以平均。通过将这些关注权重与整体变异结合起来,LOCA能够明确了解目标输出功能的关联性,使我们能够在培训数据集测量的产出功能数量非常小的情况下,大致接近非线性操作员。我们的设计配以严格的近似理论保证,保证拟议模型的普遍清晰性。我们随机地评估LOCA在几个操作员学习情景上的绩效,这些情景涉及受普通和部分差异方程式制约的系统,以及黑箱气候预测问题。通过这些情景,我们展示了艺术准确性状况、对冷度投入数据的坚固度,并持续地对数据进行小幅分布作出预测。