Kernel machines have sustained continuous progress in the field of quantum chemistry. In particular, they have proven to be successful in the low-data regime of force field reconstruction. This is because many equivariances and invariances due to physical symmetries can be incorporated into the kernel function, compensating for the much larger datasets that would otherwise be required. So far, however, the scalability of kernel machines has been hindered by their quadratic memory and cubic runtime complexity in the number of training points. While it is known that iterative Krylov subspace solvers can overcome these burdens, their convergence crucially relies on effective preconditioners, which are elusive in practice. An effective preconditioner needs to partially pre-solve the learning problem in a computationally cheap and numerically robust manner. Here, we consider the broad class of Nystr\"om-type methods to construct preconditioners based on successively more sophisticated low-rank approximations of the original kernel matrix, each of which provides a different set of computational trade-offs. All considered methods aim to identify a representative subset of inducing (kernel) columns to approximate the dominant kernel spectrum.
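To make the construction concrete, the following is a minimal sketch of a basic Nystr\"om-type preconditioner for a regularized kernel system; the notation ($M$ inducing columns, kernel sub-blocks $K_{NM}$ and $K_{MM}$, regularization strength $\lambda$) is chosen here purely for illustration and is not fixed by the text above.

% A minimal sketch of a Nystr\"om-type preconditioner (notation assumed for illustration).
% Selecting M << N inducing columns of the N x N kernel matrix K yields the sub-blocks
% K_{NM} (N x M) and K_{MM} (M x M).
\begin{align*}
  K &\approx \hat{K} = K_{NM}\,K_{MM}^{-1}\,K_{MN}, \\
  P &= \hat{K} + \lambda I, \\
  P^{-1} &= \frac{1}{\lambda}\left(I - K_{NM}\left(\lambda K_{MM} + K_{MN}K_{NM}\right)^{-1}K_{MN}\right).
\end{align*}
% The rank-M approximation \hat{K} is meant to capture the dominant kernel spectrum. By the
% Woodbury identity, P^{-1} can be applied to a vector in O(NM) time after a one-time
% O(NM^2 + M^3) factorization, which is cheap relative to the O(N^2) kernel matrix-vector
% product performed in each Krylov (e.g. conjugate gradient) iteration on (K + \lambda I)\alpha = y.

How the $M$ inducing columns are selected is exactly where the successively more sophisticated low-rank approximations mentioned above differ; common choices in the Nystr\"om literature range from uniform sampling to leverage-score-based or greedy pivoted selection, each trading approximation quality against setup cost.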