The application of machine learning methods in quantum chemistry has enabled the study of numerous chemical phenomena, which are computationally intractable with traditional ab-initio methods. However, some quantum mechanical properties of molecules and materials depend on non-local electronic effects, which are often neglected due to the difficulty of modeling them efficiently. This work proposes a modified attention mechanism adapted to the underlying physics, which allows to recover the relevant non-local effects. Namely, we introduce spherical harmonic coordinates (SPHCs) to reflect higher-order geometric information for each atom in a molecule, enabling a non-local formulation of attention in the SPHC space. Our proposed model So3krates - a self-attention based message passing neural network - uncouples geometric information from atomic features, making them independently amenable to attention mechanisms. Thereby we construct spherical filters, which extend the concept of continuous filters in Euclidean space to SPHC space and serve as foundation for a spherical self-attention mechanism. We show that in contrast to other published methods, So3krates is able to describe non-local quantum mechanical effects over arbitrary length scales. Further, we find evidence that the inclusion of higher-order geometric correlations increases data efficiency and improves generalization. So3krates matches or exceeds state-of-the-art performance on popular benchmarks, notably, requiring a significantly lower number of parameters (0.25 - 0.4x) while at the same time giving a substantial speedup (6 - 14x for training and 2 - 11x for inference) compared to other models.
翻译:在量子化学中应用机器学习方法,使得能够对多种化学现象进行研究,这些化学现象在计算上与传统的AB-initio方法是难以理解的。然而,分子和材料的一些量子机械特性取决于非局部电子效应,而由于难以有效地建模这些效应,这些特性往往被忽略。这项工作建议了一种适应基础物理学的调整关注机制,从而能够恢复相关的非局部效应。也就是说,我们引入球形口音坐标(SPHCs),以反映分子中每个原子的更高层次的几何信息,从而能够在SPHC空间中形成一种非局部的注意。我们提议的模型So3krates -- 一种基于自我注意的信息传递神经网络 -- 一种与原子特征不相交错的地理测量信息,使它们独立地适应关注机制。 我们由此建立了一种球形过滤器,将Euclidean空间连续过滤器的概念延伸到SPHC空间,并作为同一球形自我观察机制的基础。我们发现,与其他公布的方法相比,So3krates模型能够将非本地的参数传递到通过神经网络传递的信息,而明显地标的精确度比标值比标准的精确度比,从而提高一般的精确度数据效率。