We present a novel neural network architecture using self-attention, the Wavefunction Transformer (Psiformer), which can be used as an approximation (or Ansatz) for solving the many-electron Schr\"odinger equation, the fundamental equation for quantum chemistry and material science. This equation can be solved from first principles, requiring no external training data. In recent years, deep neural networks like the FermiNet and PauliNet have been used to significantly improve the accuracy of these first-principle calculations, but they lack an attention-like mechanism for gating interactions between electrons. Here we show that the Psiformer can be used as a drop-in replacement for these other neural networks, often dramatically improving the accuracy of the calculations. On larger molecules especially, the ground state energy can be improved by dozens of kcal/mol, a qualitative leap over previous methods. This demonstrates that self-attention networks can learn complex quantum mechanical correlations between electrons, and are a promising route to reaching unprecedented accuracy in chemical calculations on larger systems.
翻译:我们提出了一种新颖的神经网络体系结构,使用自我关注机制,即WF变换器(Psiformer),可以用作求解许多电子Schr\''odinger方程的近似(或Ansatz),这是量子化学和材料科学的基本方程。这个方程可以从第一原理解决,不需要外部的训练数据。近年来,如FermiNet和PauliNet之类的深度神经网络已被用于显著提高这些第一原理计算的准确性,但它们缺乏一种类似注意机制的电子之间交互的门控机制。在这里,我们展示了Psiformer可以用作这些其他神经网络的替代,并且经常显著提高计算的准确性。尤其是在较大分子上,基态能量可以提高几十kcal/mol,是以前方法的质的飞跃。这表明自我注意网络可以学习电子之间的复杂量子机械相关性,是实现化学计算上前所未有的精度的有效途径。