Attention mechanisms are developing into a viable alternative to convolutional layers as elementary building block of NNs. Their main advantage is that they are not restricted to capture local dependencies in the input, but can draw arbitrary connections. This unprecedented capability coincides with the long-standing problem of modeling global atomic interactions in molecular force fields and other many-body problems. In its original formulation, however, attention is not applicable to the continuous domains in which the atoms live. For this purpose we propose a variant to describe geometric relations for arbitrary atomic configurations in Euclidean space that also respects all relevant physical symmetries. We furthermore demonstrate, how the successive application of our learned attention matrices effectively translates the molecular geometry into a set of individual atomic contributions on-the-fly.
翻译:关注机制正在发展成为作为非核国家基本组成部分的革命层的一个可行的替代机制,其主要优势在于它们不限于捕捉投入中的地方依赖性,而是可以任意地吸引联系。这种前所未有的能力与分子力领域全球原子相互作用模型的长期问题和其他多体问题不谋而合。然而,在最初的提法中,注意力并不适用于原子所生活的连续领域。为此目的,我们提出一个变量来描述欧西里德空间任意原子配置的几何关系,这些配置也尊重所有相关的物理对称。我们进一步表明,我们所学的注意矩阵的连续应用如何有效地将分子几何学转化为一系列天上单独原子的贡献。