Achieving a complete and symmetric description of a group of point particles, such as atoms in a molecule, is a common problem in physics and theoretical chemistry. The introduction of machine learning to science has made this issue even more critical, as it underpins the ability of a model to reproduce arbitrary physical relationships, and to do so while being consistent with basic symmetries and conservation laws. However, the descriptors that are commonly used to represent point clouds -- most notably those adopted to describe matter at the atomic scale -- are unable to distinguish between special arrangements of particles. This makes it impossible to machine learn their properties. Frameworks that are provably complete exist, but are only so in the limit in which they simultaneously describe the mutual relationship between all atoms, which is impractical. We introduce, and demonstrate on a particularly insidious class of atomic arrangements, a strategy to build descriptors that rely solely on information on the relative arrangement of triplets of particles, but can be used to construct symmetry-adapted models that have universal approximation power.
翻译:实现一组点粒子的完整和对称描述,例如分子中的原子,是物理学和理论化学的一个常见问题。机器学习科学使这一问题变得更加重要,因为它支撑了模型复制任意物理关系的能力,而且这样做符合基本的对称和保存法。然而,通常用来代表点云的描述符 -- -- 主要是用来描述原子规模物质的描述符 -- -- 无法区分特殊粒子安排。这使得机器无法学习其特性。可以肯定的完整框架存在,但仅限于它们同时描述所有原子之间相互关系的限度,这种框架是不切实际的。我们引入并展示了一种特别阴险的原子安排类别,以仅仅依靠粒子三重的相对安排的信息来构建描述符,但可以用来构建具有普遍近近似能力的对称模型。</s>