Data-driven schemes that associate molecular and crystal structures with their microscopic properties share the need for a concise, effective description of the arrangement of their atomic constituents. Many types of models rely on descriptions of atom-centered environments, that are associated with an atomic property or with an atomic contribution to an extensive macroscopic quantity. Frameworks in this class can be understood in terms of atom-centered density correlations (ACDC), that are used as a basis for a body-ordered, symmetry-adapted expansion of the targets. Several other schemes, that gather information on the relationship between neighboring atoms using graph-convolutional (or message-passing) ideas, cannot be directly mapped to correlations centered around a single atom. We generalize the ACDC framework to include multi-centered information, generating representations that provide a complete linear basis to regress symmetric functions of atomic coordinates, and form the basis to systematize our understanding of both atom-centered and graph-convolutional machine-learning schemes.
翻译:将分子和晶体结构与其微粒特性联系起来的由数据驱动的计划,共同需要对其原子成分的安排进行简明有效的描述。许多类型的模型都依赖于原子中心环境的描述,这些环境与原子属性有关,或与原子对广泛宏观数量的贡献有关。这一类的框架可以从原子中心密度相关性的角度来理解,这些框架被用作物体排列、对称和调整扩大目标的基础。其他一些计划收集了利用图形革命(或信息传递)理念的相邻原子之间的关系的信息,但无法直接与以单一原子为中心的关联进行绘图。我们一般地将原子中心框架纳入多原子信息,形成完整的直线基础以回归原子坐标的对称功能,并构成我们系统理解原子中心和图形革命机器学习计划的基础。