Machine-learning models based on a point-cloud representation of a physical object are ubiquitous in scientific applications and particularly well-suited to the atomic-scale description of molecules and materials. Among the many different approaches that have been pursued, the description of local atomic environments in terms of their neighbor densities has been used widely and very succesfully. We propose a novel density-based method which involves computing ``Wigner kernels''. These are fully equivariant and body-ordered kernels that can be computed iteratively with a cost that is independent of the radial-chemical basis and grows only linearly with the maximum body-order considered. This is in marked contrast to feature-space models, which comprise an exponentially-growing number of terms with increasing order of correlations. We present several examples of the accuracy of models based on Wigner kernels in chemical applications, for both scalar and tensorial targets, reaching state-of-the-art accuracy on the popular QM9 benchmark dataset, and we discuss the broader relevance of these ideas to equivariant geometric machine-learning.
翻译:基于物理物体的点球表示法的机器学习模型在科学应用中普遍存在,特别适合于分子和材料的原子规模描述。在所追求的多种不同方法中,以相邻密度对当地原子环境的描述被广泛使用,而且非常容易使用。我们提出了一种新的基于密度的方法,其中涉及计算“显微核内核”。这些是完全不均匀的、由身体排列的内核,可以与一种独立于辐射化学基础、仅随着所考虑的最大体序而直线增长的成本进行迭接计算。这与地貌空间模型形成鲜明的对照,该模型包含成倍增长的术语,其关联性也不断提高。我们举出了几个例子,说明基于化学应用中“显微核内核”的模型的准确性,用于标度和阵列目标,达到流行的QM9基准数据集的状态精确性,我们讨论这些概念与QM9基准数据集的更广泛相关性,我们讨论这些概念与QQQ-Q-Q-阵数的测深度机学习之间的关联性。</s>