The prosperity of computer vision (CV) and natural language procession (NLP) in recent years has spurred the development of deep learning in many other domains. The advancement in machine learning provides us with an alternative option besides the computationally expensive density functional theories (DFT). Kernel method and graph neural networks have been widely studied as two mainstream methods for property prediction. The promising graph neural networks have achieved comparable accuracy to the DFT method for specific objects in the recent study. However, most of the graph neural networks with high precision so far require fully connected graphs with pairwise distance distribution as edge information. In this work, we shed light on the Directed Graph Attention Neural Network (DGANN), which only takes chemical bonds as edges and operates on bonds and atoms of molecules. DGANN distinguishes from previous models with those features: (1) It learns the local chemical environment encoding by graph attention mechanism on chemical bonds. Every initial edge message only flows into every message passing trajectory once. (2) The transformer blocks aggregate the global molecular representation from the local atomic encoding. (3) The position vectors and coordinates are used as inputs instead of distances. Our model has matched or outperformed most baseline graph neural networks on QM9 datasets even without thorough hyper-parameters searching. Moreover, this work suggests that models directly utilizing 3D coordinates can still reach high accuracies for molecule representation even without rotational and translational invariance incorporated.
翻译:近年来,计算机视觉(CV)和自然语言进程(NLP)的繁荣近些年来,计算机视觉(CV)和自然语言进程(NLP)的繁荣刺激了许多其他领域的深层次学习。机器学习的进步为我们提供了除计算成本昂贵的密度功能理论(DFT)之外的替代选择。核心方法和图形神经网络作为财产预测的两个主流方法得到了广泛的研究。有希望的图形神经网络在最近的研究中为特定对象实现了与DFT方法相似的精确度。然而,迄今为止,大多数具有高度精确度的图形神经网络需要完全连接的图形,作为边缘信息进行对接的距离分布。在这项工作中,我们为直接的图形关注神经网络(DGANN)打开了灯光,这个网络仅将化学债券作为边际,在债券和分子原子原子原子原子原子原子上运行。DGANNT与以前的模型有区别:(1)它通过化学债券的图形关注机制学习当地的化学环境。每个初始边端信息只一次流到每个信息通过轨迹。(2)变形块将全球分子的分布从本地原子编码中收集。(3)甚至将位置矢量都用于翻译,甚至将其用作最远的内径直径定位的内置的内置的内置的内径流。