Recently, graph neural networks (GNNs) have been successfully applied to predicting molecular properties, which is one of the most classical cheminformatics tasks with various applications. Despite their effectiveness, we empirically observe that training a single GNN model for diverse molecules with distinct structural patterns limits its prediction performance. In this paper, motivated by this observation, we propose TopExpert to leverage topology-specific prediction models (referred to as experts), each of which is responsible for each molecular group sharing similar topological semantics. That is, each expert learns topology-specific discriminative features while being trained with its corresponding topological group. To tackle the key challenge of grouping molecules by their topological patterns, we introduce a clustering-based gating module that assigns an input molecule into one of the clusters and further optimizes the gating module with two different types of self-supervision: topological semantics induced by GNNs and molecular scaffolds, respectively. Extensive experiments demonstrate that TopExpert has boosted the performance for molecular property prediction and also achieved better generalization for new molecules with unseen scaffolds than baselines. The code is available at https://github.com/kimsu55/ToxExpert.
翻译:最近,图形神经网络(GNNs)被成功地应用于预测分子特性,这是具有各种应用的最古典化学化学任务之一。尽管这些网络具有效力,但我们从经验上观察到,为不同结构模式的不同分子培训一个单一的GNN模型限制了其预测性能。在本文中,我们提议顶层专家利用具有不同地形特点的预测模型(称为专家),每个模型都负责拥有相似的地形语义的每个分子组。也就是说,每个专家在接受相应的表层学组培训时,学习了特定的表层学歧视特征。为了通过它们的表层学模式应对分子组合的关键挑战,我们引入了一个基于集群的模组模组模组,将一个输入分子划入一个组,并进一步优化配有两种不同类型的自我监督型的模组:由GNNS和分子骨架分别引出的表理学语义。广泛的实验表明,顶层实验提高了分子特性预测的性能,并且实现了新分子分子的精准化。MADAFMSM/COMSMDMDMDMSDMDMDSUDSUDDSUDMDS</s>