The structural design of functional molecules, also called molecular optimization, is an essential chemical science and engineering task with important applications, such as drug discovery. Deep generative models and combinatorial optimization methods achieve initial success but still struggle with directly modeling discrete chemical structures and often heavily rely on brute-force enumeration. The challenge comes from the discrete and non-differentiable nature of molecule structures. To address this, we propose differentiable scaffolding tree (DST) that utilizes a learned knowledge network to convert discrete chemical structures to locally differentiable ones. DST enables a gradient-based optimization on a chemical graph structure by back-propagating the derivatives from the target properties through a graph neural network (GNN). Our empirical studies show the gradient-based molecular optimizations are both effective and sample efficient. Furthermore, the learned graph parameters can also provide an explanation that helps domain experts understand the model output.
翻译:功能分子的结构设计,也称为分子优化,是一项重要的化学科学和工程任务,具有重要的应用,如药物发现等。深基因模型和组合优化方法取得了初步成功,但仍在与直接建模离散化学结构进行挣扎,而且往往严重依赖粗力计。挑战来自分子结构的离散和无差异性质。为此,我们建议使用知识网络将离散化学结构转换为地方差异化学结构的可区分的脚手架树(DST),DST通过一个图形神经网络(GNN)对目标特性衍生物进行反光分析,使化学图结构的梯度优化。我们的经验研究表明,基于梯度的分子优化既有效,又具有样本效率。此外,所学的图表参数还可以提供解释,帮助域专家了解模型输出。