While generative models have recently become ubiquitous in many scientific areas, less attention has been paid to their evaluation. For molecular generative models, the state-of-the-art examines their output in isolation or in relation to its input. However, their biological and functional properties, such as ligand-target interaction is not being addressed. In this study, a novel biologically-inspired benchmark for the evaluation of molecular generative models is proposed. Specifically, three diverse reference datasets are designed and a set of metrics are introduced which are directly relevant to the drug discovery process. In particular we propose a recreation metric, apply drug-target affinity prediction and molecular docking as complementary techniques for the evaluation of generative outputs. While all three metrics show consistent results across the tested generative models, a more detailed comparison of drug-target affinity binding and molecular docking scores revealed that unimodal predictiors can lead to erroneous conclusions about target binding on a molecular level and a multi-modal approach is thus preferrable. The key advantage of this framework is that it incorporates prior physico-chemical domain knowledge into the benchmarking process by focusing explicitly on ligand-target interactions and thus creating a highly efficient tool not only for evaluating molecular generative outputs in particular, but also for enriching the drug discovery process in general.
翻译:虽然基因模型最近在许多科学领域变得无处不在,但对其评价的关注却较少。对于分子基因模型,最先进的模型在孤立或与其投入相关的情况下检查其产出。然而,其生物和功能特性,例如离子和目标相互作用等,并没有得到解决。在本研究中,为评价分子基因模型,提出了一个由生物学启发的新颖基准,具体地说,设计了三个不同的参考数据集,并引入了一套与药物发现过程直接相关的指标。特别是,我们提出了娱乐指标,采用药物目标亲近性预测和分子对接作为评估基因模型产出的补充技术。尽管所有三个指标都显示在经过测试的基因模型中取得一致的结果,但更详细地比较药物目标的结合和分子对齐分数表明,单一的预测可导致对与分子水平有约束力的目标得出错误的结论,因此,一种多模式方法更为可取。这个框架的主要优点是,它将之前的药物目标对药物目标的近似近似性预测和分子化学知识作为补充技术,用于评估基因测试过程,因此,只将高额的基因化学领域知识纳入高度的基因测定目标。