Forming a molecular candidate set that contains a wide range of potentially effective compounds is crucial to the success of drug discovery. While many aim to optimize particular chemical properties, there is limited literature on how to properly measure and encourage the exploration of the chemical space when generating drug candidates. This problem is challenging due to the lack of formal criteria to select good exploration measures. We propose a novel framework to systematically evaluate exploration measures for drug candidate generation. The framework is built upon three formal analyses: an axiomatic analysis that validates the potential measures analytically, an empirical analysis that compares the correlations of the measures to a proxy gold standard, and a practical analysis that benchmarks the effectiveness of the measures in an optimization framework of molecular generation. We are able to evaluate a wide range of potential exploration measures under this framework and make recommendations on existing and novel exploration measures that are suitable for the task of drug discovery.
翻译:一组分子候选组组成,其中含有范围广泛的潜在有效化合物,对于药物发现的成功至关重要。虽然其中许多旨在优化特定化学特性,但关于如何在产生药物候选体时适当测量和鼓励探索化学空间的文献有限。由于缺乏选择良好勘探措施的正式标准,这一问题具有挑战性。我们提议了一个新框架,以系统评价药物候选体的勘探措施。这个框架以三项正式分析为基础:分析确认潜在措施的不言而喻分析,比较各项措施与替代金标准的相关性的经验性分析,以及在分子生成优化框架内衡量各项措施有效性的实际分析。我们能够评价这一框架下的范围广泛的潜在勘探措施,并就适合毒品发现任务的现有和新颖的勘探措施提出建议。