The race to meet the challenges of the global pandemic has served as a reminder that the existing drug discovery process is expensive, inefficient and slow. There is a major bottleneck screening the vast number of potential small molecules to shortlist lead compounds for antiviral drug development. New opportunities to accelerate drug discovery lie at the interface between machine learning methods, in this case developed for linear accelerators, and physics-based methods. The two in silico methods, each have their own advantages and limitations which, interestingly, complement each other. Here, we present an innovative infrastructural development that combines both approaches to accelerate drug discovery. The scale of the potential resulting workflow is such that it is dependent on supercomputing to achieve extremely high throughput. We have demonstrated the viability of this workflow for the study of inhibitors for four COVID-19 target proteins and our ability to perform the required large-scale calculations to identify lead antiviral compounds through repurposing on a variety of supercomputers.
翻译:应对全球流行病挑战的竞赛提醒人们,现有的药物发现过程是昂贵、低效和缓慢的,对大量潜在的小分子进行重大瓶颈筛查,将大量潜在的小分子排入抗病毒药物开发的短名单铅化合物。加速药物发现的新机会在于机器学习方法之间的界面,即为线性加速器开发的机器学习方法与物理学方法。在硅方法中,两种方法都有各自的优势和局限性,有趣的是,两者相辅相成。在这里,我们展示了一种创新的基础设施发展,将两种方法结合起来,以加速药物发现。由此产生的潜在工作流程的规模是,它依赖于超高速计算,以达到极高的吞吐量。我们已经展示了这种对四种COVID-19目标蛋白质进行抑制剂研究的工作流程的可行性,以及我们通过对各种超级计算机进行再处理,进行所需的大规模计算以确定铅抗病毒化合物的能力。