使用混合机器学习和物理模拟高性能计算机加速发现药物 (Pandemic Drugs at Pandemic Speed: Accelerating COVID-19 Drug Discovery with Hybrid Machine Learning- and Physics-based Simulations on High Performance Computers)

Performer · COVID-19 · 短列表 · Reminders · 可辨认的 ·

2021 年 3 月 4 日

Pandemic Drugs at Pandemic Speed: Accelerating COVID-19 Drug Discovery with Hybrid Machine Learning- and Physics-based Simulations on High Performance Computers

翻译：使用混合机器学习和物理模拟高性能计算机加速发现药物

Agastya P. Bhati,Shunzhou Wan,Dario Alfè,Austin R. Clyde,Mathis Bode,Li Tan,Mikhail Titov,Andre Merzky,Matteo Turilli,Shantenu Jha,Roger R. Highfield,Walter Rocchia,Nicola Scafuri,Sauro Succi,Dieter Kranzlmüller,Gerald Mathias,David Wifling,Yann Donon,Alberto Di Meglio,Sofia Vallecorsa,Heng Ma,Anda Trifan,Arvind Ramanathan,Tom Brettin,Alexander Partin,Fangfang Xia,Xiaotan Duan,Rick Stevens,Peter V. Coveney

The race to meet the challenges of the global pandemic has served as a reminder that the existing drug discovery process is expensive, inefficient and slow. There is a major bottleneck screening the vast number of potential small molecules to shortlist lead compounds for antiviral drug development. New opportunities to accelerate drug discovery lie at the interface between machine learning methods, in this case developed for linear accelerators, and physics-based methods. The two in silico methods, each have their own advantages and limitations which, interestingly, complement each other. Here, we present an innovative method that combines both approaches to accelerate drug discovery. The scale of the resulting workflow is such that it is dependent on high performance computing. We have demonstrated the applicability of this workflow on four COVID-19 target proteins and our ability to perform the required large-scale calculations to identify lead compounds on a variety of supercomputers.

翻译：应对全球流行病挑战的竞赛提醒人们,现有的药物发现过程是昂贵、低效和缓慢的,对大量潜在的小分子进行重大瓶颈筛查,以排入抗病毒药物开发的铅化合物短名单。加速药物发现的新机会在于机器学习方法(此处是为线性加速器开发的)和物理方法之间的界面。两种硅方法都有各自的优势和局限性,有趣的是,两者是相辅相成的。这里,我们提出了一种创新方法,将两种方法结合起来,以加速药物发现。由此产生的工作流程的规模取决于高性能计算。我们已经展示了这种工作流程对四种COVID-19目标蛋白质的适用性,以及我们进行所需的大规模计算的能力,以便在各种超级计算机上识别铅化合物。