The current COVID-19 global pandemic caused by the SARS-CoV-2 betacoronavirus has resulted in over a million deaths and is having a grave socio-economic impact, hence there is an urgency to find solutions to key research challenges. Much of this COVID-19 research depends on distributed computing. In this article, I review distributed architectures -- various types of clusters, grids and clouds -- that can be leveraged to perform these tasks at scale, at high-throughput, with a high degree of parallelism, and which can also be used to work collaboratively. High-performance computing (HPC) clusters will be used to carry out much of this work. Several bigdata processing tasks used in reducing the spread of SARS-CoV-2 require high-throughput approaches, and a variety of tools, which Hadoop and Spark offer, even using commodity hardware. Extremely large-scale COVID-19 research has also utilised some of the world's fastest supercomputers, such as IBM's SUMMIT -- for ensemble docking high-throughput screening against SARS-CoV-2 targets for drug-repurposing, and high-throughput gene analysis -- and Sentinel, an XPE-Cray based system used to explore natural products. Grid computing has facilitated the formation of the world's first Exascale grid computer. This has accelerated COVID-19 research in molecular dynamics simulations of SARS-CoV-2 spike protein interactions through massively-parallel computation and was performed with over 1 million volunteer computing devices using the Folding@home platform. Grids and clouds both can also be used for international collaboration by enabling access to important datasets and providing services that allow researchers to focus on research rather than on time-consuming data-management tasks.
翻译:SARS-COV-2 betacorona病毒引发的当前COVID-19全球流行病导致100多万人死亡,并正在产生严重的社会经济影响,因此迫切需要找到解决关键研究挑战的办法。COVID-19研究的很大一部分取决于分布式计算。在本篇文章中,我审查了分布式结构 -- -- 各种类型的集群、网格和云 -- -- 可以用来大规模地完成这些任务,具有高度的平行性,也可以用于协同工作。高性能计算(HPC)群集将被用于开展许多这项工作。在减少SAS-COV-2的传播过程中所使用的一些大数据处理任务需要高通量的方法和各种工具。Hadoop和Sprark提供,甚至使用商品硬件。 极大规模的COVID-19研究还利用了世界上一些最快的超级计算机计算机计算机计算机,例如IMB's SUMMMIT, 将高通量的检测数据用于SARS-C-COV-2目标。在S-C-C-C-C-Sentreal Streal Streal-deal Streal Creal-deal deal devial exal 上进行一个高级的计算机化数据分析系统, 和高级的计算机化数据分析。在Sental-creal-cal-creal-creal-creal-creabal-creal-creal-creal-modudududududuction 上,通过S-lational-lational-hal-C-C-lational-lational-lationalking-C-C-dal-lationalking-dal-dal-dal-dal-dal-dal-dal-dal-dal-dal-dal-dal-dal-dal-dal-C-dal-C-daldal-C-dal-dal-dal-dal-dal-dal-dal-C-C-C-C-dal-C-dal-dal-dal-dal-modal-C-C-C-C-C-dal-C-C-C-C-C-C-dal-d-l-