The Linac Coherent Light Source (LCLS) is an X- ray free electron laser (XFEL) facility enabling the study of the structure and dynamics of single macromolecules. A major upgrade will bring the repetition rate of the X-ray source from 120 to 1 million pulses per second. Exascale high performance computing (HPC) capabilities will be required to process the corresponding data rates. We present SpiniFEL, an application used for structure determination of proteins from single-particle imaging (SPI) experiments. An emerging technique for imaging individual proteins and other large molecular complexes by outrunning radiation damage, SPI breaks free from the need for crystallization (which is difficult for some proteins) and allows for imaging molecular dynamics at near ambient conditions. SpiniFEL is being developed to run on supercomputers in near real-time while an experiment is taking place, so that the feedback about the data can guide the data collection strategy. We describe here how we reformulated the mathematical framework for parallelizable implementation and accelerated the most compute intensive parts of the application. We also describe the use of Pygion, a Python interface for the Legion task-based programming model and compare to our existing MPI+GPU implementation.
翻译:Linac Coherent光源(LCLS)是一个X射线免费电子激光(XFEL)设施,用于研究单一大型分子球体的结构和动态。重大升级将使X射线源的重复率从每秒1.2亿个脉冲从1.2万次提高到每秒100万个脉冲。处理相应的数据率需要Exscale高性能计算能力。我们介绍了SpiniFEL,这是用于从单粒子成像(SPI)实验中确定蛋白质结构的一种应用。一种新兴技术,通过辐射损害外的辐射损害来成象个别蛋白和其他大型分子复合体,SPI脱离结晶化的需要(某些蛋白很难做到),并允许在接近环境条件下进行成像分子动力学。SpiniFEL正在开发,以便在实验进行时在近实时运行超级计算机,以便对数据进行反馈,指导数据收集战略。我们在这里描述我们如何重新制定数学框架,用于平行实施,并加速应用中最精密的密集部分。我们还描述了对GI+GI号模型的利用情况。