Beamlines at synchrotron light source facilities are powerful scientific instruments used to image samples and observe phenomena at high spatial and temporal resolutions. Typically, these facilities are equipped only with modest compute resources for the analysis of generated experimental datasets. However, high data rate experiments can easily generate data in volumes that take days (or even weeks) to process on those local resources. To address this challenge, we present a system that unifies leadership computing and experimental facilities by enabling the automated establishment of data analysis pipelines that extend from edge data acquisition systems at synchrotron beamlines to remote computing facilities; under the covers, our system uses Globus Auth authentication to minimize user interaction, funcX to run user-defined functions on supercomputers, and Globus Flows to define and execute workflows. We describe the application of this system to ptychography, an ultra-high-resolution coherent diffraction imaging technique that can produce 100s of gigabytes to terabytes in a single experiment. When deployed on the DGX A100 ThetaGPU cluster at the Argonne Leadership Computing Facility and a microscopy beamline at the Advanced Photon Source, our system performs analysis as an experiment progresses to provide timely feedback.
翻译:同步光源设施Bemlines在同步光源设施中是一种强大的科学工具,用于在高空间和时空分辨率下对样本进行图像取样和观测现象。通常,这些设施仅配备少量的计算资源,用于分析生成的实验数据集,然而,高数据率实验可以很容易地生成量的数据,需要几天(甚至几周)处理当地资源。为了应对这一挑战,我们提出了一个系统,将领导计算和实验设施统一起来,使数据分析管道能够自动建立,从同步光线同步光线的边缘数据采集系统扩展到远程计算设施;在覆盖下,我们的系统使用Globus Autus认证来最大限度地减少用户互动,使用funcX来运行超级计算机用户定义的功能,以及Globus Flows来定义和执行工作流程。我们描述了这个系统在定位学方面的应用情况,这是一种超高分辨率的一致调控成成像技术,可以在一次实验中产生100千兆字节到兆字节。当安装在DGX A100 ThetaGPU 集群时,并在Agnnene Defarrestal Soursimingings, asiming asimings。