The Compact Muon Solenoid (CMS) experiment at the Large Hadron Collider (LHC) is one of the largest data producers in the scientific world, with standard data products centrally produced, and then used by often competing teams within the collaboration. This work is focused on how a local institution, University of California San Diego (UCSD), partnered with the Open Science Grid (OSG) to use Azure cloud resources to augment its available computing to accelerate time to results for multiple analyses pursued by a small group of collaborators. The OSG is a federated infrastructure allowing many independent resource providers to serve many independent user communities in a transparent manner. Historically the resources would come from various research institutions, spanning small universities to large HPC centers, based on either community needs or grant allocations, so adding commercial clouds as resource providers is a natural evolution. The OSG technology allows for easy integration of cloud resources, but the data-intensive nature of CMS compute jobs required the deployment of additional data caching infrastructure to ensure high efficiency.
翻译:在大型强子相撞器(LHC)上,Clubizing Muon Solenoid(CMS)实验是科学界最大的数据生产者之一,标准数据产品集中生产,然后在协作中被经常相互竞争的团队使用。这项工作的重点是,加利福尼亚圣地亚哥大学(UCSDY)与开放科学网(OSG)合作,利用Azure云资源增加现有计算,以加快时间,取得结果,让一小组合作者进行多重分析。OSG是一个联合基础设施,使许多独立的资源提供者能够以透明的方式为许多独立的用户社区服务。历史上,资源来自各种研究机构,将小型大学扩展到大型HPC中心,以社区需求或拨款为基础,从而增加商业云作为资源提供者是一种自然演进。OSG技术便于云资源整合,但CMS的数据密集性质要求部署额外的数据缓冲基础设施以确保高效率。