Modern network performance monitoring toolkits, such as perfSONAR, take a remarkable number of measurements about the local network environment. To gain a complete picture of network performance, however, one needs to aggregate data across a large number of endpoints. The Service Analysis and Network Diagnosis (SAND) data pipeline collects data from diverse sources and ingests these measurements into a message bus. The message bus allows the project to send the data to multiple consumers, including a tape archive, an Elasticsearch database, and a peer infrastructure at CERN. In this paper, we explain the architecture and evolution of the SAND data pipeline, the scale of the resulting dataset, and how it supports a wide variety of network analysis applications.
翻译:诸如perfSONAR等现代网络性能监测工具包对当地网络环境进行大量测量。然而,为了全面了解网络性能,需要将大量端点的数据汇总起来。服务分析和网络诊断(SAND)数据管道从不同来源收集数据,并将这些数据输入信息总线。信息总线使该项目能够向多个消费者发送数据,包括磁带档案、Elasticsearch数据库和CERP的同行基础设施。我们在本文件中解释了SAND数据管道的结构和演变、生成数据集的规模以及它如何支持广泛的网络分析应用。