Consumers, regulators, and ISPs all use client-based "speed tests" to measure network performance, both in single-user settings and in aggregate. Two prevalent speed tests, Ookla's Speedtest and Measurement Lab's Network Diagnostic Test (NDT), are often used for similar purposes, despite having significant differences in both the test design and implementation, and in the infrastructure used to perform measurements. In this paper, we present the first-ever comparative evaluation of Ookla and NDT7 (the latest version of NDT), both in controlled and wide-area settings. Our goal is to characterize when and to what extent these two speed tests yield different results, as well as the factors that contribute to the differences. To study the effects of the test design, we conduct a series of controlled, in-lab experiments under a comprehensive set of network conditions and usage modes (e.g., TCP congestion control, native vs. browser client). Our results show that Ookla and NDT7 report similar speeds under most in-lab conditions, with the exception of networks that experience high latency, where Ookla consistently reports higher throughput. To characterize the behavior of these tools in wide-area deployment, we collect more than 80,000 pairs of Ookla and NDT7 measurements across nine months and 126 households, with a range of ISPs and speed tiers. This first-of-its-kind paired-test analysis reveals many previously unknown systemic issues, including high variability in NDT7 test results and systematically under-performing servers in the Ookla network.
翻译:消费者、监管者和ISP都使用基于客户的“ 速度测试” 来测量网络性能, 无论是在单一用户环境中还是总体而言。 两种普遍的速度测试,即Ookla的快速测试和测量实验室的网络诊断测试(NDT),经常用于类似的目的,尽管测试设计和实施以及用于测量的基础设施存在重大差异。 在本文中,我们首次对Ookla和NDT7 (NDT的最新版本)进行了比较评估, 包括在控制和广域环境中。 我们的目标是确定这两种速度测试何时和在多大程度上产生不同的结果,以及造成差异的因素。 为了研究测试设计的效果,我们在一套全面的网络条件和使用模式(如TCP拥堵控制、本地用户与浏览器客户)下进行了一系列受控的实验室内实验。 我们的结果表明,Ookla和NDT7 (NDT7) 在大多数被控制和广域域环境中都报告类似的速度。 我们的网络存在高水平, 其中Ookla 连续的测试结果显示在NDTA 7 和 AS 系统性测试工具的高度运行中, 超过我们这些不为1080 的系统内部的频率。