Comparing ISPs on broadband speed is challenging, since measurements can vary due to subscriber attributes such as operation system and test conditions such as access capacity, server distance, TCP window size, time-of-day, and network segment size. In this paper, we draw inspiration from observational studies in medicine, which face a similar challenge in comparing the effect of treatments on patients with diverse characteristics, and have successfully tackled this using "causal inference" techniques for {\em post facto} analysis of medical records. Our first contribution is to develop a tool to pre-process and visualize the millions of data points in M-Lab at various time- and space-granularities to get preliminary insights on factors affecting broadband performance. Next, we analyze 24 months of data pertaining to twelve ISPs across three countries, and demonstrate that there is observational bias in the data due to disparities amongst ISPs in their attribute distributions. For our third contribution, we apply a multi-variate matching method to identify suitable cohorts that can be compared without bias, which reveals that ISPs are closer in performance than thought before. Our final contribution is to refine our model by developing a method for estimating speed-tier and re-apply matching for comparison of ISP performance. Our results challenge conventional rankings of ISPs, and pave the way towards data-driven approaches for unbiased comparisons of ISPs world-wide.
翻译:比较宽带速度的互联网服务供应商是具有挑战性的,因为测量工作可能因用户属性而有所不同,如操作系统以及接入能力、服务器距离、TCP窗口大小、每天时间和网络段大小等测试条件等用户属性而有所不同。在本文件中,我们从医学观测研究中得到启发,这些观测研究在比较治疗对具有不同特征的患者的影响方面面临着类似的挑战,并成功地解决了这一难题,对医疗记录进行分析时使用了“因果关系”技术。我们的第一个贡献是开发一个工具,用于在各种时间和空间特征上对M-Lab的数百万个数据点进行预处理和可视化,以初步了解影响宽带绩效的因素。接下来,我们分析了与三个国家12个互联网服务供应商有关的24个月的数据,表明由于互联网服务供应商在属性分布上存在差异,数据存在观测偏差,因此在数据中存在观察偏差。关于医疗记录分析的第三个贡献,我们采用了多种变量匹配方法,以确定可以无偏差地比较的合适组群,这显示互联网服务供应商在业绩方面比以前想象得近于想象。我们对互联网服务供应商最终评估的方法,以精确地评估了我们全球范围数据排名。