During the last few decades, online controlled experiments (also known as A/B tests) have been adopted as a golden standard for measuring business improvements in industry. In our company, there are more than a billion users participating in thousands of experiments simultaneously, and with statistical inference and estimations conducted to thousands of online metrics in those experiments routinely, computational costs would become a large concern. In this paper we propose a novel algorithm for estimating the covariance of online metrics, which introduces more flexibility to the trade-off between computational costs and precision in covariance estimation. This covariance estimation method reduces computational cost of metric calculation in large-scale setting, which facilitates further application in both online controlled experiments and adaptive experiments scenarios like variance reduction, continuous monitoring, Bayesian optimization, etc., and it can be easily implemented in engineering practice.
翻译:在过去几十年中,在线控制实验(又称A/B测试)被采纳为衡量行业商业改进的黄金标准。 在我们的公司,有超过10亿用户同时参与数千项实验,并且对这些实验中成千上万的在线指标进行统计推论和估计,计算成本将成为一个大问题。 在本文中,我们提出了一个用于估计在线指标的共差的新算法,这为计算成本和常变估算精确度之间的权衡提供了更大的灵活性。这种共变估算法降低了大规模设置的计量计算计算成本,这有利于进一步应用在线控制实验和适应性实验情景,如差异减少、持续监测、贝叶斯优化等,并且可以在工程实践中实施。