Conducting experiments with objectives that take significant delays to materialize (e.g. conversions, add-to-cart events, etc.) is challenging. Although the classical "split sample testing" is still valid for the delayed feedback, the experiment will take longer to complete, which also means spending more resources on worse-performing strategies due to their fixed allocation schedules. Alternatively, adaptive approaches such as "multi-armed bandits" are able to effectively reduce the cost of experimentation. But these methods generally cannot handle delayed objectives directly out of the box. This paper presents an adaptive experimentation solution tailored for delayed binary feedback objectives by estimating the real underlying objectives before they materialize and dynamically allocating variants based on the estimates. Experiments show that the proposed method is more efficient for delayed feedback compared to various other approaches and is robust in different settings. In addition, we describe an experimentation product powered by this algorithm. This product is currently deployed in the online experimentation platform of JD.com, a large e-commerce company and a publisher of digital ads.
翻译:虽然传统的“分样抽样测试”对于延迟反馈仍然有效,但实验还需要更长时间才能完成,这也意味着将更多资源用于因固定分配时间表而表现不佳的战略上。 或者,“多武装强盗”等适应性方法能够有效降低实验成本。但这些方法一般无法直接从盒子中处理延迟目标。本文介绍了适应性实验解决方案,通过估计实际基本目标实现前和根据估计动态分配变量来适应延迟的二进制反馈目标。实验表明,与各种其他方法相比,拟议方法比延迟反馈更有效,在不同环境中也很健全。此外,我们描述了由这种算法驱动的实验产品。这一产品目前被安装在JD.com(一家大型电子商务公司和一个数字广告出版商)的在线实验平台上。