Metrics provide strong evidence to support hypotheses in online experimentation and hence reduce debates in the decision-making process. In this work, we introduce the concept of dropout buyers and categorize users with incomplete metric values into two groups: visitors and dropout buyers. For the analysis of incomplete metrics, we propose a cluster-based k-nearest neighbors-based imputation method. Our proposed imputation method considers both the experiment-specific features and users' activities along their shopping paths, allowing different imputation values for different users. To facilitate efficient imputation in large-scale data sets in online experimentation, the proposed method uses a combination of stratification and clustering. The performance of the proposed method was compared to several conventional methods in a past experiment at eBay.
翻译:在这项工作中,我们引入了辍学买家的概念,并将不完全的衡量值用户分为两类:访客和辍学买家。为了分析不完全的衡量标准,我们建议了一个基于集群的以近邻为基础的估算方法。我们提议的估算方法既考虑实验的具体特点,也考虑用户在购物路径上的活动,允许不同用户使用不同的估算值。为了便利在线实验中大规模数据集的高效估算,拟议方法采用了分层和组合组合组合相结合的方法。拟议方法的绩效与电子Bay以往试验中的一些常规方法进行了比较。