Different from large-scale platforms such as Taobao and Amazon, developing CVR models in small-scale recommendation scenarios is more challenging due to the severe Data Distribution Fluctuation (DDF) issue. DDF prevents existing CVR models from being effective since 1) several months of data are needed to train CVR models sufficiently in small scenarios, leading to considerable distribution discrepancy between training and online serving; and 2) e-commerce promotions have much more significant impacts on small scenarios, leading to distribution uncertainty of the upcoming time period. In this work, we propose a novel CVR method named MetaCVR from a perspective of meta learning to address the DDF issue. Firstly, a base CVR model which consists of a Feature Representation Network (FRN) and output layers is elaborately designed and trained sufficiently with samples across months. Then we treat time periods with different data distributions as different occasions and obtain positive and negative prototypes for each occasion using the corresponding samples and the pre-trained FRN. Subsequently, a Distance Metric Network (DMN) is devised to calculate the distance metrics between each sample and all prototypes to facilitate mitigating the distribution uncertainty. At last, we develop an Ensemble Prediction Network (EPN) which incorporates the output of FRN and DMN to make the final CVR prediction. In this stage, we freeze the FRN and train the DMN and EPN with samples from recent time period, therefore effectively easing the distribution discrepancy. To the best of our knowledge, this is the first study of CVR prediction targeting the DDF issue in small-scale recommendation scenarios. Experimental results on real-world datasets validate the superiority of our MetaCVR and online A/B test also shows our model achieves impressive gains of 11.92% on PCVR and 8.64% on GMV.
翻译:与大型平台不同,如道保和亚马逊平台不同,在小规模建议情景中开发CVR模型由于数据分布波动问题严重,更具挑战性。DDF防止现有的CVR模型产生效力,因为1)需要几个月的数据在小型情景中充分培训CVR模型,从而导致培训和在线服务之间在分布上的巨大差异;和(2)电子商务推广对小型情景的影响要大得多,导致未来时间段的分布不确定性。在这项工作中,我们从元学习的角度提出了名为MetACVR的新型CVR方法,以解决DDDF问题。首先,一个基础CVR模型,包括一个功能代表网络(FRN)和产出层,经过精心设计和培训,连续几个月对CVR模型模型进行充分的培训,随后,我们利用相应的样本和经过预先培训的FRNFN模型,对每个样本和所有原型模型的MV模型的分布结果都进行了计算。我们从每个样本和所有DMR模型中得出了远程MFF的距离指标值。最后的C测试期,我们从目前的C测试阶段和RFR模型的输出数据流化数据流化数据流化数据流流到最新数据流数据流化数据流数据流数据流数据流数据流,最后,我们最后的CFRMFR数据流数据流数据流数据流数据流到最新数据流数据流数据流到最新数据流数据流数据流。