One of the difficulties of conversion rate (CVR) prediction is that the conversions can delay and take place long after the clicks. The delayed feedback poses a challenge: fresh data are beneficial to continuous training but may not have complete label information at the time they are ingested into the training pipeline. To balance model freshness and label certainty, previous methods set a short waiting window or even do not wait for the conversion signal. If conversion happens outside the waiting window, this sample will be duplicated and ingested into the training pipeline with a positive label. However, these methods have some issues. First, they assume the observed feature distribution remains the same as the actual distribution. But this assumption does not hold due to the ingestion of duplicated samples. Second, the certainty of the conversion action only comes from the positives. But the positives are scarce as conversions are sparse in commercial systems. These issues induce bias during the modeling of delayed feedback. In this paper, we propose DElayed FEedback modeling with Real negatives (DEFER) method to address these issues. The proposed method ingests real negative samples into the training pipeline. The ingestion of real negatives ensures the observed feature distribution is equivalent to the actual distribution, thus reducing the bias. The ingestion of real negatives also brings more certainty information of the conversion. To correct the distribution shift, DEFER employs importance sampling to weigh the loss function. Experimental results on industrial datasets validate the superiority of DEFER. DEFER have been deployed in the display advertising system of Alibaba, obtaining over 6.0% improvement on CVR in several scenarios. The code and data in this paper are now open-sourced {https://github.com/gusuperstar/defer.git}.
翻译:转换率( CVR) 预测的一个困难是, 转换率( CVR) 的难度之一是, 转换可能会延迟, 并在点击后很久才发生。 延迟的反馈带来了挑战: 新的数据有利于持续的培训, 但可能没有完整的标签信息 。 为了平衡模型的新鲜性和标签确定性, 先前的方法设置了一个短暂的等待窗口, 甚至不等待转换信号 。 如果转换发生在等待窗口之外, 将会复制这个样本, 并用一个正面标签将它输入培训管道。 但是, 这些方法存在一些问题。 首先, 它们假设观察到的特征分布与实际的发布相同。 但是, 这个假设并不保存, 是因为它们摄入了重复的样本。 其次, 转换行动的确定性仅来自正数。 但是, 这些问题会在模拟反馈过程中产生偏差。 在本文中, 我们建议用真实的数值模型来模拟 FEEEDER( DeFER) 获取解决这些问题的方法。 拟议的方法, 正在将实际的降价值显示的显示值显示与实际的显示值的显示值值值的显示值的显示值的显示值值值的显示为正值。 因此,, 将数据转换为实际的分布为正值。