As a basic research problem for building effective recommender systems, post-click conversion rate (CVR) estimation has long been plagued by sample selection bias and data sparsity issues. To address the data sparsity issue, prevalent methods based on entire space multi-task model leverage the sequential pattern of user actions, i.e. exposure $\rightarrow$ click $\rightarrow$ conversion to construct auxiliary learning tasks. However, they still fall short of guaranteeing the unbiasedness of CVR estimates. This paper theoretically demonstrates two defects of these entire space multi-task models: (1) inherent estimation bias (IEB) for CVR estimation, where the CVR estimate is inherently higher than the ground truth; (2) potential independence priority (PIP) for CTCVR estimation, where the causality from click to conversion might be overlooked. This paper further proposes a principled method named entire space counterfactual multi-task model (ESCM$^2$), which employs a counterfactual risk minimizer to handle both IEB and PIP issues at once. To demonstrate the effectiveness of the proposed method, this paper explores its parameter tuning in practice, derives its analytic properties, and showcases its effectiveness in industrial CVR estimation, where ESCM$^2$ can effectively alleviate the intrinsic IEB and PIP issues and outperform baseline models.
翻译:作为建立有效建议系统的基本研究问题,点击后转换率(CVR)估算长期以来一直受到抽样选择偏差和数据宽度问题的困扰。为解决数据宽度问题,基于整个空间多任务模型的普遍方法利用了用户行动的顺序模式,即接触$\rightrowle$点击$\rightrowl$@rightrowlear$ 转换为构建辅助学习任务。然而,它们仍然不能保证CVR估算的公正性。本文理论上显示了整个空间多任务模型的两个缺陷:(1) CVR估算的内在估计偏差(IEB),CVR估算必然高于地面真相;(2) CCTVR估算的潜在独立优先事项(PIP),从点击到转换的因果关系可能被忽视。本文还进一步提出了名为整个空间反现实多任务模型(ESCM$%2$)的原则方法,该方法使用反现实风险最小化器处理IEB和PIP问题。为了展示拟议方法的有效性,本文件探讨了CVR值的参数在降低ISR2 和IR 基线问题上的内在有效性。