Online advertising aims to increase user engagement and maximize revenue, but users respond heterogeneously to ad exposure. Some users purchase only when exposed to ads, while others purchase regardless of exposure, and still others never purchase. This heterogeneity can be characterized by latent response types, commonly referred to as principal strata, defined by users' joint potential outcomes under exposure and non-exposure. However, users' true strata are unobserved, making direct analysis infeasible. In this article, instead of learning the true strata, we propose a novel approach that learns users' pseudo-strata by leveraging information from an outcome (revenue) observed after the response (purchase). We construct pseudo-strata to classify users and introduce misclassification rewards to quantify the expected revenue gain of pseudo-strata-based policies relative to true strata. Within a Bayesian classification framework, we learn the pseudo-strata by optimizing the expected revenue. To implement these procedures, we introduce identification assumptions and estimation methods, and establish their large-sample properties. Simulation studies show that the proposed method achieves more accurate strata classification and substantially higher revenue than baselines. We further illustrate the method using a large-scale industrial dataset from the Criteo Predictive Search Platform.
翻译:在线广告旨在提升用户参与度并最大化收益,但用户对广告曝光的响应存在异质性。部分用户仅在广告曝光后购买,另一些用户无论是否曝光均会购买,而其他用户则从不购买。这种异质性可通过潜在响应类型(通常称为主层)来刻画,其定义为用户在曝光与非曝光条件下的联合潜在结果。然而,用户的真实层不可观测,导致直接分析不可行。本文提出一种新方法,不学习真实层,而是利用响应(购买)后观测到的结果(收益)信息来学习用户的伪层。我们构建伪层以对用户进行分类,并引入误分类奖励来量化基于伪层的策略相对于真实层的预期收益增益。在贝叶斯分类框架下,我们通过优化预期收益来学习伪层。为实现这些流程,我们提出了识别假设与估计方法,并建立了其大样本性质。仿真研究表明,所提方法相比基线实现了更精确的层分类及显著更高的收益。我们进一步使用来自Criteo预测搜索平台的大规模工业数据集对该方法进行了实证说明。