Uplift modeling is crucial in various applications ranging from marketing and policy-making to personalized recommendations. The main objective is to learn optimal treatment allocations for a heterogeneous population. A primary line of existing work modifies the loss function of the decision tree algorithm to identify cohorts with heterogeneous treatment effects. Another line of work estimates the individual treatment effects separately for the treatment group and the control group using off-the-shelf supervised learning algorithms. The former approach that directly models the heterogeneous treatment effect is known to outperform the latter in practice. However, the existing tree-based methods are mostly limited to a single treatment and a single control use case, except for a handful of extensions to multiple discrete treatments. In this paper, we fill this gap in the literature by proposing a generalization to the tree-based approaches to tackle multiple discrete and continuous-valued treatments. We focus on a generalization of the well-known causal tree algorithm due to its desirable statistical properties, but our generalization technique can be applied to other tree-based approaches as well. We perform extensive experiments to showcase the efficacy of our method when compared to other methods.
翻译:在从营销和决策到个性化建议等各种应用中,升级模型至关重要。主要目标是为不同人群学习最佳的治疗分配。现有工作的主要一线改变了决策树算法的损失功能,以识别具有不同治疗效果的组群。另一线工作分别估计治疗组和对照组使用现成监督的学习算法的个别治疗效果。已知直接模拟异性治疗效应的先前方法在实践中优于后者。然而,现有基于树种的方法大多限于单一的治疗和单一的控制使用案例,但多种离散治疗的少数扩展除外。在本文件中,我们通过建议对基于树种的方法进行概括化,以解决多种离散和持续受重视的治疗方法。我们注重对众所周知的因果树算法的概括化,因为其理想的统计特性,但我们的一般化技术也可以适用于其他基于树种的方法。我们进行了广泛的实验,以展示我们方法与其他方法相比的功效。