A central question in many fields of scientific research is to determine how an outcome would be affected by an action, or to measure the effect of an action (a.k.a treatment effect). In recent years, a need for estimating the heterogeneous treatment effects conditioning on the different characteristics of individuals has emerged from research fields such as personalized healthcare, social science, and online marketing. To meet the need, researchers and practitioners from different communities have developed algorithms by taking the treatment effect heterogeneity modeling approach and the uplift modeling approach, respectively. In this paper, we provide a unified survey of these two seemingly disconnected yet closely related approaches under the potential outcome framework. We then provide a structured survey of existing methods by emphasizing on their inherent connections with a set of unified notations to make comparisons of the different methods easy. We then review the main applications of the surveyed methods in personalized marketing, personalized medicine, and social studies. Finally, we summarize the existing software packages and present discussions based on the use of methods on synthetic, semi-synthetic and real world data sets and provide some general guidelines for choosing methods.
翻译:在许多科学研究领域,一个核心问题是确定一项行动将如何影响一项结果,或衡量一项行动的效果(a.k.a.a.a.a.a.a.a.a.a.a.a.a.a.a.a.a.a.a.a.治疗效果)。近年来,需要从个人化保健、社会科学和在线销售等研究领域来估计以个人不同特征为条件的不同治疗效果。为了满足这种需要,来自不同社区的研究人员和从业者分别采用了治疗效应异质模型和升级模型方法,从而发展了算法。在本文件中,我们根据可能的成果框架,对这两种似乎互不相干但密切相关的方法进行了统一调查。然后,我们通过强调这两种方法与一套统一标记的内在联系,使不同方法的比较变得容易。然后我们审查所调查的方法在个性化营销、个性化医学和社会研究方面的主要应用。最后,我们总结现有的软件包和介绍关于合成、半合成和真实世界数据集方法的使用的讨论,并提供一些选择方法的一般准则。