In many applications, there is a need to predict the effect of an intervention on different individuals from data. For example, which customers are persuadable by a product promotion? which patients should be treated with a certain type of treatment? These are typical causal questions involving the effect or the change in outcomes made by an intervention. The questions cannot be answered with traditional classification methods as they only use associations to predict outcomes. For personalised marketing, these questions are often answered with uplift modelling. The objective of uplift modelling is to estimate causal effect, but its literature does not discuss when the uplift represents causal effect. Causal heterogeneity modelling can solve the problem, but its assumption of unconfoundedness is untestable in data. So practitioners need guidelines in their applications when using the methods. In this paper, we use causal classification for a set of personalised decision making problems, and differentiate it from classification. We discuss the conditions when causal classification can be resolved by uplift (and causal heterogeneity) modelling methods. We also propose a general framework for causal classification, by using off-the-shelf supervised methods for flexible implementations. Experiments have shown two instantiations of the framework work for causal classification and for uplift (causal heterogeneity) modelling, and are competitive with the other uplift (causal heterogeneity) modelling methods.
翻译:在许多应用中,需要预测干预对数据中不同个人的影响。例如,哪些客户可以被产品促销所相信?哪些客户可以接受产品促销?哪些病人应当得到某种类型的治疗?这些是典型的因果问题,涉及干预的效果或结果的变化。这些问题不能用传统的分类方法解答,因为它们只使用协会来预测结果。对于个性化营销,这些问题往往用提升模型解答。提升模型的目的是估计因果关系,但是其文献并不讨论何时上升效应代表因果效应。 碱性异性建模可以解决问题,但在数据中无法检验其无根据的假设。因此,从业者在使用方法时需要应用准则。在本文中,我们使用因果性分类来回答一系列个性化决策问题,并将其与分类区分开来。我们讨论因果分类可以通过提升(和因果异性异性)建模方法解决的条件。我们还提出一个因果关系分类的一般框架,即使用非受监督的方法来灵活实施。实验性模型化和具有竞争性的模型化方法。