Causal decision making (CDM) at scale has become a routine part of business, and increasingly CDM is based on machine learning algorithms. For example, businesses often target offers, incentives, and recommendations with the goal of affecting consumer behavior. Recently, we have seen an acceleration of research related to CDM and to causal effect estimation (CEE) using machine learned models. This article highlights an important perspective: CDM is not the same as CEE, and counterintuitively, accurate CEE is not necessary for accurate CDM. Our experience is that this is not well understood by practitioners nor by most researchers. Technically, the estimand of interest is different, and this has important implications both for modeling and for the use of statistical models for CDM. We draw on recent research to highlight three of these implications. (1) We should carefully consider the objective function of the causal machine learning, and if possible, we should optimize for accurate "treatment assignment" rather than for accurate effect-size estimation. (2) Confounding does not have the same effect on CDM as it does on CEE. The upshot here is that for supporting CDM it may be just as good to learn with confounded data as with unconfounded data. Finally, (3) causal statistical modeling may not be necessary at all to support CDM, because there may be (and perhaps often is) a proxy target for statistical modeling that can do as well or better. This observation helps to explain at least one broad common CDM practice that seems "wrong" at first blush: the widespread use of non-causal models for targeting interventions. Our perspective is that these observations open up substantial fertile ground for future research. Whether or not you share our perspective completely, we hope we facilitate future research in this area by pointing to related articles from multiple contributing fields.
翻译:规模的因果决策(CDM)已成为商业的常规部分,清洁发展机制越来越多地以机器学习算法为基础。例如,企业往往以影响消费者行为为目标的提供、激励和建议为目标。最近,我们看到了与清洁发展机制和因果估计有关的研究的加速,使用了机器学习模型。这一条强调了一个重要的观点:清洁发展机制与中东欧不同,反直觉而言,准确的中东欧对于准确的清洁发展机制来说并不必要。我们的经验是,实践者或大多数研究人员对此没有很好地理解。从技术上讲,兴趣的估量并不不同,这对清洁发展机制的建模和使用统计模型都具有重要影响。我们最近的研究突出了其中的三种影响。 (1) 我们应该仔细考虑因果机学习的客观功能,如果可能的话,我们应该优化准确的“治疗任务”而不是准确的效应估测。(2) 重新定位对清洁发展机制没有像对东欧那样产生同样的影响。这里的回顾是,为了支持CDM的建模,在非目标方面可能很好地学习了非理性的观察模型,也许有助于我们未来的统计目标,因为统计学界数据可能不完全地分享。