Causal decision making (CDM) based on machine learning has become a routine part of business. Businesses algorithmically target offers, incentives, and recommendations to affect consumer behavior. Recently, we have seen an acceleration of research related to CDM and causal effect estimation (CEE) using machine-learned models. This article highlights an important perspective: CDM is not the same as CEE, and counterintuitively, accurate CEE is not necessary for accurate CDM. Our experience is that this is not well understood by practitioners or most researchers. Technically, the estimand of interest is different, and this has important implications both for modeling and for the use of statistical models for CDM. We draw on prior research to highlight three implications. (1) We should consider carefully the objective function of the causal machine learning, and if possible, optimize for accurate treatment assignment rather than for accurate effect-size estimation. (2) Confounding does not have the same effect on CDM as it does on CEE. The upshot is that for supporting CDM it may be just as good or even better to learn with confounded data as with unconfounded data. Finally, (3) causal statistical modeling may not be necessary to support CDM because a proxy target for statistical modeling might do as well or better. This third observation helps to explain at least one broad common CDM practice that seems wrong at first blush: the widespread use of non-causal models for targeting interventions. The last two implications are particularly important in practice, as acquiring (unconfounded) data on all counterfactuals can be costly and often impracticable. These observations open substantial research ground. We hope to facilitate research in this area by pointing to related articles from multiple contributing fields, including two dozen articles published the last three to four years.
翻译:以机器学习为基础的因果决策(CDM)已经成为日常业务的一部分。从逻辑学角度讲,商业目标提供、激励和对消费者行为产生影响的建议都成为了日常业务的一部分。最近,我们看到利用机器学习模型加速了与清洁发展机制和因果估计(CEE)有关的研究,这篇文章强调了一个重要的观点:清洁发展机制与中东欧不同,反直觉而言,准确的中东欧对于准确的清洁发展机制并不必要。我们的经验是,实践者或大多数研究人员对此并不十分了解。在技术上,利益估计和兴趣不同,这往往对清洁发展机制的建模和使用统计模型都具有重要影响。我们借助先前的研究来突出三种影响:(1) 我们应仔细考虑因果机学习和因果估计(CEEE)的客观功能,如果可能的话,优化治疗任务分配,而不是精确的效应估计。(2) 纠结对清洁发展机制的影响与中东欧不同。 归根结是,支持清洁发展机制的最后一点是,通过提供没有根据的数据来进行直截实的数据,可能更好或更好理解。最后一点是,(3) 错点的统计模型观测可能没有必要,在两个领域进行非直接的方面进行。