Causal inference is a critical research topic across many domains, such as statistics, computer science, education, public policy and economics, for decades. Nowadays, estimating causal effect from observational data has become an appealing research direction owing to the large amount of available data and low budget requirement, compared with randomized controlled trials. Embraced with the rapidly developed machine learning area, various causal effect estimation methods for observational data have sprung up. In this survey, we provide a comprehensive review of causal inference methods under the potential outcome framework, one of the well known causal inference framework. The methods are divided into two categories depending on whether they require all three assumptions of the potential outcome framework or not. For each category, both the traditional statistical methods and the recent machine learning enhanced methods are discussed and compared. The plausible applications of these methods are also presented, including the applications in advertising, recommendation, medicine and so on. Moreover, the commonly used benchmark datasets as well as the open-source codes are also summarized, which facilitate researchers and practitioners to explore, evaluate and apply the causal inference methods.
翻译:几十年来,对观察数据的因果影响进行估计已成为一个令人感兴趣的研究方向,因为与随机控制的试验相比,现有数据数量巨大,预算需求较低。与快速开发的机器学习领域相比,观测数据的各种因果影响估计方法已经出现。在这次调查中,我们全面审查了潜在成果框架下的因果推断方法,这是众所周知的因果推断框架之一。这些方法分为两类,取决于它们是否需要对潜在成果框架的所有三个假设。对于每一类,都讨论和比较传统统计方法和最近的机器学习强化方法。还介绍了这些方法的合理应用,包括在广告、建议、医学等方面的应用。此外,还总结了通常使用的基准数据集以及开放源代码,以便于研究人员和从业人员探讨、评价和应用因果推断方法。