Causal Machine Learning (CausalML) is an umbrella term for machine learning methods that formalize the data-generation process as a structural causal model (SCM). This allows one to reason about the effects of changes to this process (i.e., interventions) and what would have happened in hindsight (i.e., counterfactuals). We categorize work in \causalml into five groups according to the problems they tackle: (1) causal supervised learning, (2) causal generative modeling, (3) causal explanations, (4) causal fairness, (5) causal reinforcement learning. For each category, we systematically compare its methods and point out open problems. Further, we review modality-specific applications in computer vision, natural language processing, and graph representation learning. Finally, we provide an overview of causal benchmarks and a critical discussion of the state of this nascent field, including recommendations for future work.
翻译:Causal Machinening(Causal Machinening)是机械学习方法的总括术语,将数据生成过程正规化为结构性因果模型(SCM),使人们可以解释这一过程变化的影响(即干预)和事后观察(即反事实)中会发生什么。我们根据所处理的问题,将\causmal的工作分为五组:(1)因果监督学习,(2)因果分类模型,(3)因果解释,(4)因果公正,(5)因果强化学习。我们系统地比较其方法,指出未解决的问题。此外,我们审查计算机愿景、自然语言处理和图表表述学习中特定模式的应用。最后,我们概述了因果基准,并批判性地讨论了这个新生领域的状况,包括为未来工作提出的建议。