Causal inference is essential for data-driven decision making across domains such as business engagement, medical treatment or policy making. However, research on causal discovery and inference has evolved separately, and the combination of the two domains is not trivial. In this work, we develop Deep End-to-end Causal Inference (DECI), a single flow-based method that takes in observational data and can perform both causal discovery and inference, including conditional average treatment effect (CATE) estimation. We provide a theoretical guarantee that DECI can recover the ground truth causal graph under mild assumptions. In addition, our method can handle heterogeneous, real-world, mixed-type data with missing values, allowing for both continuous and discrete treatment decisions. Moreover, the design principle of our method can generalize beyond DECI, providing a general End-to-end Causal Inference (ECI) recipe, which enables different ECI frameworks to be built using existing methods. Our results show the superior performance of DECI when compared to relevant baselines for both causal discovery and (C)ATE estimation in over a thousand experiments on both synthetic datasets and other causal machine learning benchmark datasets.
翻译:原因推断对于在商业参与、医疗或决策等不同领域进行数据驱动决策至关重要。然而,关于因果关系的发现和推断的研究是分别演变而成的,这两个领域的结合并非微不足道。在这项工作中,我们开发了深端至端原因推断(DECI),这是一种单一的基于流动的方法,在观察数据中采用,可以同时进行因果关系的发现和推断,包括有条件平均治疗效果的估算。我们提供了理论保证,DECI可以在轻度假设下恢复地面真相因果图表。此外,我们的方法可以处理不同、真实世界、混合类型的数据,并有缺失的值,同时允许连续和离散的治疗决定。此外,我们方法的设计原则除了DECI之外,还可以概括我们的方法的设计原则,提供一般的端端至端原因推断(ECI)食谱,从而能够利用现有方法构建不同的ECI框架。我们的结果显示,DECI在与相关的因果关系发现基准和(C)ATE估计基准相比,在1,000多个合成数据设置和其他因果机器学习基准数据的实验中,其表现优。