We consider the problem of answering observational, interventional, and counterfactual queries in a causally sufficient setting where only observational data and the causal graph are available. Utilizing the recent developments in diffusion models, we introduce diffusion-based causal models (DCM) to learn causal mechanisms, that generate unique latent encodings to allow for direct sampling under interventions as well as abduction for counterfactuals. We utilize DCM to model structural equations, seeing that diffusion models serve as a natural candidate here since they encode each node to a latent representation, a proxy for the exogenous noise, and offer flexible and accurate modeling to provide reliable causal statements and estimates. Our empirical evaluations demonstrate significant improvements over existing state-of-the-art methods for answering causal queries. Our theoretical results provide a methodology for analyzing the counterfactual error for general encoder/decoder models which could be of independent interest.
翻译:我们在一个只有观察数据和因果图表的因果充足环境中考虑回答观察、干预和反事实询问的问题。我们利用扩散模型的最新发展,采用基于扩散的因果模型来学习因果关系机制,产生独特的潜在编码,以便在干预下进行直接取样,并绑架反事实。我们利用数据元数据模型来模拟结构方程,看到扩散模型在这里充当自然候选方,因为它们将每个节点编码成潜在代表,代表外生噪音,并提供灵活和准确的模型,以提供可靠的因果说明和估计。我们的经验评估表明,对回答因果询问的现有最新方法有了重大改进。我们的理论结果为分析一般编码/解码模型的反事实错误提供了一种方法,而这些模型可能具有独立的兴趣。