This paper proposes a Disentangled gEnerative cAusal Representation (DEAR) learning method. Unlike existing disentanglement methods that enforce independence of the latent variables, we consider the general case where the underlying factors of interests can be causally correlated. We show that previous methods with independent priors fail to disentangle causally correlated factors. Motivated by this finding, we propose a new disentangled learning method called DEAR that enables causal controllable generation and causal representation learning. The key ingredient of this new formulation is to use a structural causal model (SCM) as the prior for a bidirectional generative model. The prior is then trained jointly with a generator and an encoder using a suitable GAN loss incorporated with supervision. We provide theoretical justification on the identifiability and asymptotic consistency of the proposed method, which guarantees disentangled causal representation learning under appropriate conditions. We conduct extensive experiments on both synthesized and real data sets to demonstrate the effectiveness of DEAR in causal controllable generation, and the benefits of the learned representations for downstream tasks in terms of sample efficiency and distributional robustness.
翻译:本文提出了一种分解的CAusal代表制(DEAR)学习方法。与现有的使潜在变量具有独立性的分解方法不同,我们考虑的是利害相关因素可能因果关联的一般情况。我们表明,以前具有独立前科的方法没有解脱因果关系相关因素。根据这一发现,我们提出了一种新的分解的学习方法,称为DEAR, 能够控制因果生成和因果代表制学习。这一新配方的关键内容是使用结构性因果模型作为双向遗传模型的先导。先由发电机和编码器共同培训,使用适当的GAN损失与监督相结合的GAN损失。我们从理论上说明拟议方法的可识别性和一致性,保证在适当条件下进行分解因果代表制学习。我们对综合和真实数据集进行广泛的实验,以证明DEAR在可控因果生成方面的有效性,以及在抽样效率和分布稳健性方面为下游任务所了解的表述方式的好处。