Variational autoencoders (VAEs) and other generative methods have garnered growing interest not just for their generative properties but also for the ability to dis-entangle a low-dimensional latent variable space. However, few existing generative models take causality into account. We propose a new decoder based framework named the Causal Counterfactual Generative Model (CCGM), which includes a partially trainable causal layer in which a part of a causal model can be learned without significantly impacting reconstruction fidelity. By learning the causal relationships between image semantic labels or tabular variables, we can analyze biases, intervene on the generative model, and simulate new scenarios. Furthermore, by modifying the causal structure, we can generate samples outside the domain of the original training data and use such counterfactual models to de-bias datasets. Thus, datasets with known biases can still be used to train the causal generative model and learn the causal relationships, but we can produce de-biased datasets on the generative side. Our proposed method combines a causal latent space VAE model with specific modification to emphasize causal fidelity, enabling finer control over the causal layer and the ability to learn a robust intervention framework. We explore how better disentanglement of causal learning and encoding/decoding generates higher causal intervention quality. We also compare our model against similar research to demonstrate the need for explicit generative de-biasing beyond interventions. Our initial experiments show that our model can generate images and tabular data with high fidelity to the causal framework and accommodate explicit de-biasing to ignore undesired relationships in the causal data compared to the baseline.
翻译:变异自动编码器(VAE)和其他基因方法不仅因其基因特性而引起越来越多的兴趣,而且因其分解低维潜伏变异空间的能力而引起越来越多的兴趣。然而,现有的变异自动编码器很少考虑到因果关系。我们提议了一个新的变异自动编码器框架,称为Causal 反动生成模型(CCGM),其中包括一个部分可训练的因果层,在这个层中,可以学习因果模型的一部分,而不会对重建的忠诚产生重大影响。通过了解图像正义标签或表列变量之间的因果关系,我们可以分析偏差,干预基因变异模型,并模拟新情景。此外,通过修改因果结构,我们可以产生原始培训数据外的样本,并利用这种反相貌模型去动性模型来降低偏见的数据集。因此,仍然可以使用已知的偏差数据集来训练因果关系模型,并学习因果关系关系,但我们可以在基因化方面产生不偏差的数据。我们提出的方法将一个不因果隐性更深的空间模型结合起来,对基因变异的基因模型进行干涉模型,并具体地调整了因果性数据变正值分析能力,以强调因果关系的因果关系/正统化框架。我们如何学会学会学会学会学会学会学会学习如何学会研究。我们如何研究。我们如何研究。