Disentanglement via mechanism sparsity was introduced recently as a principled approach to extract latent factors without supervision when the causal graph relating them in time is sparse, and/or when actions are observed and affect them sparsely. However, this theory applies only to ground-truth graphs satisfying a specific criterion. In this work, we introduce a generalization of this theory which applies to any ground-truth graph and specifies qualitatively how disentangled the learned representation is expected to be, via a new equivalence relation over models we call consistency. This equivalence captures which factors are expected to remain entangled and which are not based on the specific form of the ground-truth graph. We call this weaker form of identifiability partial disentanglement. The graphical criterion that allows complete disentanglement, proposed in an earlier work, can be derived as a special case of our theory. Finally, we enforce graph sparsity with constrained optimization and illustrate our theory and algorithm in simulations.
翻译:最近,作为一种原则性的方法,通过机制宽度解析方法,在不监督的情况下,当与它们相关的因果图在时间上稀少时,和/或当行动被观察并影响很少时,作为一种原则性的方法,在不监督的情况下,提取潜在因素。然而,这一理论只适用于符合特定标准的地面真象图。在这项工作中,我们引入了该理论的概括化,该理论适用于任何地面真象图,并从质量上指明了通过与我们称之为一致性的模型的新等同关系,预期所学的表达方式会如何分解。这种等同性捕捉了那些预期会一直缠绕在一起并且并非基于地面真象图具体形式的因素。我们称之为这种较弱的可识别性形式部分解析。在早期工作中提出的允许完全分离的图形标准可以作为我们理论的一个特例加以推导出。最后,我们用限制的优化来强制执行图形散乱,并在模拟中说明我们的理论和算法。