In representation learning, there has been recent interest in developing algorithms to disentangle the ground-truth generative factors behind data, and metrics to quantify how fully this occurs. However, these algorithms and metrics often assume that both representations and ground-truth factors are flat, continuous, and factorized, whereas many real-world generative processes involve rich hierarchical structure, mixtures of discrete and continuous variables with dependence between them, and even varying intrinsic dimensionality. In this work, we develop benchmarks, algorithms, and metrics for learning such hierarchical representations.
翻译:在代表性学习方面,最近人们有兴趣发展各种算法,以解开数据背后的地面真象变异因素,并制订衡量标准,以量化这种变异是如何完全发生的。 但是,这些算法和衡量标准往往假定,表象和地面真象因素都是平坦的、持续的和因数化的,而许多现实世界的基因化过程涉及丰富的等级结构、离散的和连续的变数的混合,它们彼此依赖,甚至各种内在的维度。 在这项工作中,我们为学习这种等级表态制定了基准、算法和衡量标准。