This work proposes a new computational framework for learning an explicit generative model for real-world datasets. In particular we propose to learn {\em a closed-loop transcription} between a multi-class multi-dimensional data distribution and a { linear discriminative representation (LDR)} in the feature space that consists of multiple independent multi-dimensional linear subspaces. In particular, we argue that the optimal encoding and decoding mappings sought can be formulated as the equilibrium point of a {\em two-player minimax game between the encoder and decoder}. A natural utility function for this game is the so-called {\em rate reduction}, a simple information-theoretic measure for distances between mixtures of subspace-like Gaussians in the feature space. Our formulation draws inspiration from closed-loop error feedback from control systems and avoids expensive evaluating and minimizing approximated distances between arbitrary distributions in either the data space or the feature space. To a large extent, this new formulation unifies the concepts and benefits of Auto-Encoding and GAN and naturally extends them to the settings of learning a {\em both discriminative and generative} representation for multi-class and multi-dimensional real-world data. Our extensive experiments on many benchmark imagery datasets demonstrate tremendous potential of this new closed-loop formulation: under fair comparison, visual quality of the learned decoder and classification performance of the encoder is competitive and often better than existing methods based on GAN, VAE, or a combination of both. We notice that the so learned features of different classes are explicitly mapped onto approximately {\em independent principal subspaces} in the feature space; and diverse visual attributes within each class are modeled by the {\em independent principal components} within each subspace.
翻译:这项工作提出了一个新的计算框架, 用于为真实世界数据集学习一个明确的基因模型。 特别是, 我们提议在由多个独立的多维线性子空间构成的特性空间中, 在多级多维数据分布和 {线性分析表达式(LDR) 之间学习 { 线性分析表达式(LDR) 之间的闭路翻转 。 特别是, 我们争论说, 所寻求的最佳编码和解码映射可以作为 {em- 双播放器的小型游戏在编码器和 解码器之间的平衡点 。 这个游戏的自然效用功能是所谓的 emball-loop 线性数据流分布式的缩略图 。 我们的配制从控制系统中的闭路错误反馈中得到灵感, 避免在数据空间或特性空间中任意分布的近似距离进行昂贵的评估和最小化 。 在很大程度上, 这个新的编程将自动编码和 GAN 和GAN 的自然将其扩展为所谓的 ~ ireal ligial rode disal disal disal 。