We consider the problem of learning discriminative representations for data in a high-dimensional space with distribution supported on or around multiple low-dimensional linear subspaces. That is, we wish to compute a linear injective map of the data such that the features lie on multiple orthogonal subspaces. Instead of treating this learning problem using multiple PCAs, we cast it as a sequential game using the closed-loop transcription (CTRL) framework recently proposed for learning discriminative and generative representations for general low-dimensional submanifolds. We prove that the equilibrium solutions to the game indeed give correct representations. Our approach unifies classical methods of learning subspaces with modern deep learning practice, by showing that subspace learning problems may be provably solved using the modern toolkit of representation learning. In addition, our work provides the first theoretical justification for the CTRL framework, in the important case of linear subspaces. We support our theoretical findings with compelling empirical evidence. We also generalize the sequential game formulation to more general representation learning problems. Our code, including methods for easy reproduction of experimental results, is publically available on GitHub.
翻译:我们考虑了在多维线性子空间或多维线性子空间周围分布的高维空间数据中学习有区别的表达方式的问题。也就是说,我们希望对数据进行线性直射图的计算,使这些特征存在于多个正方位子空间上。我们不使用多多五氯苯处理这一学习问题,而是将它作为一个连续游戏,使用最近提议的闭环转录(CTRL)框架来学习普通低维子层的有区别和有区别的表达方式。我们证明,对游戏的平衡解决办法确实提供了正确的表达方式。我们的方法将学习有现代深层学习实践的子空间的经典方法统一起来,通过展示亚空学习问题可以通过现代代言语学习工具包的可辨别解决。此外,我们的工作为CTRL框架提供了在重要的线性子空间中的第一个理论理由。我们用令人信服的经验证据来支持我们的理论结论。我们还将连续游戏的表述方式概括为更普遍的有代表性的学习问题。我们的代码,包括易于复制实验结果的方法,在GitHubb可以公开获得。