In this work, we adopt Wyner common information framework for unsupervised multi-view representation learning. Within this framework, we propose two novel formulations that enable the development of computational efficient solvers based on the alternating minimization principle. The first formulation, referred to as the {\em variational form}, enjoys a linearly growing complexity with the number of views and is based on a variational-inference tight surrogate bound coupled with a Lagrangian optimization objective function. The second formulation, i.e., the {\em representational form}, is shown to include known results as special cases. Here, we develop a tailored version from the alternating direction method of multipliers (ADMM) algorithm for solving the resulting non-convex optimization problem. In the two cases, the convergence of the proposed solvers is established in certain relevant regimes. Furthermore, our empirical results demonstrate the effectiveness of the proposed methods as compared with the state-of-the-art solvers. In a nutshell, the proposed solvers offer computational efficiency, theoretical convergence guarantees, scalable complexity with the number of views, and exceptional accuracy as compared with the state-of-the-art techniques. Our focus here is devoted to the discrete case and our results for continuous distributions are reported elsewhere.
翻译:在本文中,我们采用 Wyner 公共信息框架进行无监督多视角表示学习。在该框架内,我们提出了两种新颖的公式,使得基于交替最小化原理的计算有效求解器得以发展。第一种公式被称为{\em 变分形式},其复杂度随着视角数量的增加而呈线性增长,并基于变分推理紧的替代上界以及与拉格朗日优化目标函数相结合。第二种公式,即 {\em 陈述形式},被证明包含已知结果的特殊情况。在此基础上,我们开发了一个专门适用于求解所得的非凸优化问题的交替方向乘子(ADMM)算法的定制版本。在这两种情况下,我们在相关的重要范围内证明了所提出求解器的收敛性。此外,我们的实验结果证明了所提出方法与最先进技术相比的有效性。简而言之,所提出的求解器提供了计算效率、理论收敛保证、与视角数量可扩展的复杂度,并且优于最先进技术的精度。本文重点放在离散情况下,我们在其他地方报告了对于连续分布的结果。