We study a matrix recovery problem with unknown correspondence: given the observation matrix $M_o=[A,\tilde P B]$, where $\tilde P$ is an unknown permutation matrix, we aim to recover the underlying matrix $M=[A,B]$. Such problem commonly arises in many applications where heterogeneous data are utilized and the correspondence among them are unknown, e.g., due to privacy concerns. We show that it is possible to recover $M$ via solving a nuclear norm minimization problem under a proper low-rank condition on $M$, with provable non-asymptotic error bound for the recovery of $M$. We propose an algorithm, $\text{M}^3\text{O}$ (Matrix recovery via Min-Max Optimization) which recasts this combinatorial problem as a continuous minimax optimization problem and solves it by proximal gradient with a Max-Oracle. $\text{M}^3\text{O}$ can also be applied to a more general scenario where we have missing entries in $M_o$ and multiple groups of data with distinct unknown correspondence. Experiments on simulated data, the MovieLens 100K dataset and Yale B database show that $\text{M}^3\text{O}$ achieves state-of-the-art performance over several baselines and can recover the ground-truth correspondence with high accuracy.
翻译:我们用未知的通信来研究矩阵回收问题:鉴于观测矩阵$M_o=[A,\tillde PB]$[A,\tilde PB]$,美元是一个未知的变异矩阵矩阵,我们的目标是收回基底矩阵$M=[A,B]$。在许多应用中,由于隐私问题等不同数据被使用,而且它们之间的对应性是不为人知的。我们表明,在以美元为单位的适当低等级条件下,通过解决核规范最小化问题,以美元为单位解决核规范问题,可以回收$M$[A,\tilde PB]$,其中非可识别的非自动调节错误为美元。我们建议一种算法,$\text{M*3\text{O}$(通过Min-max 最大优化) 来恢复基底矩阵。我们用$M_o_o_o3和多组的模拟数据库可以显示一些不为未知的数据的恢复性能。