The matrix completion problem seeks to recover a $d\times d$ ground truth matrix of low rank $r\ll d$ from observations of its individual elements. Real-world matrix completion is often a huge-scale optimization problem, with $d$ so large that even the simplest full-dimension vector operations with $O(d)$ time complexity become prohibitively expensive. Stochastic gradient descent (SGD) is one of the few algorithms capable of solving matrix completion on a huge scale, and can also naturally handle streaming data over an evolving ground truth. Unfortunately, SGD experiences a dramatic slow-down when the underlying ground truth is ill-conditioned; it requires at least $O(\kappa\log(1/\epsilon))$ iterations to get $\epsilon$-close to ground truth matrix with condition number $\kappa$. In this paper, we propose a preconditioned version of SGD that preserves all the favorable practical qualities of SGD for huge-scale online optimization while also making it agnostic to $\kappa$. For a symmetric ground truth and the Root Mean Square Error (RMSE) loss, we prove that the preconditioned SGD converges to $\epsilon$-accuracy in $O(\log(1/\epsilon))$ iterations, with a rapid linear convergence rate as if the ground truth were perfectly conditioned with $\kappa=1$. In our experiments, we observe a similar acceleration for item-item collaborative filtering on the MovieLens25M dataset via a pair-wise ranking loss, with 100 million training pairs and 10 million testing pairs. [See supporting code at https://github.com/Hong-Ming/ScaledSGD.]
翻译:矩阵完成问题试图从对各个元素的观察中回收一个低等级的$d_time d$25 d$dd 地面真相矩阵。 不幸的是, SGD在基本地面真相不完善时经历急剧的减速; 真正世界矩阵的完成往往是一个巨大的优化问题, 美元如此之大, 甚至最简单的全维矢量矢量操作也变得过于昂贵。 斯托切梯度下降( SGD) 是能够大规模解决矩阵完成的少数算法之一, 并且也可以自然地在不断演变的地面真相中处理流数据流数据流。 不幸的是, SGD 在基本地面真相不完善时, SGD会经历一个非常缓慢的减速; 它至少需要$O( kappa\log (1/ mepsilon)) 美元, 才能得到条件为$\ kaploformation的完整矢量矢量矢量矢量矢量矢量矩阵操作。 在Squalalal- mlogisalation roal deal deal romodeal roup romodeal deal deal millations, 在Sqral deal deal deal deal deal deal deal demodeal demodeal missation millation mills millations millations millations millations lax 10 mills lax romod romod millations mills 。 。 在Sral romodeal romodal 10 mal romodal romodal romod rodal rodrodro ro ro rod rodrodal rodal rod rod ro rod mal rod mal rod mal rod rod mal rod mal rod mal rodal rodal rodal rodal rodal rod rod rod rod rod rod rod rodal ro ro rodal rod mal rotal 美元中, 立, 立, 立, 立, 立