在强大的五氯苯甲醚中连接连接连接连接连接连接连接和不连接优化:噪音、外部线和缺失数据 (Bridging Convex and Nonconvex Optimization in Robust PCA: Noise, Outliers, and Missing Data)

This paper delivers improved theoretical guarantees for the convex programming approach in low-rank matrix estimation, in the presence of (1) random noise, (2) gross sparse outliers, and (3) missing data. This problem, often dubbed as robust principal component analysis (robust PCA), finds applications in various domains. Despite the wide applicability of convex relaxation, the available statistical support (particularly the stability analysis vis-\`a-vis random noise) remains highly suboptimal, which we strengthen in this paper. When the unknown matrix is well-conditioned, incoherent, and of constant rank, we demonstrate that a principled convex program achieves near-optimal statistical accuracy, in terms of both the Euclidean loss and the $\ell_{\infty}$ loss. All of this happens even when nearly a constant fraction of observations are corrupted by outliers with arbitrary magnitudes. The key analysis idea lies in bridging the convex program in use and an auxiliary nonconvex optimization algorithm, and hence the title of this paper.

翻译：本文为低级矩阵估算的组合编程方法提供了更好的理论保障,其中显示:(1) 随机噪音,(2) 极度稀少的外源和(3) 缺失的数据。这个问题通常被称为稳健的主要组成部分分析(robust CPA), 在不同领域都有应用。尽管松动的放松具有广泛适用性,但现有的统计支持(特别是相对于随机噪音的稳定分析)仍然极不理想,我们在本文中强化了这一点。当未知的组合程序条件良好、不连贯且级别不变时,我们证明一个原则性的组合程序在Euclidean损失和$\ell\ ⁇ infty}$损失两方面都达到了接近最佳的统计准确性。所有这些都发生于几乎一成不变的观测被具有任意规模的外源破坏之时。关键的分析理念在于连接使用中的螺旋程序,以及辅助的非convex优化算法, 以及本文的标题。

相关内容

PCA

关注 3

在统计中，主成分分析（PCA）是一种通过最大化每个维度的方差来将较高维度空间中的数据投影到较低维度空间中的方法。给定二维，三维或更高维空间中的点集合，可以将“最佳拟合”线定义为最小化从点到线的平均平方距离的线。可以从垂直于第一条直线的方向类似地选择下一条最佳拟合线。重复此过程会产生一个正交的基础，其中数据的不同单个维度是不相关的。这些基向量称为主成分。

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

53+阅读 · 2020年9月7日

【AAAI2020】拓扑贝叶斯优化与持久性图：Topological Bayesian Optimization with Persistence Diagrams

专知会员服务

11+阅读 · 2020年1月17日

【北京智源大会2019】神经网络的优化Optimization for Overparametrized Deep Neural Networks，北京大学 | 王立威

专知会员服务

23+阅读 · 2019年11月21日