在线 Lewis 体重抽样 (Online Lewis Weight Sampling)

The seminal work of Cohen and Peng introduced Lewis weight sampling to the theoretical computer science community, yielding fast row sampling algorithms for approximating $d$-dimensional subspaces of $\ell_p$ up to $(1+\epsilon)$ error. Several works have extended this important primitive to other settings, including the online coreset, sliding window, and adversarial streaming models. However, these results are only for $p\in\{1,2\}$, and results for $p=1$ require a suboptimal $\tilde O(d^2/\epsilon^2)$ samples. In this work, we design the first nearly optimal $\ell_p$ subspace embeddings for all $p\in(0,\infty)$ in the online coreset, sliding window, and the adversarial streaming models. In all three models, our algorithms store $\tilde O(d^{1\lor(p/2)}/\epsilon^2)$ rows. This answers a substantial generalization of the main open question of [BDMMUWZ2020], and gives the first results for all $p\notin\{1,2\}$. Towards our result, we give the first analysis of "one-shot'' Lewis weight sampling of sampling rows proportionally to their Lewis weights, with sample complexity $\tilde O(d^{p/2}/\epsilon^2)$ for $p>2$. Previously, this scheme was only known to have sample complexity $\tilde O(d^{p/2}/\epsilon^5)$, whereas $\tilde O(d^{p/2}/\epsilon^2)$ is known if a more sophisticated recursive sampling is used. The recursive sampling cannot be implemented online, thus necessitating an analysis of one-shot Lewis weight sampling. Our analysis uses a novel connection to online numerical linear algebra. As an application, we obtain the first one-pass streaming coreset algorithms for $(1+\epsilon)$ approximation of important generalized linear models, such as logistic regression and $p$-probit regression. Our upper bounds are parameterized by a complexity parameter $\mu$ introduced by [MSSW2018], and we show the first lower bounds showing that a linear dependence on $\mu$ is necessary.

翻译：科恩和彭的原始工作将 Lewis 重量取样引入了理论计算机科学界, 生成了快速行取样算法, 用于约等于美元( 1 ⁇ ) 美元( 1 ⁇ ) 的维基次空间。一些作品将这一重要的原始范围扩展至其他设置, 包括在线核心集、滑动窗口和对抗流模型。然而, 这些结果只针对 $\\ ⁇ 1 美元( 1美元), 美元( p= 1 美元) 需要低于最佳的 $ O ( d ⁇ 2/\ eeplon2) 的样本美元。在这项工作中, 我们设计了第一个接近最佳的 $( pell_ p$) 的维基次空间嵌入 $( $( 1 ⁇ ) 美元的基底层。当我们第一次通过 ODMUW\\ 202 的深度分析, 开始的结果是以美元( =xxx) 的直升。