Many existing algorithms for streaming geometric data analysis have been plagued by exponential dependencies in the space complexity, which are undesirable for processing high-dimensional data sets. In particular, once $d\geq\log n$, there are no known non-trivial streaming algorithms for problems such as maintaining convex hulls and L\"owner-John ellipsoids of $n$ points, despite a long line of work in streaming computational geometry since [AHV04]. We simultaneously improve these results to $\mathrm{poly}(d,\log n)$ bits of space by trading off with a $\mathrm{poly}(d,\log n)$ factor distortion. We achieve these results in a unified manner, by designing the first streaming algorithm for maintaining a coreset for $\ell_\infty$ subspace embeddings with $\mathrm{poly}(d,\log n)$ space and $\mathrm{poly}(d,\log n)$ distortion. Our algorithm also gives similar guarantees in the \emph{online coreset} model. Along the way, we sharpen results for online numerical linear algebra by replacing a log condition number dependence with a $\log n$ dependence, answering a question of [BDM+20]. Our techniques provide a novel connection between leverage scores, a fundamental object in numerical linear algebra, and computational geometry. For $\ell_p$ subspace embeddings, we give nearly optimal trade-offs between space and distortion for one-pass streaming algorithms. For instance, we give a deterministic coreset using $O(d^2\log n)$ space and $O((d\log n)^{1/2-1/p})$ distortion for $p>2$, whereas previous deterministic algorithms incurred a $\mathrm{poly}(n)$ factor in the space or the distortion [CDW18]. Our techniques have implications in the offline setting, where we give optimal trade-offs between the space complexity and distortion of subspace sketch data structures. To do this, we give an elementary proof of a "change of density" theorem of [LT80] and make it algorithmic.
翻译:用于进行地球测量数据分析的许多现有算法一直受到空间复杂度指数依赖性的困扰,而这种依赖对于处理高维数据集来说是不可取的。 特别是, $d\ geq\ log n$, 在维护 convex 外壳和 L\" 所有人 - John elliverside 等问题上没有已知的非三维流算法, 尽管自 [AHV04] 以来在流解计算性几何方面做了大量工作。 我们同时将这些结果改进到 $mathrm{poly} (d,\log nn) 。 通过以 $ more demologal commation 来交换(d, d\ n) comm drigial deal_ alligistration ral restitude, 我们用 ligial listal deal lives a ligal ligal ligal exmode, 我们用 ligal deal liversal ligal licomma list a modeal deal list list listrate licommax a mess.