We study dynamic algorithms robust to adaptive input generated from sources with bounded capabilities, such as sparsity or limited interaction. For example, we consider robust linear algebraic algorithms when the updates to the input are sparse but given by an adversary with access to a query oracle. We also study robust algorithms in the standard centralized setting, where an adversary queries an algorithm in an adaptive manner, but the number of interactions between the adversary and the algorithm is bounded. We first recall a unified framework of [HKM+20, BKM+22, ACSS23] for answering $Q$ adaptive queries that incurs $\widetilde{\mathcal{O}}(\sqrt{Q})$ overhead in space, which is roughly a quadratic improvement over the na\"{i}ve implementation, and only incurs a logarithmic overhead in query time. Although the general framework has diverse applications in machine learning and data science, such as adaptive distance estimation, kernel density estimation, linear regression, range queries, and point queries and serves as a preliminary benchmark, we demonstrate even better algorithmic improvements for (1) reducing the pre-processing time for adaptive distance estimation and (2) permitting an unlimited number of adaptive queries for kernel density estimation. Finally, we complement our theoretical results with additional empirical evaluations.
翻译:我们研究了对来自具有有限能力的源产生的自适应输入做出稳健反应的动态算法,例如,我们考虑当更新输入是稀疏的,但却由具有查询oracle的对手生成时的稳健线性代数算法。我们还研究了在标准集中设置中的强大算法,其中对手以自适应方式查询算法,但对手与算法之间的交互次数受到限制。我们首先回顾了[HKM + 20,BKM + 22,ACSS23]的统一框架,该框架适用于回答 $Q$ 自适应查询,其空间开销为 $\widetilde{\mathcal{O}}(\sqrt{Q})$,大致上比不能胜任的实施方式改善了一个二次,只增加了查询时间的对数开销。虽然一般框架在机器学习和数据科学中具有不同的应用,例如自适应距离估计、核密度估计、线性回归、范围查询和点查询,并且作为初步基准,但我们证明了更好的算法改进,以(1) 减少自适应距离估计的预处理时间和(2) 允许在核密度估计中使用无限个自适应查询。最后,我们通过额外的实证评估来补充我们的理论结果。
标注:
我们研究对来自具有有限能力的源产生的自适应输入做出稳健反应的动态算法,例如,我们考虑当更新输入是稀疏的,但却由具有查询oracle的对手生成时的稳健线性代数算法。 -> *Query oracle*
我们还研究了在标准集中设置中的强大算法,其中对手以自适应方式查询算法,但对手与算法之间的交互次数受到限制。-> *centralized setting*