具有批量更新的低尺寸相近近邻近近邻 (Parallel Nearest Neighbors in Low Dimensions with Batch Updates) - 专知论文

会员服务 ·

0

张成子空间 · 近邻 · 情景 · 确切的 · FAST ·

2021 年 11 月 7 日

Parallel Nearest Neighbors in Low Dimensions with Batch Updates

翻译：具有批量更新的低尺寸相近近邻近近邻

Magdalen Dobson,Guy Blelloch

We present a set of parallel algorithms for computing exact k-nearest neighbors in low dimensions. Many k-nearest neighbor algorithms use either a kd-tree or the Morton ordering of the point set; our algorithms combine these approaches using a data structure we call the \textit{zd-tree}. We show that this combination is both theoretically efficient under common assumptions, and fast in practice. For point sets of size $n$ with bounded expansion constant and bounded ratio, the zd-tree can be built in $O(n)$ work with $O(n^{\epsilon})$ span for constant $\epsilon<1$, and searching for the $k$-nearest neighbors of a point takes expected $O(k\log k)$ time. We benchmark our k-nearest neighbor algorithms against existing parallel k-nearest neighbor algorithms, showing that our implementations are generally faster than the state of the art as well as achieving 75x speedup on 144 hyperthreads. Furthermore, the zd-tree supports parallel batch-dynamic insertions and deletions; to our knowledge, it is the first k-nearest neighbor data structure to support such updates. On point sets with bounded expansion constant and bounded ratio, a batch-dynamic update of size $k$ requires $O(k \log n/k)$ work with $O(k^{\epsilon} + \text{polylog}(n))$ span.

翻译：我们用一套平行算法来计算精确的 k- 近邻的低维值。许多 k- 最近邻的算法使用 kd- tree 或 Morton 定点数; 我们的算法结合了这些方法, 我们称之为\ textit{ zd- tree} 的数据结构。我们显示, 在共同的假设下, 这种组合在理论上是有效的, 在实践中也是快速的。对于有约束扩展常数和约束比率的点数, zd- tree 可以用$O (n) 的工作用$O (n) 来构建。此外, zd- tree 用于恒定的 $ (n- epsilon) < 1$, 寻找某个点的最近邻的 $- 美元则需要 $(k\ log) 期待美元 (k\ log) 时间。我们用现有的平行 K- nearn- 邻居的算法将我们最近的算法作为基准, 显示我们的执行速度一般比艺术状态快, 并在144 超超版读上达到 75x 速度。此外, 。 zd- tree 需要平行的批量递增量美元的的的和美元递增量和递增 K- sal- sal- rial- rial- rialxxxxxx 比例。

0

相关内容

张成子空间

张成子空间

边缘机器学习，21页ppt

边缘机器学习，21页ppt

专知会员服务

84+阅读 · 2021年6月21日

【AAAI2021】基于图神经网络的文本语义匹配算法

【AAAI2021】基于图神经网络的文本语义匹配算法

专知会员服务

50+阅读 · 2021年1月30日

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

44+阅读 · 2020年12月18日

【DeepMind-NeurIPS 2020】元训练代理实现Bayes-optimal代理

【DeepMind-NeurIPS 2020】元训练代理实现Bayes-optimal代理

专知会员服务

12+阅读 · 2020年11月1日

【KDD2020】基于矩阵和张量因子分解的高效自动机器学习搜索，Efficient AutoML Pipeline Search with Matrix and Tensor Factorization

【KDD2020】基于矩阵和张量因子分解的高效自动机器学习搜索，Efficient AutoML Pipeline Search with Matrix and Tensor Factorization

专知会员服务

13+阅读 · 2020年6月10日

【WWW 2020 】基于关系对抗网络的低资源知识图谱补全，Relation Adversarial Network for Low Resource Knowledge Graph Completion

【WWW 2020 】基于关系对抗网络的低资源知识图谱补全，Relation Adversarial Network for Low Resource Knowledge Graph Completion

专知会员服务

37+阅读 · 2020年6月7日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

250+阅读 · 2020年4月19日

为什么批处理规范会导致梯度爆炸，Why Batch Norm Causes Exploding Gradients

为什么批处理规范会导致梯度爆炸，Why Batch Norm Causes Exploding Gradients

专知会员服务

17+阅读 · 2020年4月2日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

LibRec 精选：AutoML for Contextual Bandits

LibRec 精选：AutoML for Contextual Bandits

LibRec智能推荐

7+阅读 · 2019年9月19日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

已删除

将门创投

6+阅读 · 2019年4月22日

TensorFlow 2.0新特性之Ragged Tensor

TensorFlow 2.0新特性之Ragged Tensor

深度学习每日摘要

18+阅读 · 2019年4月5日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Ray RLlib: Scalable 降龙十八掌

Ray RLlib: Scalable 降龙十八掌

CreateAMind

9+阅读 · 2018年12月28日

保序最优传输：Order-preserving Optimal Transport

保序最优传输：Order-preserving Optimal Transport

我爱读PAMI

6+阅读 · 2018年9月16日

分布式TensorFlow入门指南

分布式TensorFlow入门指南

机器学习研究会

4+阅读 · 2017年11月28日

干货 | 如何理解深度学习分布式训练中的large batch size与learning rate的关系？

干货 | 如何理解深度学习分布式训练中的large batch size与learning rate的关系？

AI科技评论

5+阅读 · 2017年11月2日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

Tree-based Search Graph for Approximate Nearest Neighbor Search

Arxiv

0+阅读 · 2022年1月10日

Optimal Experimental Design for Staggered Rollouts

Arxiv

0+阅读 · 2022年1月10日

Selectable Heaps and Optimal Lazy Search Trees

Arxiv

0+阅读 · 2022年1月10日

A note on concatenation of quasi-Monte Carlo and plain Monte Carlo rules in high dimensions

Arxiv

0+阅读 · 2022年1月7日

Dynamic Suffix Array with Polylogarithmic Queries and Updates

Arxiv

0+阅读 · 2022年1月4日

Star Discrepancy Subset Selection: Problem Formulation and Efficient Approaches for Low Dimensions

Arxiv

0+阅读 · 2022年1月4日

Stochastic Shortest Path: Minimax, Parameter-Free and Towards Horizon-Free Regret

Arxiv

8+阅读 · 2021年4月22日

Unbalanced minibatch Optimal Transport; applications to Domain Adaptation

Arxiv

3+阅读 · 2021年3月5日

Optimal Algorithms for Non-Smooth Distributed Optimization in Networks

Arxiv

7+阅读 · 2018年6月1日

Being Robust (in High Dimensions) Can Be Practical

Arxiv

3+阅读 · 2017年12月14日

VIP会员

文章信息

相关主题

张成子空间

相关VIP内容

边缘机器学习，21页ppt

边缘机器学习，21页ppt

专知会员服务

84+阅读 · 2021年6月21日

【AAAI2021】基于图神经网络的文本语义匹配算法

【AAAI2021】基于图神经网络的文本语义匹配算法

专知会员服务

50+阅读 · 2021年1月30日

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

44+阅读 · 2020年12月18日

【DeepMind-NeurIPS 2020】元训练代理实现Bayes-optimal代理

【DeepMind-NeurIPS 2020】元训练代理实现Bayes-optimal代理

专知会员服务

12+阅读 · 2020年11月1日

【KDD2020】基于矩阵和张量因子分解的高效自动机器学习搜索，Efficient AutoML Pipeline Search with Matrix and Tensor Factorization

【KDD2020】基于矩阵和张量因子分解的高效自动机器学习搜索，Efficient AutoML Pipeline Search with Matrix and Tensor Factorization

专知会员服务

13+阅读 · 2020年6月10日

【WWW 2020 】基于关系对抗网络的低资源知识图谱补全，Relation Adversarial Network for Low Resource Knowledge Graph Completion

【WWW 2020 】基于关系对抗网络的低资源知识图谱补全，Relation Adversarial Network for Low Resource Knowledge Graph Completion

专知会员服务

37+阅读 · 2020年6月7日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

250+阅读 · 2020年4月19日

为什么批处理规范会导致梯度爆炸，Why Batch Norm Causes Exploding Gradients

为什么批处理规范会导致梯度爆炸，Why Batch Norm Causes Exploding Gradients

专知会员服务

17+阅读 · 2020年4月2日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

《美国海军陆战队软件定义网络应用案例：分布式防火墙自动化系统》148页

《多体环境下定位导航授时（PNT）系统研究》228页

软件定义无线电（SDR）：商业与军事领域的技术、应用及未来趋势

《攻势防空作战中无人追击者/规避者最优轨迹研究（含动态交战区建模）》95页

相关资讯

LibRec 精选：AutoML for Contextual Bandits

LibRec 精选：AutoML for Contextual Bandits

LibRec智能推荐

7+阅读 · 2019年9月19日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

已删除

将门创投

6+阅读 · 2019年4月22日

TensorFlow 2.0新特性之Ragged Tensor

TensorFlow 2.0新特性之Ragged Tensor

深度学习每日摘要

18+阅读 · 2019年4月5日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Ray RLlib: Scalable 降龙十八掌

Ray RLlib: Scalable 降龙十八掌

CreateAMind

9+阅读 · 2018年12月28日

保序最优传输：Order-preserving Optimal Transport

保序最优传输：Order-preserving Optimal Transport

我爱读PAMI

6+阅读 · 2018年9月16日

分布式TensorFlow入门指南

分布式TensorFlow入门指南

机器学习研究会

4+阅读 · 2017年11月28日

干货 | 如何理解深度学习分布式训练中的large batch size与learning rate的关系？

干货 | 如何理解深度学习分布式训练中的large batch size与learning rate的关系？

AI科技评论

5+阅读 · 2017年11月2日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

相关论文

Tree-based Search Graph for Approximate Nearest Neighbor Search

Arxiv

0+阅读 · 2022年1月10日

Optimal Experimental Design for Staggered Rollouts

Arxiv

0+阅读 · 2022年1月10日

Selectable Heaps and Optimal Lazy Search Trees

Arxiv

0+阅读 · 2022年1月10日

A note on concatenation of quasi-Monte Carlo and plain Monte Carlo rules in high dimensions

Arxiv

0+阅读 · 2022年1月7日

Dynamic Suffix Array with Polylogarithmic Queries and Updates

Arxiv

0+阅读 · 2022年1月4日

Star Discrepancy Subset Selection: Problem Formulation and Efficient Approaches for Low Dimensions

Arxiv

0+阅读 · 2022年1月4日

Stochastic Shortest Path: Minimax, Parameter-Free and Towards Horizon-Free Regret

Arxiv

8+阅读 · 2021年4月22日

Unbalanced minibatch Optimal Transport; applications to Domain Adaptation

Arxiv

3+阅读 · 2021年3月5日

Optimal Algorithms for Non-Smooth Distributed Optimization in Networks

Arxiv

7+阅读 · 2018年6月1日

Being Robust (in High Dimensions) Can Be Practical

Arxiv

3+阅读 · 2017年12月14日

微信扫码咨询专知VIP会员