Kernel methods provide an elegant and principled approach to nonparametric learning, but so far they have hardly been usable in large-scale problems, since na\"ive implementations scale poorly with data size. Recent advances have shown the benefits of a number of algorithmic ideas, for example combining optimization, numerical linear algebra and random projections. Here, we push these efforts further to develop and test a solver that takes full advantage of GPU hardware. Towards this end, we designed a preconditioned gradient solver for kernel methods exploiting both GPU acceleration and parallelization across multiple GPUs, implementing out-of-core variants of common linear algebra operations to guarantee optimal hardware utilization. Further, we optimize the numerical precision of different operations and maximize the efficiency of matrix-vector multiplications. As a result, we experimentally demonstrate dramatic speedups on datasets with billions of points, while still guaranteeing state-of-the-art performance. Additionally, we make our software available as an easy-to-use library.
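To make the two core ingredients above concrete, here is a minimal NumPy sketch, not the paper's actual GPU implementation, of a preconditioned conjugate gradient solver for kernel ridge regression in which the kernel-vector product is computed one row block at a time, so the full kernel matrix is never materialized (the out-of-core idea). A simple Jacobi (diagonal) preconditioner stands in for the more elaborate preconditioner used in practice; all function names and parameters here are illustrative assumptions.

```python
import numpy as np

def gaussian_kernel(X, Y, sigma=1.0):
    """Gaussian kernel block k(x, y) = exp(-||x - y||^2 / (2 sigma^2))."""
    sq = (X**2).sum(1)[:, None] + (Y**2).sum(1)[None, :] - 2.0 * X @ Y.T
    return np.exp(-sq / (2.0 * sigma**2))

def blocked_kernel_matvec(X, v, lam, block=128, sigma=1.0):
    """Compute (K + lam * n * I) v one row block at a time, so the full
    n x n kernel matrix is never held in memory (out-of-core style)."""
    n = X.shape[0]
    out = np.empty(n)
    for start in range(0, n, block):
        stop = min(start + block, n)
        out[start:stop] = gaussian_kernel(X[start:stop], X, sigma) @ v
    return out + lam * n * v

def pcg(matvec, b, precond, tol=1e-6, max_iter=200):
    """Textbook preconditioned conjugate gradient for an SPD system."""
    x = np.zeros_like(b)
    r = b - matvec(x)
    z = precond(r)
    p = z.copy()
    rz = r @ z
    for _ in range(max_iter):
        Ap = matvec(p)
        a = rz / (p @ Ap)
        x += a * p
        r -= a * Ap
        if np.linalg.norm(r) <= tol * np.linalg.norm(b):
            break
        z = precond(r)
        rz, rz_old = r @ z, rz
        p = z + (rz / rz_old) * p
    return x

# Toy kernel ridge regression: solve (K + lam * n * I) alpha = y.
rng = np.random.default_rng(0)
X = rng.normal(size=(500, 3))
y = np.sin(X[:, 0]) + 0.1 * rng.normal(size=500)
lam = 1e-3
# Jacobi preconditioner: k(x, x) = 1 for the Gaussian kernel,
# so the system diagonal is constant and equal to 1 + lam * n.
diag = 1.0 + lam * X.shape[0]
alpha = pcg(lambda v: blocked_kernel_matvec(X, v, lam), y, lambda r: r / diag)
```

The same blocking structure is what makes GPU and multi-GPU execution natural: each row block is an independent dense kernel evaluation and matrix-vector product, so blocks can be streamed to (and split across) accelerators, and their precision can be chosen per operation.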