Coded distributed computation has become common practice for performing gradient descent on large datasets in order to mitigate stragglers and other faults. This paper proposes a novel algorithm that encodes the partial derivatives themselves and, furthermore, optimizes the codes by performing lossy compression on the derivative codewords: maximizing the information contained in each codeword while minimizing the information shared between codewords. The utility of this application of coding theory is a geometric consequence of a fact observed in optimization research: noise is tolerable, and sometimes even helpful, in gradient-descent-based learning algorithms, since it helps avoid overfitting and local minima. This stands in contrast with much of the current work on distributed coded computation, which focuses on recovering all of the data from the workers. A second contribution is that the low-weight nature of the coding scheme permits asynchronous gradient updates, since the code can be decoded iteratively; that is, a worker's result can be incorporated into the overall gradient as soon as it arrives. Finally, the directional derivative is always a linear function of the direction vector; our framework is therefore robust, since it can apply linear coding techniques to general machine learning frameworks such as deep neural networks.
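To make the linearity argument concrete, the sketch below illustrates (under assumptions not specified in the abstract: a simple random linear code, finite-difference workers, and least-squares decoding) how coded directional derivatives can be partially decoded into a noisy gradient estimate even when some workers straggle. The function and helper names are hypothetical and chosen for illustration only; this is not the paper's algorithm.

```python
# Illustrative sketch only: since D_v f(x) = <grad f(x), v> is linear in the
# direction v, a linear code applied to direction vectors carries over to the
# workers' scalar results, and a subset of results suffices for an estimate.
import numpy as np

def f(x):
    # Toy differentiable objective standing in for a training loss.
    return 0.5 * np.sum(x ** 2) + np.sin(x[0])

def directional_derivative(f, x, v, eps=1e-6):
    # Central finite difference approximating D_v f(x) = <grad f(x), v>.
    return (f(x + eps * v) - f(x - eps * v)) / (2 * eps)

def encode_directions(dim, num_workers, rng):
    # Random linear code: worker i evaluates the coded direction G[i].
    return rng.standard_normal((num_workers, dim))

def estimate_gradient(G, responses, received):
    # Partial (iterative-style) decoding: use only the workers that have
    # responded; least squares tolerates stragglers at the cost of noise.
    idx = np.flatnonzero(received)
    g_hat, *_ = np.linalg.lstsq(G[idx], responses[idx], rcond=None)
    return g_hat

rng = np.random.default_rng(0)
dim, num_workers = 5, 8
x = rng.standard_normal(dim)
G = encode_directions(dim, num_workers, rng)

# Each worker computes one directional derivative along its coded direction.
responses = np.array([directional_derivative(f, x, G[i]) for i in range(num_workers)])

# Suppose only 6 of the 8 workers return in time (stragglers dropped).
received = np.array([1, 1, 1, 0, 1, 1, 1, 0], dtype=bool)
print(estimate_gradient(G, responses, received))
```

In this toy setting the noise in the recovered gradient comes from both the finite-difference approximation and the missing workers, which is exactly the kind of noise the abstract argues is tolerable in gradient-descent-based learning.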