低精度斯托克梯度梯度 Langevin 动力学 (Low-Precision Stochastic Gradient Langevin Dynamics) - 专知论文

会员服务 ·

0

Learning · 可约的 · Performer · 估计/估计量 · 泛化理论 ·

2022 年 6 月 20 日

Low-Precision Stochastic Gradient Langevin Dynamics

翻译：低精度斯托克梯度梯度 Langevin 动力学

Ruqi Zhang,Andrew Gordon Wilson,Christopher De Sa

from arxiv, Published at ICML 2022

While low-precision optimization has been widely used to accelerate deep learning, low-precision sampling remains largely unexplored. As a consequence, sampling is simply infeasible in many large-scale scenarios, despite providing remarkable benefits to generalization and uncertainty estimation for neural networks. In this paper, we provide the first study of low-precision Stochastic Gradient Langevin Dynamics (SGLD), showing that its costs can be significantly reduced without sacrificing performance, due to its intrinsic ability to handle system noise. We prove that the convergence of low-precision SGLD with full-precision gradient accumulators is less affected by the quantization error than its SGD counterpart in the strongly convex setting. To further enable low-precision gradient accumulators, we develop a new quantization function for SGLD that preserves the variance in each update step. We demonstrate that low-precision SGLD achieves comparable performance to full-precision SGLD with only 8 bits on a variety of deep learning tasks.

翻译：虽然低精确度优化已被广泛用于加速深层学习,但低精确度取样基本上仍未探索,因此,在许多大规模假设情况下,取样根本是行不通的,尽管取样为神经网络的概括性和不确定性估计提供了显著的惠益。在本文件中,我们提供了第一个低精确度慢精确度定级朗埃文动态(SGLD)的研究,表明由于低精确度定级与完全精确度梯度累积器的结合,其成本可以大幅降低而不会牺牲性能,因为其处理系统噪音的内在能力。我们证明,低精确度SGLD与全面精确度梯度累积器的结合,比其在强凝固度环境下的SGD对应器受到的偏差差差影响要小。为了进一步启用低精确度梯度累积器,我们为SGLD开发了一个新的分级函数,以保持每个更新步骤的差异。我们证明,低精确度SGLD由于能取得与完全精确度SGLD相似的性能,只有8位数的深度学习任务。

0

相关内容

Learning

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

LncRNA参与Arc调控海马神经元突触重塑在癫痫发生中的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

GIT1CC2结构域在保护脊髓缺血再灌注损伤（SCII）中的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Perp在类风湿性关节炎外周Th17细胞存活中的作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

Fe-Co弥散分布的复相稀土永磁材料热变形磁织构形成机制及双相耦合作用

国家自然科学基金

0+阅读 · 2012年12月31日

双溶剂高分子溶液的monte carlo研究

国家自然科学基金

0+阅读 · 2012年12月31日

高温质子交换膜燃料电池铂催化剂耐久性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

HDAC6介导的乙酰化表观遗传修饰在PCOS胰岛素抵抗中的作用机制

国家自然科学基金

0+阅读 · 2012年12月31日

靶向抑制HDAC3调控阿尔茨海默病记忆-神经环路重塑的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

蛋白茶多酚复合物的分子模拟和小角X光散射研究

国家自然科学基金

0+阅读 · 2012年12月31日

Adiponectin在肝脏缺血再灌注损伤中的抗肝细胞凋亡机制

国家自然科学基金

0+阅读 · 2009年12月31日

On the Activation Function Dependence of the Spectral Bias of Neural Networks

Arxiv

0+阅读 · 2022年8月9日

Formalization of a Stochastic Approximation Theorem

Arxiv

0+阅读 · 2022年8月8日

FedVQCS: Federated Learning via Vector Quantized Compressed Sensing

Arxiv

0+阅读 · 2022年8月8日

Robust Congestion Control for Demand-Based Optimization in Precoded Multi-Beam High Throughput Satellite Communications

Arxiv

0+阅读 · 2022年8月8日

Stochastic Scaling in Loss Functions for Physics-Informed Neural Networks

Arxiv

0+阅读 · 2022年8月7日

A Parallel Technique for Multi-objective Bayesian Global Optimization: Using a Batch Selection of Probability of Improvement

Arxiv

0+阅读 · 2022年8月7日

Data-driven Control of Agent-based Models: an Equation/Variable-free Machine Learning Approach

Arxiv

0+阅读 · 2022年8月5日

Evaluating Active Learning Heuristics for Sequential Diagnosis

Arxiv

0+阅读 · 2022年8月5日

Continuous Beam Alignment for Mobile MIMO

Arxiv

0+阅读 · 2022年8月5日

Balanced Multimodal Learning via On-the-fly Gradient Modulation

Arxiv

13+阅读 · 2022年3月29日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《乌克兰无人机产业：志愿者与政策在构建新兴无人机产业中的协同作用》最新报告

《人工智能辅助决策中的数据可视化：系统性综述》

人工智能驱动弹药制造现代化：美国陆军转型之路

《敏捷作战部署中枢纽-辐条基地选址优化研究》80页

相关资讯

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

相关论文

On the Activation Function Dependence of the Spectral Bias of Neural Networks

Arxiv

0+阅读 · 2022年8月9日

Formalization of a Stochastic Approximation Theorem

Arxiv

0+阅读 · 2022年8月8日

FedVQCS: Federated Learning via Vector Quantized Compressed Sensing

Arxiv

0+阅读 · 2022年8月8日

Robust Congestion Control for Demand-Based Optimization in Precoded Multi-Beam High Throughput Satellite Communications

Arxiv

0+阅读 · 2022年8月8日

Stochastic Scaling in Loss Functions for Physics-Informed Neural Networks

Arxiv

0+阅读 · 2022年8月7日

A Parallel Technique for Multi-objective Bayesian Global Optimization: Using a Batch Selection of Probability of Improvement

Arxiv

0+阅读 · 2022年8月7日

Data-driven Control of Agent-based Models: an Equation/Variable-free Machine Learning Approach

Arxiv

0+阅读 · 2022年8月5日

Evaluating Active Learning Heuristics for Sequential Diagnosis

Arxiv

0+阅读 · 2022年8月5日

Continuous Beam Alignment for Mobile MIMO

Arxiv

0+阅读 · 2022年8月5日

Balanced Multimodal Learning via On-the-fly Gradient Modulation

Arxiv

13+阅读 · 2022年3月29日

相关基金

LncRNA参与Arc调控海马神经元突触重塑在癫痫发生中的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

GIT1CC2结构域在保护脊髓缺血再灌注损伤（SCII）中的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Perp在类风湿性关节炎外周Th17细胞存活中的作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

Fe-Co弥散分布的复相稀土永磁材料热变形磁织构形成机制及双相耦合作用

国家自然科学基金

0+阅读 · 2012年12月31日

双溶剂高分子溶液的monte carlo研究

国家自然科学基金

0+阅读 · 2012年12月31日

高温质子交换膜燃料电池铂催化剂耐久性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

HDAC6介导的乙酰化表观遗传修饰在PCOS胰岛素抵抗中的作用机制

国家自然科学基金

0+阅读 · 2012年12月31日

靶向抑制HDAC3调控阿尔茨海默病记忆-神经环路重塑的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

蛋白茶多酚复合物的分子模拟和小角X光散射研究

国家自然科学基金

0+阅读 · 2012年12月31日

Adiponectin在肝脏缺血再灌注损伤中的抗肝细胞凋亡机制

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员