K-FAC is a successful tractable implementation of Natural Gradient for Deep Learning, which nevertheless suffers from the requirement to compute the inverses of the Kronecker factors (through an eigen-decomposition). This can be very time-consuming (or even prohibitive) when these factors are large. In this paper, we show theoretically that, owing to the exponential-average construction paradigm of the Kronecker factors that is typically used, their eigen-spectrum must decay. We show numerically that in practice this decay is very rapid, leading to the idea that substantial computation can be saved by focusing only on the first few eigen-modes when inverting the Kronecker factors. Randomized Numerical Linear Algebra provides us with the necessary tools to do so. Numerical results show we obtain a $\approx 2.5\times$ reduction in per-epoch time and a $\approx 3.3\times$ reduction in time to target accuracy. We compare our proposed sped-up K-FAC versions with a more computationally efficient NG implementation, SENG, and observe that we perform on par with it.
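To make the core idea concrete, the following is a minimal NumPy sketch of the ingredients named above: an exponential-moving-average update of a Kronecker factor, a randomized truncated eigendecomposition (a standard randomized range-finder, not necessarily the paper's exact routine), and a low-rank approximate inverse that falls back to damping outside the captured subspace. All names (`ema_update`, `randomized_eigh`, `low_rank_inverse_apply`) and parameter choices (`rho`, `damping`, oversampling, power iterations) are illustrative assumptions, not the paper's API.

```python
import numpy as np

def ema_update(A, a_batch, rho=0.95):
    """Exponential-moving-average update of a Kronecker factor:
    A <- rho * A + (1 - rho) * a^T a / batch_size  (illustrative form)."""
    return rho * A + (1 - rho) * (a_batch.T @ a_batch) / a_batch.shape[0]

def randomized_eigh(A, rank, n_oversample=10, n_iter=2, rng=None):
    """Top-`rank` eigenpairs of a symmetric PSD matrix A via a randomized
    range finder with a few power iterations (Halko et al.-style sketch)."""
    rng = np.random.default_rng() if rng is None else rng
    n = A.shape[0]
    k = min(rank + n_oversample, n)
    Y = A @ rng.standard_normal((n, k))
    for _ in range(n_iter):              # power iterations sharpen the decayed spectrum
        Q, _ = np.linalg.qr(Y)
        Y = A @ Q
    Q, _ = np.linalg.qr(Y)
    w, V = np.linalg.eigh(Q.T @ A @ Q)   # small k x k problem
    idx = np.argsort(w)[::-1][:rank]
    return w[idx], Q @ V[:, idx]         # top eigenvalues and eigenvectors

def low_rank_inverse_apply(w, U, damping, G):
    """Apply (A + damping * I)^{-1} to G using only the retained eigen-modes:
    (A + lam I)^{-1} ~= I/lam + U diag(1/(w+lam) - 1/lam) U^T."""
    coeff = 1.0 / (w + damping) - 1.0 / damping
    return G / damping + U @ (coeff[:, None] * (U.T @ G))
```

Because the eigen-spectrum decays rapidly, a small `rank` can capture most of the factor's action, so the cubic-cost full eigendecomposition is replaced by operations costing roughly $O(n^2 k)$ with $k \ll n$.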