Fast Differentiable Matrix Square Root and Inverse Square Root

Square matrix · Singular value decomposition · Backward pass · FAST · Forward pass

October 19, 2022

Yue Song, Nicu Sebe, Wei Wang

From arXiv; T-PAMI 2022. arXiv admin note: substantial text overlap with arXiv:2201.08663

Computing the matrix square root and its inverse in a differentiable manner is important in a variety of computer vision tasks. Previous methods either adopt the Singular Value Decomposition (SVD) to explicitly factorize the matrix or use the Newton-Schulz iteration (NS iteration) to derive an approximate solution. However, each method falls short on computational efficiency in either the forward pass or the backward pass. In this paper, we propose two more efficient variants to compute the differentiable matrix square root and the inverse square root. For the forward propagation, one method uses the Matrix Taylor Polynomial (MTP), and the other uses Matrix Padé Approximants (MPA). The backward gradient is computed by iteratively solving the continuous-time Lyapunov equation using the matrix sign function. A series of numerical tests shows that both methods yield considerable speed-up compared with the SVD or the NS iteration. Moreover, we validate the effectiveness of our methods in several real-world applications, including decorrelated batch normalization, second-order vision transformer, global covariance pooling for large-scale and fine-grained recognition, attentive covariance pooling for video recognition, and neural style transfer. The experimental results demonstrate that our methods can also achieve competitive and even slightly better performance. The PyTorch implementation is available at https://github.com/KingJamesSong/FastDifferentiableMatSqrt
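
To make the forward-pass idea concrete, here is a minimal sketch of a truncated matrix Taylor series for the square root and inverse square root. It is not the authors' implementation: the function name `matrix_power_taylor`, the `degree` parameter, the Frobenius-norm pre-normalization, and the use of NumPy instead of PyTorch are all assumptions made for illustration, and the input is assumed to be symmetric positive definite. The sketch relies on the scalar expansion $(1 - z)^p = \sum_k \binom{p}{k} (-z)^k$ applied to $Z = I - A / \lVert A \rVert_F$. It covers neither the Padé variant nor the Lyapunov-based backward pass.

```python
import numpy as np


def matrix_power_taylor(a, p, degree=20):
    """Approximate A**p for a symmetric positive definite A (hypothetical helper).

    Truncates (1 - z)**p = sum_k binom(p, k) * (-z)**k at `degree`,
    with Z = I - A / ||A||_F so that the series converges.
    """
    n = a.shape[0]
    norm = np.linalg.norm(a)        # Frobenius norm: eigenvalues of a / norm lie in (0, 1]
    z = np.eye(n) - a / norm        # eigenvalues of z lie in [0, 1)
    result = np.eye(n)              # k = 0 term of the series
    z_power = np.eye(n)
    coeff = 1.0                     # generalized binomial coefficient binom(p, 0)
    for k in range(1, degree + 1):
        coeff *= (p - (k - 1)) / k  # binom(p, k) computed from binom(p, k - 1)
        z_power = z_power @ z
        result = result + coeff * (-1) ** k * z_power
    return norm ** p * result       # undo the normalization


# Usage: p = 0.5 gives the square root, p = -0.5 the inverse square root.
a = np.array([[4.0, 1.0], [1.0, 3.0]])
sqrt_a = matrix_power_taylor(a, 0.5, degree=40)
inv_sqrt_a = matrix_power_taylor(a, -0.5, degree=40)
print("sqrt residual:", np.linalg.norm(sqrt_a @ sqrt_a - a))
print("inverse sqrt residual:", np.linalg.norm(inv_sqrt_a @ sqrt_a - np.eye(2)))
```

The cost is one matrix multiplication per term, and accuracy depends on how far the smallest normalized eigenvalue sits from zero: ill-conditioned inputs need a higher `degree` for the same residual.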
