具有刻度三角因素的AMG的 ILU 滑滑器 (ILU Smoothers for AMG with Scaled Triangular Factors) - 专知论文

会员服务 ·

0

可约的 · 分解的 · 缩放 · 有向 · Performer ·

2022 年 8 月 3 日

ILU Smoothers for AMG with Scaled Triangular Factors

翻译：具有刻度三角因素的AMG的 ILU 滑滑器

Stephen Thomas,Arielle Carr,Paul Mullowney,Kasia Świrydowicz,Marc Day

from arxiv, v2 updated citation information; v3 updated results; v4 abstract updated, new results added; v5 new experimental analysis and results added

ILU smoothers are effective in the algebraic multigrid (AMG) V-cycle for reducing high-frequency components of the residual error. However, direct triangular solves are comparatively slow on GPUs. Previous work by Chow and Patel (2015) and Antz et al. (2015) demonstrated the advantages of Jacobi relaxation as an alternative. Depending on the threshold and fill-level parameters chosen, the factors are highly non-normal and Jacobi is unlikely to converge in a low number of iterations. The Ruiz algorithm applies row or row/column scaling to U in order to reduce the departure from normality. The inherently sequential solve is replaced with a Richardson iteration. There are several advantages beyond the lower compute time. Scaling is performed locally for a diagonal block of the global matrix because it is applied directly to the factor. An ILUT Schur complement smoother maintains a constant GMRES iteration count as the number of MPI ranks increases and thus parallel strong-scaling is improved. The new algorithms are included in hypre, and achieve improved time to solution for several Exascale applications, including the Nalu-Wind and PeleLM pressure solvers. For large problem sizes, GMRES+AMG with iterative triangular solves execute at least five times faster than with direct on massively-parallel GPUs.

翻译：ILU 滑动在代数多格(AMG) V 周期中有效,可以减少剩余错误的高频部件。但是, 直接三角解决方案在 GPU 上相对缓慢。 Chow 和 Patel (2015) 和 Antz 等人(2015) 以往的工作展示了Jacobi 放松作为一种替代方法的优点。根据所选择的临界值和填充值参数, 这些因素极不正常, Jacobi 不太可能在低迭代数中聚合。 Ruiz 算法将行或行/ 栏缩放到 U, 以减少偏离常态。内在的连续解决方案被 Richardson 迭代为替换。在较低的计算时间以外, 还有一些优势。缩放是本地为全球矩阵的对角块, 因为它直接应用到因素。 ILUT Schur 补充器保持恒定的 GRES 升调, 计为MPI 级数, 从而平行的加缩缩。新的算法包含在 Hyprepe, 中, 改进了多个 Exscalal 解式应用程序的解决方案,, 包括直径 GMGLV- LDRW 和。

0

相关内容

可约的

不可错过！700+ppt《因果推理》课程！杜克大学Fan Li教程

不可错过！700+ppt《因果推理》课程！杜克大学Fan Li教程

专知会员服务

72+阅读 · 2022年7月11日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

一份简单《图神经网络》教程，28页ppt

一份简单《图神经网络》教程，28页ppt

专知会员服务

127+阅读 · 2020年8月2日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

开放知识图谱

1+阅读 · 2022年4月4日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

全光开关用过渡金属Fe、Co、Ni量子点玻璃的制备及三阶非线性光学性质的研究

国家自然科学基金

0+阅读 · 2015年12月31日

掺杂的稀土氧化物非晶态纳米管可控制备及其热电性能研究

国家自然科学基金

0+阅读 · 2014年12月31日

预投射代数及其相关问题

国家自然科学基金

0+阅读 · 2013年12月31日

p-MgNiO/立方相n-MgZnO异质结界面特性、物性调控及其深紫外发光器件研究

国家自然科学基金

0+阅读 · 2013年12月31日

Tip60在oxLDL诱导的血管平滑肌细胞自噬及增殖中的作用机制

国家自然科学基金

0+阅读 · 2013年12月31日

Kronheimer-Nakajima quiver 模空间与有理曲面

国家自然科学基金

1+阅读 · 2013年12月31日

块体热电材料的热变形诱导再结晶与性能优化

国家自然科学基金

0+阅读 · 2012年12月31日

Ghrelin对胰岛β细胞分泌胰岛素和增殖的影响及分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

胚胎干细胞来源感光细胞在变性视网膜中的整合机制及Muller细胞的影响

国家自然科学基金

0+阅读 · 2012年12月31日

血管内皮生长因子A突变在先天性左室流出道梗阻畸形发生中的功能研究

国家自然科学基金

0+阅读 · 2011年12月31日

A Nonlocal Graph-PDE and Higher-Order Geometric Integration for Image Labeling

Arxiv

0+阅读 · 2022年10月4日

Analysis of the performance of U-Net neural networks for the segmentation of living cells

Arxiv

0+阅读 · 2022年10月4日

Harnessing spectral representations for subgraph alignment

Arxiv

0+阅读 · 2022年10月3日

InitialGAN: A Language GAN with Completely Random Initialization

Arxiv

0+阅读 · 2022年10月3日

Bayesian Inference using the Proximal Mapping: Uncertainty Quantification under Varying Dimensionality

Arxiv

0+阅读 · 2022年10月3日

Yurinskii's Coupling for Martingales

Arxiv

0+阅读 · 2022年10月1日

A $\star$-product solver with spectral accuracy for non-autonomous ordinary differential equations

Arxiv

0+阅读 · 2022年9月30日

Efficient hyperbolic-parabolic models on multi-dimensional unbounded domains using an extended DG approach

Arxiv

0+阅读 · 2022年9月30日

Scale-invariant Bayesian Neural Networks with Connectivity Tangent Kernel

Arxiv

0+阅读 · 2022年9月30日

Pure-Circuit: Strong Inapproximability for PPAD

Arxiv

0+阅读 · 2022年9月30日

VIP会员

文章信息

相关主题

相关VIP内容

不可错过！700+ppt《因果推理》课程！杜克大学Fan Li教程

不可错过！700+ppt《因果推理》课程！杜克大学Fan Li教程

专知会员服务

72+阅读 · 2022年7月11日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

一份简单《图神经网络》教程，28页ppt

一份简单《图神经网络》教程，28页ppt

专知会员服务

127+阅读 · 2020年8月2日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

小规模训练指南：打造世界级大语言模型的关键方法

无人机编队飞行：复杂环境中作战的策略、挑战与应用

大模型APP，AI时代第一个爆款

从数据中心视角出发的高效大语言模型训练综述

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

开放知识图谱

1+阅读 · 2022年4月4日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

相关论文

A Nonlocal Graph-PDE and Higher-Order Geometric Integration for Image Labeling

Arxiv

0+阅读 · 2022年10月4日

Analysis of the performance of U-Net neural networks for the segmentation of living cells

Arxiv

0+阅读 · 2022年10月4日

Harnessing spectral representations for subgraph alignment

Arxiv

0+阅读 · 2022年10月3日

InitialGAN: A Language GAN with Completely Random Initialization

Arxiv

0+阅读 · 2022年10月3日

Bayesian Inference using the Proximal Mapping: Uncertainty Quantification under Varying Dimensionality

Arxiv

0+阅读 · 2022年10月3日

Yurinskii's Coupling for Martingales

Arxiv

0+阅读 · 2022年10月1日

A $\star$-product solver with spectral accuracy for non-autonomous ordinary differential equations

Arxiv

0+阅读 · 2022年9月30日

Efficient hyperbolic-parabolic models on multi-dimensional unbounded domains using an extended DG approach

Arxiv

0+阅读 · 2022年9月30日

Scale-invariant Bayesian Neural Networks with Connectivity Tangent Kernel

Arxiv

0+阅读 · 2022年9月30日

Pure-Circuit: Strong Inapproximability for PPAD

Arxiv

0+阅读 · 2022年9月30日

相关基金

全光开关用过渡金属Fe、Co、Ni量子点玻璃的制备及三阶非线性光学性质的研究

国家自然科学基金

0+阅读 · 2015年12月31日

掺杂的稀土氧化物非晶态纳米管可控制备及其热电性能研究

国家自然科学基金

0+阅读 · 2014年12月31日

预投射代数及其相关问题

国家自然科学基金

0+阅读 · 2013年12月31日

p-MgNiO/立方相n-MgZnO异质结界面特性、物性调控及其深紫外发光器件研究

国家自然科学基金

0+阅读 · 2013年12月31日

Tip60在oxLDL诱导的血管平滑肌细胞自噬及增殖中的作用机制

国家自然科学基金

0+阅读 · 2013年12月31日

Kronheimer-Nakajima quiver 模空间与有理曲面

国家自然科学基金

1+阅读 · 2013年12月31日

块体热电材料的热变形诱导再结晶与性能优化

国家自然科学基金

0+阅读 · 2012年12月31日

Ghrelin对胰岛β细胞分泌胰岛素和增殖的影响及分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

胚胎干细胞来源感光细胞在变性视网膜中的整合机制及Muller细胞的影响

国家自然科学基金

0+阅读 · 2012年12月31日

血管内皮生长因子A突变在先天性左室流出道梗阻畸形发生中的功能研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员