Parallel optimizations for the 2D Hierarchical Poincar\'e-Steklov (HPS) discretization scheme are described. HPS is a multi-domain spectral collocation scheme that allows for combining very high order discretizations with direct solvers, making the discretization powerful in resolving highly oscillatory solutions to high accuracy. HPS can be viewed as a domain decomposition scheme where the domains are connected directly through the use of a sparse direct solver. This manuscript describes optimizations of HPS that are simple to implement, and that leverage batched linear algebra on modern hybrid architectures to improve the practical speed of the solver. In particular, the manuscript demonstrates that the traditionally high cost of performing local static condensation for discretizations involving very high local order $p$ can be reduced dramatically.
翻译:描述2D 等级波因卡\ e- Steklov (HPS) 离散方案的平行优化。 HPS 是一个多域光谱合用方案,它能够将非常高顺序离散与直接解答器相结合,使得离散能以高精度解决高度分解解决方案。 HPS 可以被视为一个域分解方案,通过使用稀疏的直接解答器将域直接连接起来。本稿描述HPS 的优化,易于实施,并且利用现代混合结构的分批线性代数提高解析器的实际速度。 特别是, 手稿表明,对于涉及非常高本地顺序的离散化进行本地静态凝聚的成本历来很高,可以大幅降低。