This paper presents the first parallel implementation of the novel "Interpolated Factored Green Function" (IFGF) method introduced recently for the accelerated evaluation of discrete integral operators arising in wave scattering and other areas (Bauinger and Bruno, Jour. Computat. Phys., 2021). On the basis of the hierarchical IFGF interpolation strategy, the proposed (hybrid MPI-OpenMP) parallel implementation results in highly efficient data communication, and it exhibits in practice excellent parallel scaling up to large numbers of cores - without any hard limitations on the number of cores concurrently employed with high efficiency. Moreover, on any given number of cores, the proposed parallel approach preserves the O(N log N) computing cost inherent in the sequential version of the IFGF algorithm. Unlike other approaches, the IFGF method does not utilize the Fast Fourier Transform (FFT), and it is thus better suited than other methods for efficient parallelization in distributed-memory computer systems. In particular, the IFGF method relies on a "peer-to-peer" strategy wherein, at every level, field propagation is directly enacted via "exchanges" between "peer" polynomials of low and constant degree, without data accumulation in large-scale "telephone-central" mathematical constructs such as those in the Fast Multipole Method (FMM) or pure FFT-based approaches. A variety of numerical results presented in this paper illustrate the character of the proposed parallel algorithm, including excellent weak and strong parallel scaling properties in all cases considered - for problems of up to 4,096 wavelengths in acoustic size, and scaling tests spanning from 1 compute core to all 1,680 cores available in the High Performance Computing cluster used.
翻译:本文介绍了最近为加速评价波散及其他地区产生的离散整体操作员而采用的新颖的“加固的绿色功能”(IFGF)方法(Bauinger和Bruno, Jour.Computat.Phys., 2021年)的首次平行实施。根据IFGF的等级分解战略,拟议的(混合的MPI-OpenMP)平行实施方法在高效的数据通信方面产生了高度高效的平行效果,在实际中显示了向大量核心的平行扩展――对同时高效使用的核心数量没有任何严格的限制。此外,关于任何特定的核心数量,拟议的平行方法保留了IFGF的O(N log N) 顺序版算法所固有的成本。与其他方法不同,IFGFFF方法没有使用快速的四重变换(FFT),因此它比其他方法更适合在分布式计算机系统中高效的平行化。 特别是,IFFFFF方法基于“peer-pe-peer-peer”战略,在每一级别上都直接通过“货币级的快速的货币递增的货币递增的货币递增”的货币递化数据测试,在“不断的货币中,包括不断变压的货币的货币的货币的货币的货币压的不断的计算中,在1级变压压中直接的计算中采用。