A computational fluid dynamics (CFD) simulation framework for predicting complex flows is developed on the Tensor Processing Unit (TPU) platform. The TPU architecture is featured with accelerated performance of dense matrix multiplication, large high bandwidth memory, and a fast inter-chip interconnect, which makes it attractive for high-performance scientific computing. The CFD framework solves the variable-density Navier-Stokes equation using a Low-Mach approximation, and the governing equations are discretized by a finite difference method on a collocated structured mesh. It uses the graph-based TensorFlow as the programming paradigm. The accuracy and performance of this framework is studied both numerically and analytically, specifically focusing on effects of TPU-native single precision floating point arithmetic on solution accuracy. The algorithm and implementation are validated with canonical 2D and 3D Taylor Green vortex simulations. To demonstrate the capability for simulating turbulent flows, simulations are conducted for two configurations, namely the decaying homogeneous isotropic turbulence and a turbulent planar jet. Both simulations show good statistical agreement with reference solutions. The performance analysis shows a linear weak scaling and a super-linear strong scaling up to a full TPU v3 pod with 2048 cores.
翻译:在Tensor处理股(TPU)平台上开发了预测复杂流量的计算流动态模拟框架(CFD)模拟框架(CFD),用于预测复杂流量的模拟框架。TPU架构的特点是密集矩阵乘法加速、高带内存和快速断裂互连功能加速运行,从而吸引高性能科学计算。CFD框架使用低兆赫近比解决可变密度纳维-Stokes方程式问题,而治理方程式则通过对合用结构网格的网状网状网状网状网状的有限差异方法分解。它使用基于图形的TensorFlow作为编程范式。这个框架的准确性和性能通过数字和分析两种方式得到研究,具体侧重于TPU的单一精确浮动点对解决方案准确性的影响。CFD框架的算法和执行通过Canononical 2D 和 3D Taylor Greekt Putex模拟来验证。为了展示刺激动荡流的能力,对两种配置进行了模拟,即腐蚀性等同质的同质流流和扰动平平式平式平式平式平式平板式平流。两个模拟都显示了一个带有较弱的模拟,显示一个较弱的高级的模拟显示的模拟显示的模拟式的模拟式的模拟显示。