Physics-based simulation has been actively employed in generating offline visual effects in the film and animation industry. However, the computations required for high-quality scenarios are generally immense, deterring its adoption in real-time applications, e.g., virtual production, avatar live-streaming, and cloud gaming. We summarize the principles that can accelerate the computation pipeline on single-GPU and multi-GPU platforms through extensive investigation and comprehension of modern GPU architecture. We further demonstrate the effectiveness of these principles by applying them to the material point method to build up our framework, which achieves $1.7\times$--$8.6\times$ speedup on a single GPU and $2.5\times$--$14.8\times$ on four GPUs compared to the state-of-the-art. Our pipeline is specifically designed for real-time applications (i.e., scenarios with small to medium particles) and achieves significant multi-GPU efficiency. We demonstrate our pipeline by simulating a snow scenario with 1.33M particles and a fountain scenario with 143K particles in real-time (on average, 68.5 and 55.9 frame-per-second, respectively) on four NVIDIA Tesla V100 GPUs interconnected with NVLinks.
翻译:在电影和动画业中,在产生离线视觉效应方面积极采用了物理模拟,然而,高质量设想方案所需的计算方法一般是巨大的,妨碍了在实时应用中采用,例如虚拟生产、阿凡达现场流和云游戏。我们总结了通过广泛调查和理解现代GPU结构,可以加快单GPU和多GPU平台的计算管道的原则。我们进一步展示了这些原则的有效性,将这些原则应用于建设我们框架的物质点方法,即实现1.7美元-8.6美元-8.6美元的速度,在单一GPU和2.5美元-14.8美元的时间上,在4个GPU上采用,而与最新工艺相比,我们为4个GPU(即中小粒子假设)和多GPU平台专门设计了实时应用的计算管道。我们用1.33M粒子模拟了雪情景,用143K粒子实时(平均为68.5美元)和5美元-15.9美元喷泉情景,在VIFA4-NV-100秒上,我们展示了管道。