This work proposes a new GPU thread map for standard $m$-simplex domains, that scales its speedup with dimension and is energy efficient compared to other state of the art approaches. The main contributions of this work are the formulation of the new block-space map $\mathcal{H}: \mathbb{Z}^m \mapsto \mathbb{Z}^m$ which is analyzed in terms of resource usage, and its experimental evaluation in terms of speedup over a bounding box approach and energy efficiency as elements per second per Watt. Results from the analysis show that $\mathcal{H}$ has a potential speedup of up to $2\times$ and $6\times$ for $2$ and $3$-simplices, respectively. Experimental evaluation shows that $\mathcal{H}$ is competitive for $2$-simplices, reaching $1.2\times \sim 2.0\times$ of speedup for different tests, which is on par with the fastest state of the art approaches. For $3$-simplices $\mathcal{H}$ reaches up to $1.3\times \sim 6.0\times$ of speedup making it the fastest of all. The extension of $\mathcal{H}$ to higher dimensional $m$-simplices is feasible and has a potential speedup that scales as $m!$ given a proper selection of parameters $r, \beta$ which are the scaling and replication factors, respectively. In terms of energy consumption, although $\mathcal{H}$ is among the highest in power consumption, it compensates by its short duration, making it one of the most energy efficient approaches. Lastly, further improvements with Tensor and Ray Tracing Cores are analyzed, giving insights to leverage each one of them. The results obtained in this work show that $\mathcal{H}$ is a scalable and energy efficient map that can contribute to the efficiency of GPU applications when they need to process standard $m$-simplex domains, such as Cellular Automata or PDE simulations.
翻译:这项工作为标准 $m 标准 $ spreax 域提出了一个新的 GPU 线索映射图, 该图将加速度与维度比, 并且与艺术的其他状态方法相比, 节能效率也更高。 这项工作的主要贡献是制定新的块空间映射$\ mathcal{H} :\ mathbb ⁇ m\ mmapsto\ mamathb ⁇ m 美元, 该图用资源使用情况分析, 其实验性评价是: 加速一个捆绑框方式和节能效率, 以每秒每秒的元素来测量它的节能。 分析结果表明, $macal{H} 可能加速到2美元 美元 和 6美元 美元 的速率。 实验性评估显示, $mcall\ h} 将能量的年期提升到 美元 。 当电量的年度比值达到1美元时, 。 3 美元比值 美元比值增加 美元 速度 。