Graphics Processing Units (GPUs) have revolutionized the computing landscape over the past decade. However, the growing energy demands of data centres and computing facilities equipped with GPUs come with significant capital and environmental costs. The energy consumption of GPU applications greatly depend on how well they are optimized. Auto-tuning is an effective and commonly applied technique of finding the optimal combination of algorithm, application, and hardware parameters to optimize performance of a GPU application. In this paper, we introduce new energy monitoring and optimization capabilities in Kernel Tuner, a generic auto-tuning tool for GPU applications. These capabilities enable us to investigate the difference between tuning for execution time and various approaches to improve energy efficiency, and investigate the differences in tuning difficulty. Additionally, our model for GPU power consumption greatly reduces the large tuning search space by providing clock frequencies for which a GPU is likely most energy efficient.
 翻译:过去十年来,图形处理单位(GPU)使计算格局发生了革命性的变化,然而,数据中心和计算机设备配备GPU的能源需求不断增加,带来了巨大的资本和环境成本。GPU应用的能源消耗在很大程度上取决于其优化程度。自动调整是一种有效且常用的技术,可以找到算法、应用和硬件参数的最佳组合,优化GPU应用的性能。在本文中,我们引入了Kernel Tuner的新的能源监测和优化能力,这是一个通用的GPU应用自动调控工具。这些能力使我们能够调查执行时间调控与提高能效的各种方法之间的差异,并调查调控难度的差异。此外,我们的GPU电力消费模式通过提供GPU可能最高效的时钟频率,大大减少了大型的搜索空间。