Training deep neural networks (DNNs) is becoming increasingly resource- and energy-intensive every year. Unfortunately, existing work primarily focuses on optimizing DNN training for faster completion, often without considering its impact on energy efficiency. In this paper, we observe that common practices for improving training performance often lead to inefficient energy usage. More importantly, we demonstrate that there is a tradeoff between energy consumption and performance optimization. To this end, we propose Zeus, an optimization framework that navigates this tradeoff by automatically finding optimal job- and GPU-level configurations for recurring DNN training jobs. Zeus uses an online exploration-exploitation approach in conjunction with just-in-time energy profiling, averting the need for expensive offline measurements while adapting to data drift over time. Our evaluation shows that Zeus can improve the energy efficiency of DNN training by 15.3%-75.8% for diverse workloads.
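To make the abstract's "online exploration-exploitation" idea concrete, the following is a minimal, hypothetical sketch of how a recurring training job could navigate the energy-performance tradeoff by tuning a GPU power limit online. All names (`simulate_job`, `cost`, the `eta` knob, the candidate wattages) are illustrative assumptions, not the paper's actual API, and the profiling function is a stand-in for real GPU measurements.

```python
import random

random.seed(0)  # deterministic for reproducibility of this sketch

def simulate_job(power_limit_w):
    """Hypothetical stand-in for one recurrence of a training job.
    Returns (energy_joules, time_seconds); a real system would
    profile the GPU just-in-time instead."""
    time_s = 3600.0 * (300.0 / power_limit_w) ** 0.5  # lower power -> slower
    energy_j = power_limit_w * time_s * 0.9           # not all draw is useful work
    return energy_j, time_s

def cost(energy_j, time_s, eta=0.5, max_power_w=300.0):
    # Weighted objective trading energy against time:
    # eta=1 favors pure energy savings, eta=0 favors pure speed.
    return eta * energy_j + (1 - eta) * max_power_w * time_s

def choose_power_limit(candidates, history, epsilon=0.2):
    """Epsilon-greedy exploration-exploitation over power limits:
    try every candidate once, then mostly exploit the cheapest one."""
    untried = [p for p in candidates if p not in history]
    if untried:
        return untried[0]
    if random.random() < epsilon:
        return random.choice(candidates)
    return min(candidates, key=lambda p: history[p])

candidates = [150, 200, 250, 300]  # candidate GPU power limits in watts
history = {}
for _ in range(20):  # one iteration per recurring job submission
    p = choose_power_limit(candidates, history)
    e, t = simulate_job(p)
    c = cost(e, t)
    # Online update: exponential moving average of observed cost,
    # so the estimate can track data drift over time.
    history[p] = c if p not in history else 0.7 * history[p] + 0.3 * c

best = min(history, key=history.get)
```

The exponential moving average is one simple way to let the cost estimate adapt when workload characteristics drift between recurrences, rather than trusting a single offline measurement.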