When engineers train deep learning models, they are very much "flying blind". Commonly used approaches for real-time training diagnostics, such as monitoring the train/test loss, are limited. Assessing a network's training process solely through these performance indicators is akin to debugging software without access to internal states through a debugger. To address this, we present Cockpit, a collection of instruments that enable a closer look into the inner workings of a learning machine, and a more informative and meaningful status report for practitioners. It facilitates the identification of learning phases and failure modes, like ill-chosen hyperparameters. These instruments leverage novel higher-order information about the gradient distribution and curvature, which has only recently become efficiently accessible. We believe that such a debugging tool, which we open-source for PyTorch, represents an important step to improve troubleshooting the training process, reveal new insights, and help develop novel methods and heuristics.
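As a rough illustration of the kind of signal such instruments expose, the sketch below (plain NumPy, not the Cockpit API) computes per-sample gradients for a linear least-squares problem and summarizes their distribution: the mini-batch gradient norm plus a simple "gradient noise" measure (the trace of the per-sample gradient covariance). All names here are illustrative; Cockpit itself obtains these quantities efficiently during backpropagation.

```python
import numpy as np

# Illustrative sketch (NOT the Cockpit API): per-sample gradients for
# linear least-squares, summarized beyond the scalar loss value.
rng = np.random.default_rng(0)
n, d = 32, 4
X = rng.normal(size=(n, d))
w_true = rng.normal(size=d)
y = X @ w_true + 0.1 * rng.normal(size=n)

w = np.zeros(d)                                 # current parameters
residual = X @ w - y                            # shape (n,)
# Gradient of (x_i @ w - y_i)**2 w.r.t. w, one row per sample:
per_sample_grads = 2.0 * residual[:, None] * X  # shape (n, d)

mean_grad = per_sample_grads.mean(axis=0)       # the mini-batch gradient
grad_norm = np.linalg.norm(mean_grad)
# Trace of the per-sample gradient covariance: a crude "gradient noise"
# diagnostic of the kind Cockpit-style instruments track over training.
noise = per_sample_grads.var(axis=0).sum()

print(f"batch gradient norm: {grad_norm:.3f}")
print(f"gradient noise (trace of covariance): {noise:.3f}")
```

Tracking how such distributional quantities evolve over iterations, rather than only the loss curve, is what lets a practitioner distinguish, for example, a too-large learning rate from noisy gradients.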