The quest for determinism in machine learning has disproportionately focused on characterizing the impact of noise introduced by algorithmic design choices. In this work, we address a less well understood and studied question: how does our choice of tooling introduce randomness into deep neural network training? We conduct large-scale experiments across different types of hardware, accelerators, state-of-the-art networks, and open-source datasets to characterize how tooling choices contribute to the level of non-determinism in a system, the impact of said non-determinism, and the cost of eliminating different sources of noise. Our findings are surprising, and suggest that the impact of non-determinism is nuanced. While top-line metrics such as top-1 accuracy are not noticeably impacted, model performance on certain parts of the data distribution is far more sensitive to the introduction of randomness. Our results suggest that deterministic tooling is critical for AI safety. However, we also find that the cost of ensuring determinism varies dramatically between neural network architectures and hardware types, e.g., with overhead up to $746\%$, $241\%$, and $196\%$ on a spectrum of widely used GPU accelerator architectures, relative to non-deterministic training. The source code used in this paper is available at https://github.com/usyd-fsalab/NeuralNetworkRandomness.
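To make the idea of tooling-induced randomness concrete, the sketch below shows a minimal seeding routine of the kind typically used to pin down software sources of noise in a training run. It is an illustrative example, not the paper's experimental setup: the `seed_everything` and `noisy_update` names are hypothetical, and the framework-specific knobs (e.g., PyTorch's `torch.use_deterministic_algorithms`) are shown only as comments so the sketch stays self-contained with the standard library.

```python
import os
import random

def seed_everything(seed: int) -> None:
    """Seed the common software sources of randomness in a run.

    In a real deep learning pipeline one would additionally seed the
    framework and request deterministic kernels, e.g. in PyTorch:
        torch.manual_seed(seed)
        torch.use_deterministic_algorithms(True)  # may cost throughput
    Hardware-level non-determinism (e.g. atomic-add reduction order on
    GPUs) is not controlled by seeding alone, which is the gap the
    paper's experiments probe.
    """
    os.environ["PYTHONHASHSEED"] = str(seed)
    random.seed(seed)

def noisy_update(weights, seed):
    """Toy 'training step' whose result depends only on RNG state."""
    seed_everything(seed)
    return [w + random.gauss(0.0, 0.01) for w in weights]

# With identical seeds, two runs of this pure-software step agree exactly.
run_a = noisy_update([0.1, 0.2, 0.3], seed=42)
run_b = noisy_update([0.1, 0.2, 0.3], seed=42)
assert run_a == run_b
```

The point of the toy example is that seeding eliminates only the software-controlled noise; the overheads quoted in the abstract arise when deterministic kernels must also be forced on the accelerator side.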