Rethinking Neural Networks With Benford's Law - Zhuanzhi Papers

August 18, 2021

Rethinking Neural Networks With Benford's Law

Surya Kant Sahu, Abhinav Java, Arshad Shaikh, Yannic Kilcher

Benford's Law (BL), or the Significant Digit Law, defines the probability distribution of the first digit of numerical values in a data sample. This law is observed in many naturally occurring datasets. It can be seen as a measure of the naturalness of a given distribution and finds application in areas like anomaly and fraud detection. In this work, we address the following question: is the distribution of a neural network's parameters related to the network's generalization capability? To that end, we first define a metric, MLH (Model Enthalpy), that measures the closeness of a set of numbers to Benford's Law, and we show empirically that it is a strong predictor of validation accuracy. Second, we use MLH as an alternative to validation accuracy for early stopping, removing the need for a validation set. We provide experimental evidence that, even if the optimal size of the validation set is known beforehand, the peak test accuracy attained with a validation set is lower than that attained without one. Finally, we investigate the connection of BL to the Free Energy Principle and the First Law of Thermodynamics, showing that MLH is a component of the internal energy of the learning system and that optimization is analogous to minimizing the total energy to attain equilibrium.
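To make the idea of "closeness to Benford's Law" concrete, here is a minimal sketch in Python. Benford's Law assigns the leading digit d (1 to 9) the probability log10(1 + 1/d). The abstract does not give the formula for MLH, so the function `benford_score` below is only an assumption for illustration: it uses the Pearson correlation between the observed first-digit frequencies and the Benford probabilities as one plausible closeness measure. All function names here are hypothetical and not taken from the paper.

```python
import math

# Benford's Law: P(d) = log10(1 + 1/d) for leading digit d = 1..9.
BENFORD = [math.log10(1 + 1 / d) for d in range(1, 10)]


def first_digit(x):
    """Return the leading significant digit (1-9) of a nonzero finite number."""
    # Scientific notation puts the leading digit first, e.g. 0.0042 -> '4.2...e-03'.
    # 17 decimals avoids rounding a value like 9.9999999 up to '1.0e+01'.
    return int(f"{abs(x):.17e}"[0])


def digit_frequencies(values):
    """Empirical leading-digit distribution, skipping zeros and non-finite values."""
    counts = [0] * 9
    for v in values:
        if v != 0 and math.isfinite(v):
            counts[first_digit(v) - 1] += 1
    total = sum(counts)
    if total == 0:
        raise ValueError("no nonzero finite values to analyse")
    return [c / total for c in counts]


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    denom = math.sqrt(var_x * var_y)
    return cov / denom if denom > 0 else 0.0


def benford_score(values):
    """Assumed closeness measure: correlation of observed digit frequencies with Benford."""
    return pearson(digit_frequencies(values), BENFORD)


# Usage: values spread evenly on a log scale over three decades follow Benford closely.
log_uniform = [10 ** (3 * k / 1000) for k in range(1000)]
print(round(benford_score(log_uniform), 4))
```

In an early-stopping setting like the one the abstract describes, a score of this kind would be computed on the flattened model parameters after each epoch and tracked in place of validation accuracy. Whether the correlation-based score here matches the paper's MLH is not something the abstract confirms.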
