Understanding Neural Networks and Individual Neuron Importance via Information-Ordered Cumulative Ablation - Zhuanzhi Papers


INFORMS · Interpretability · Neural Networks · Networking · Mutual Information

June 9, 2021

Understanding Neural Networks and Individual Neuron Importance via Information-Ordered Cumulative Ablation


Rana Ali Amjad, Kairen Liu, Bernhard C. Geiger

From arXiv, 12 pages; accepted for publication in IEEE Transactions on Neural Networks and Learning Systems

In this work, we investigate the use of three information-theoretic quantities -- entropy, mutual information with the class variable, and a class selectivity measure based on Kullback-Leibler divergence -- to understand and study the behavior of already trained fully-connected feed-forward neural networks. We analyze the connection between these information-theoretic quantities and classification performance on the test set by cumulatively ablating neurons in networks trained on MNIST, FashionMNIST, and CIFAR-10. Our results parallel those recently published by Morcos et al., indicating that class selectivity is not a good indicator for classification performance. However, looking at individual layers separately, both mutual information and class selectivity are positively correlated with classification performance, at least for networks with ReLU activation functions. We provide explanations for this phenomenon and conclude that it is ill-advised to compare the proposed information-theoretic quantities across layers. Furthermore, we show that cumulative ablation of neurons with ascending or descending information-theoretic quantities can be used to formulate hypotheses regarding the joint behavior of multiple neurons, such as redundancy and synergy, with comparably low computational cost. We also draw connections to the information bottleneck theory for neural networks.
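The procedure described in the abstract can be illustrated with a small, self-contained sketch. It is not the authors' implementation: the abstract does not specify how neuron outputs are quantized, the exact form of the KL-based selectivity measure, or how a neuron is ablated. The sketch assumes a 1-bit quantizer (`binarize`), a selectivity defined as the maximum over classes of the KL divergence between the class-conditional and marginal output distributions, and ablation by forcing a neuron's output to zero. All function names, the toy activations, and the `predict` read-out are hypothetical.

```python
import math
from collections import Counter


def entropy_bits(symbols):
    """Empirical Shannon entropy, in bits, of a sequence of hashable symbols."""
    n = len(symbols)
    return -sum(c / n * math.log2(c / n) for c in Counter(symbols).values())


def mutual_information_bits(t, y):
    """Empirical I(T;Y) = H(T) + H(Y) - H(T,Y) for two equal-length sequences."""
    return entropy_bits(t) + entropy_bits(y) - entropy_bits(list(zip(t, y)))


def kl_selectivity_bits(t, y):
    """max over classes c of D(P(T|Y=c) || P(T)); an assumed form of the measure."""
    n = len(t)
    p_t = {v: c / n for v, c in Counter(t).items()}
    best = 0.0
    for c in set(y):
        t_c = [ti for ti, yi in zip(t, y) if yi == c]
        kl = sum(k / len(t_c) * math.log2(k / len(t_c) / p_t[v])
                 for v, k in Counter(t_c).items())
        best = max(best, kl)
    return best


def binarize(column):
    """1-bit quantizer (an assumption): 1 if the activation is positive, else 0."""
    return [int(a > 0) for a in column]


def cumulative_ablation(acts, labels, scores, predict, descending=False):
    """Zero out neurons one by one in score order; return the order and accuracies."""
    order = sorted(range(len(scores)), key=lambda j: scores[j], reverse=descending)
    acts = [list(row) for row in acts]  # work on a copy of the activations

    def accuracy():
        return sum(predict(r) == l for r, l in zip(acts, labels)) / len(labels)

    accs = [accuracy()]  # accuracy before any neuron is ablated
    for j in order:
        for row in acts:
            row[j] = 0.0  # ablation modeled as forcing the output to zero
        accs.append(accuracy())
    return order, accs


# Toy layer with two neurons: neuron 0 copies the label, neuron 1 is independent of it.
acts = [[0.0, 1.0], [0.0, 0.0], [1.0, 1.0], [1.0, 0.0]]
labels = [0, 0, 1, 1]
predict = lambda row: int(row[0] > 0.5)  # hypothetical read-out standing in for later layers

columns = list(zip(*acts))
scores = [mutual_information_bits(binarize(col), labels) for col in columns]
order_asc, acc_asc = cumulative_ablation(acts, labels, scores, predict)
order_desc, acc_desc = cumulative_ablation(acts, labels, scores, predict, descending=True)
print(scores, order_asc, acc_asc, order_desc, acc_desc)
```

In this toy case, ablating in ascending order of mutual information leaves accuracy unchanged until the informative neuron is removed, while descending order degrades it immediately. Comparing the two accuracy curves is the kind of evidence the abstract suggests for forming hypotheses about redundancy and synergy among neurons within a layer.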

