In this paper, we analyze batch normalization from the perspective of discriminability and identify a disadvantage overlooked by previous studies: differences in the $l_2$ norms of sample features can hinder batch normalization from obtaining more distinguishable inter-class features and more compact intra-class features. To address this issue, we propose a simple yet effective method that equalizes the $l_2$ norms of sample features. Concretely, we $l_2$-normalize each sample feature before feeding it into batch normalization, so that all features have the same magnitude. Since the proposed method combines $l_2$ normalization and batch normalization, we name it $L_2$BN. $L_2$BN strengthens the compactness of intra-class features and enlarges the discrepancy between inter-class features. It is easy to implement and takes effect without any additional parameters or hyper-parameters. We evaluate the effectiveness of $L_2$BN through extensive experiments with various models on image classification and acoustic scene classification tasks. The results demonstrate that $L_2$BN boosts the generalization ability of various neural network models and achieves considerable performance improvements.
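To make the idea concrete, the following is a minimal PyTorch-style sketch of such a layer. The class name L2BN2d is ours, and the choice to normalize over each sample's entire feature tensor is an assumption; the authors' implementation may differ in details such as the normalization axes or an additional scaling constant.

```python
import torch
import torch.nn as nn


class L2BN2d(nn.Module):
    """Sketch of the L2BN idea: l2-normalize each sample's feature
    tensor to unit magnitude, then apply standard batch normalization.
    (Hypothetical module; not the authors' reference implementation.)"""

    def __init__(self, num_features: int):
        super().__init__()
        self.bn = nn.BatchNorm2d(num_features)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (N, C, H, W). Compute each sample's l2 norm over all of its
        # feature values so that every sample ends up with the same magnitude.
        norm = x.flatten(1).norm(p=2, dim=1).clamp_min(1e-12)  # shape (N,)
        x = x / norm.view(-1, 1, 1, 1)
        # Batch normalization is then applied to the magnitude-equalized features.
        return self.bn(x)
```

Under these assumptions, using the layer amounts to replacing each BatchNorm2d in a backbone with L2BN2d; since the layer only rescales each sample before the existing batch normalization, it introduces no new parameters or hyper-parameters.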