Convolutional Neural Networks (CNNs) such as ResNet-50, DenseNet-40, and ResNeXt-56 are severely over-parameterized, demanding training resources that scale exponentially with increases in model depth. In this paper, we propose the Entropy-Based Convolutional Layer Estimation (EBCLE) heuristic, a robust and simple yet effective approach to resolving over-parameterization with respect to the network depth of CNN models. The EBCLE heuristic employs a priori knowledge of the entropic distribution of the input data to determine an upper bound on convolutional network depth, beyond which identity transformations prevail and contribute little to model performance. Restricting depth redundancies by forcing feature compression and abstraction curbs over-parameterization while decreasing training time by 24.99%-78.59% without degrading model performance. We present empirical evidence that broader yet shallower models trained with the EBCLE heuristic match or outperform the baseline classification accuracies of narrower yet deeper models. The EBCLE heuristic is architecture-agnostic, and EBCLE-based CNN models restrict depth redundancies, making better use of the available computational resources. The proposed heuristic offers researchers an analytical means of justifying their hyperparameter (HP) choices for CNNs. The EBCLE heuristic was empirically validated on five benchmark datasets (ImageNet32, CIFAR-10/100, STL-10, MNIST) and four network architectures (DenseNet, ResNet, ResNeXt, and EfficientNet B0-B2), with appropriate statistical tests employed to support the conclusions presented in this paper.
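The abstract does not specify the exact entropic quantity EBCLE computes, so the following is only a minimal illustrative sketch: it estimates the Shannon entropy of a dataset's pooled pixel-intensity distribution, the kind of a priori measure a depth-bounding heuristic of this sort could draw on. The helper name dataset_entropy and the intensity-histogram formulation are assumptions for illustration, not the authors' published method.

    import numpy as np

    def dataset_entropy(images, bins=256):
        """Shannon entropy (bits) of the pixel-intensity distribution
        pooled over a sample of images.

        images: array-like of shape (N, H, W) or (N, H, W, C),
        with integer intensities in [0, bins).
        NOTE: illustrative sketch only; EBCLE's actual entropy
        measure is defined in the paper body, not this abstract.
        """
        values = np.asarray(images).ravel().astype(np.int64)
        counts = np.bincount(values, minlength=bins)
        p = counts / counts.sum()
        p = p[p > 0]                       # drop empty bins; 0*log(0) := 0
        return -np.sum(p * np.log2(p))

    # Hypothetical usage: a low-entropy dataset should saturate at a
    # shallower depth bound than a high-entropy one.
    if __name__ == "__main__":
        rng = np.random.default_rng(0)
        low = rng.integers(0, 8, size=(100, 32, 32))     # narrow intensity range
        high = rng.integers(0, 256, size=(100, 32, 32))  # full intensity range
        print(f"low-entropy sample:  {dataset_entropy(low):.2f} bits")
        print(f"high-entropy sample: {dataset_entropy(high):.2f} bits")

Under these assumptions, the entropy estimate would serve as the a priori input from which an upper bound on convolutional depth is derived; the mapping from entropy to layer count is the heuristic's contribution and is not reproduced here.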