Due to the advent of modern embedded systems and mobile devices with constrained resources, there is a great demand for incredibly efficient deep neural networks for machine learning purposes. There is also a growing concern of privacy and confidentiality of user data within the general public when their data is processed and stored in an external server which has further fueled the need for developing such efficient neural networks for real-time inference on local embedded systems. The scope of our work presented in this paper is limited to image classification using a convolutional neural network. A Convolutional Neural Network (CNN) is a class of Deep Neural Network (DNN) widely used in the analysis of visual images captured by an image sensor, designed to extract information and convert it into meaningful representations for real-time inference of the input data. In this paper, we propose a neoteric variant of deep convolutional neural network architecture to ameliorate the performance of existing CNN architectures for real-time inference on embedded systems. We show that this architecture, dubbed CondenseNeXt, is remarkably efficient in comparison to the baseline neural network architecture, CondenseNet, by reducing trainable parameters and FLOPs required to train the network whilst maintaining a balance between the trained model size of less than 3.0 MB and accuracy trade-off resulting in an unprecedented computational efficiency.
翻译:由于现代嵌入系统和资源有限的移动装置的出现,对机器学习目的的高度高效的深神经网络的需求非常巨大,而且当用户数据被处理和储存在外部服务器上时,公众对用户数据的隐私和保密问题日益感到关切,因为外部服务器进一步增加了发展这种高效神经网络的需要,以便实时推断本地嵌入系统。本文介绍的我们的工作范围限于利用一个神经神经神经网络进行图像分类。一个革命神经网络(CNN)是一个深神经网络(DNNN)的阶级,广泛用于分析图像传感器所摄取的视觉图像,目的是提取信息并将其转换为有意义的表述,以便实时推断输入数据。在本文件中,我们提出一个深电动神经网络结构的新化变体变体,以改善现有CNN结构在嵌入系统上实时推断的性能。我们显示,这个结构,调制成的CondenseNeXt, 与基本神经网络结构、CondenseNet(CondenseNet)相比,效率非常高,目的是用来提取信息,将信息转换为有意义的表达方式,从而降低对贸易效率的模型和计算效率的要求。