Deploying convolutional neural networks (CNNs) on mobile devices is difficult due to the limited memory and computation resources. We aim to design efficient neural networks for heterogeneous devices including CPU and GPU, by exploiting the redundancy in feature maps, which has rarely been investigated in neural architecture design. For CPU-like devices, we propose a novel CPU-efficient Ghost (C-Ghost) module to generate more feature maps from cheap operations. Based on a set of intrinsic feature maps, we apply a series of linear transformations with cheap cost to generate many ghost feature maps that could fully reveal information underlying intrinsic features. The proposed C-Ghost module can be taken as a plug-and-play component to upgrade existing convolutional neural networks. C-Ghost bottlenecks are designed to stack C-Ghost modules, and then the lightweight C-GhostNet can be easily established. We further consider the efficient networks for GPU devices. Without involving too many GPU-inefficient operations (e.g.,, depth-wise convolution) in a building stage, we propose to utilize the stage-wise feature redundancy to formulate GPU-efficient Ghost (G-Ghost) stage structure. The features in a stage are split into two parts where the first part is processed using the original block with fewer output channels for generating intrinsic features, and the other are generated using cheap operations by exploiting stage-wise redundancy. Experiments conducted on benchmarks demonstrate the effectiveness of the proposed C-Ghost module and the G-Ghost stage. C-GhostNet and G-GhostNet can achieve the optimal trade-off of accuracy and latency for CPU and GPU, respectively. Code is available at https://github.com/huawei-noah/CV-Backbones.
翻译:由于记忆和计算资源有限,很难在移动设备上部署心神经网络(CNN),因为存储和计算资源有限。我们的目标是设计高效的神经网络,用于包括CPU和GPU在内的各种装置。我们的目标是利用功能图中的冗余,在神经结构设计中很少对此进行调查。对于类似 CPU 的设备,我们提议建立一个新型的CPU-效率Ghost(C-Ghost)模块,以便从廉价操作中产生更多的特效地图。基于一套内在特征图,我们使用一系列廉价成本成本的线性转换来生成许多能够充分揭示内在特征的信息的幽灵特征图。拟议的 C-Ghost 模块可以作为一个插接插和播放组件,升级现有的脉动神经网络。C-Ghost瓶颈旨在堆装C-Ghost模块,然后轻量的C-Ghost网络可以很容易建立。我们进一步考虑GPU装置的高效网络。在建构阶段,不需要太多的CPU-wer/hick-complain操作(e-hick-comal),我们提议利用阶段的C-host-host-host-host-dealdealdestress-destratealde-de-destrevact se) 将C-sc-dedudududududustration 阶段的C-destration 将C-de 和G-deal-deal-de-de-deal-de-deal-deal-de-deal-destration 化G-deal-deal-deal-deal-deal-deal-de-de-de-deal-deal-deal-de-de-deal-de-de-de-de-deal-de-de-de-de-deal-deal-deal-de-de-de-de-de-de-de-de-de-de-deal-deal-deal-deal-deal-deal-deal-de-de-deal-deal-dection-deal-deal-deal-de-de-de-deal-deal-deal-deal-deal-deal-de-de-de-de-de-de-de-de-