Conventional deep convolutional neural networks (CNNs) apply convolution operators uniformly in space across all feature maps for hundreds of layers - this incurs a high computational cost for real-time applications. For many problems such as object detection and semantic segmentation, we are able to obtain a low-cost computation mask, either from a priori problem knowledge, or from a low-resolution segmentation network. We show that such computation masks can be used to reduce computation in the high-resolution main network. Variants of sparse activation CNNs have previously been explored on small-scale tasks and showed no degradation in terms of object classification accuracy, but often measured gains in terms of theoretical FLOPs without realizing a practical speed-up when compared to highly optimized dense convolution implementations. In this work, we leverage the sparsity structure of computation masks and propose a novel tiling-based sparse convolution algorithm. We verified the effectiveness of our sparse CNN on LiDAR-based 3D object detection, and we report significant wall-clock speed-ups compared to dense convolution without noticeable loss of accuracy.
翻译:在数百层的所有地貌地图上,常规的深层电动神经网络(CNNs)在空间上统一应用各种变速操作者,这给实时应用带来了很高的计算成本。对于物体探测和语义分解等许多问题,我们可以从先验问题知识或低分辨率分解网络获得低成本的计算面罩。我们表明,这种计算面罩可以用来减少高分辨率主网络的计算。曾经探索过稀有激活CNN的变异功能,在物体分类精确度方面没有出现退化,但在理论FLOP方面却经常衡量收益,而与高度优化的密集演化实施相比,并没有实现实际的加速。在这项工作中,我们利用计算面罩的宽度结构,提出一个新的基于电动的稀有卷动演算法。我们在基于LIDAR的3DD物体探测上核实了我们稀有的CNN的光电磁波的功效,我们报告了与不明显丧失精确度的密度共变速相比,墙时速增速率。