The field of video compression has developed some of the most sophisticated and efficient compression algorithms known in the literature, enabling very high compressibility for little loss of information. Whilst some of these techniques are domain specific, many of their underlying principles are universal in that they can be adapted and applied for compressing different types of data. In this work we present DeepCABAC, a compression algorithm for deep neural networks that is based on one of the state-of-the-art video coding techniques. Concretely, it applies a Context-based Adaptive Binary Arithmetic Coder (CABAC) to the network's parameters, which was originally designed for the H.264/AVC video coding standard and became the state-of-the-art for lossless compression. Moreover, DeepCABAC employs a novel quantization scheme that minimizes the rate-distortion function while simultaneously taking the impact of quantization onto the accuracy of the network into account. Experimental results show that DeepCABAC consistently attains higher compression rates than previously proposed coding techniques for neural network compression. For instance, it is able to compress the VGG16 ImageNet model by x63.6 with no loss of accuracy, thus being able to represent the entire network with merely 8.7MB. The source code for encoding and decoding can be found at https://github.com/fraunhoferhhi/DeepCABAC.
翻译:视频压缩领域开发了一些文献中已知的最尖端和最高效的压缩算法, 使得对信息略为丢失的参数可以非常高的压缩。 虽然其中一些技术是域性特定, 但其中许多基本原理是普遍性的, 因为它们可以被调整和用于压缩不同类型的数据。 在这项工作中, 我们展示了深海神经网络的压缩算法DeepCABAC, 这是基于最先进的视频编码技术之一的深神经网络的压缩算法。 具体地说, 它对网络参数应用基于环境的适应性二进制调二进制代码( CABAC), 它最初是为 H.264/ AVC 视频编码标准设计的, 成为无损压缩数据的最佳标准。 此外, DeepCABAC 采用了一种新的分解方案, 最大限度地减少电压功能, 同时将四分解的影响与网络的准确性联系起来。 实验结果表明, 深CABAC 与先前提议的神经网络压缩网络的调控技术相比, 不断达到更高的压缩率。 例如, 它能够将8GABC 的精确度与整个网络进行压缩。