This paper studies the Binary Neural Networks (BNNs) in which weights and activations are both binarized into 1-bit values, thus greatly reducing the memory usage and computational complexity. Since the modern deep neural networks are of sophisticated design with complex architecture for the accuracy reason, the diversity on distributions of weights and activations is very high. Therefore, the conventional sign function cannot be well used for effectively binarizing full-precision values in BNNs. To this end, we present a simple yet effective approach called AdaBin to adaptively obtain the optimal binary sets $\{b_1, b_2\}$ ($b_1, b_2\in \mathbb{R}$) of weights and activations for each layer instead of a fixed set (\textit{i.e.}, $\{-1, +1\}$). In this way, the proposed method can better fit different distributions and increase the representation ability of binarized features. In practice, we use the center position and distance of 1-bit values to define a new binary quantization function. For the weights, we propose an equalization method to align the symmetrical center of binary distribution to real-valued distribution, and minimize the Kullback-Leibler divergence of them. Meanwhile, we introduce a gradient-based optimization method to get these two parameters for activations, which are jointly trained in an end-to-end manner. Experimental results on benchmark models and datasets demonstrate that the proposed AdaBin is able to achieve state-of-the-art performance. For instance, we obtain a 66.4% Top-1 accuracy on the ImageNet using ResNet-18 architecture, and a 69.4 mAP on PASCAL VOC using SSD300. The PyTorch code is available at \url{https://github.com/huawei-noah/Efficient-Computing/tree/master/BinaryNetworks/AdaBin} and the MindSpore code is available at \url{https://gitee.com/mindspore/models/tree/master/research/cv/AdaBin}.
翻译:本文研究的是 bineral Neural 网络 {Binary Neal 网络 {BNNS}, 其中重量和激活被二进制为 1 比特值 。 因此, 由于现代深层神经网络设计复杂, 且结构结构精密, 重量和激活的多样性非常高 。 因此, 常规信号功能无法很好地用于在 BNNS 中有效地将全面精度值二进化。 为此, 我们提出了一个简单而有效的方法, 叫做 AdaBin, 以适应方式获得最佳的二进制 $b_ 1, b_ 2 美元 (b_ 1, b_ 2\ ) 和计算复杂。 由于现代神经网络网络网络网络网络网络网络网络设计设计精密, 精密设计精密的元和启动。 使用Sqalityality- disality 代码, 我们用一个智能的智能智能智能智能智能智能数据分配方法, 将Sqreality- discoalal 用于Smaria lishal develristal listal distressal distrational distration sess list sal laft liver slaft laft lifol.