Deep neural networks (DNNs) have been applied successfully across a wide range of applications, but they require large amounts of memory and computational power, which severely restricts their deployment on resource-limited devices. To address this issue, much effort has gone into training low-bit weight DNNs. In this paper, we focus on training ternary weight $\{-1, 0, +1\}$ networks, which avoid multiplications and dramatically reduce memory and computation requirements. A ternary weight network can be viewed as a sparser version of its binary weight counterpart, obtained by replacing some of the $-1$s or $+1$s in the binary weights with 0s; this leads to more efficient inference at the price of a higher memory cost. However, existing approaches to training ternary weight networks cannot control the sparsity (i.e., the percentage of 0s) of the ternary weights, which undermines the advantage of ternary weights. We propose what is, to the best of our knowledge, the first sparsity-control approach (SCA) to training ternary weight networks, achieved simply through a weight discretization regularizer (WDR). SCA differs from all existing regularizer-based approaches in that it controls the sparsity of the ternary weights through a controller $\alpha$ and does not rely on gradient estimators. We show theoretically and empirically that the sparsity of the trained ternary weights is positively related to $\alpha$. SCA is extremely simple and easy to implement, and it is shown to consistently and significantly outperform state-of-the-art approaches on several benchmark datasets, even matching the performance of its full-precision weight counterparts.
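To make the two properties mentioned above concrete, the following minimal sketch shows how a matrix-vector product with weights in $\{-1, 0, +1\}$ reduces to additions and subtractions, and how sparsity (the fraction of zero weights) can be measured. It is an illustration only: the function names `ternary_matvec` and `sparsity` are hypothetical, and the code does not implement SCA or the WDR, whose exact form is not given in this abstract.

```python
def ternary_matvec(W, x):
    """Multiply a ternary matrix W (entries in {-1, 0, +1}) by a vector x.

    No multiplication is performed: each weight either adds the input,
    subtracts it, or skips it entirely.
    """
    out = []
    for row in W:
        acc = 0.0
        for w, xi in zip(row, x):
            if w == 1:
                acc += xi      # +1 weight: add the input
            elif w == -1:
                acc -= xi      # -1 weight: subtract the input
            # 0 weight: contributes nothing, so the input is skipped
        out.append(acc)
    return out


def sparsity(W):
    """Return the fraction of entries in W that are exactly zero."""
    total = sum(len(row) for row in W)
    zeros = sum(1 for row in W for w in row if w == 0)
    return zeros / total


# Illustrative (made-up) ternary weights and input.
W = [[1, 0, -1],
     [0, 0, 1]]
x = [2.0, 3.0, 5.0]

y = ternary_matvec(W, x)
s = sparsity(W)
```

Every zero weight removes one addition or subtraction from the inner loop, which is why higher sparsity translates into cheaper inference.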