FATNN:快速和准确的短期神经网络 (FATNN: Fast and Accurate Ternary Neural Networks)

Ternary Neural Networks (TNNs) have received much attention due to being potentially orders of magnitude faster in inference, as well as more power efficient, than full-precision counterparts. However, 2 bits are required to encode the ternary representation with only 3 quantization levels leveraged. As a result, conventional TNNs have similar memory consumption and speed compared with the standard 2-bit models, but have worse representational capability. Moreover, there is still a significant gap in accuracy between TNNs and full-precision networks, hampering their deployment to real applications. To tackle these two challenges, in this work, we first show that, under some mild constraints, computational complexity of the ternary inner product can be reduced by a factor of 2. Second, to mitigate the performance gap, we elaborately design an implementation-dependent ternary quantization algorithm. The proposed framework is termed Fast and Accurate Ternary Neural Networks (FATNN). Experiments on image classification demonstrate that our FATNN surpasses the state-of-the-arts by a significant margin in accuracy. More importantly, speedup evaluation compared with various precisions is analyzed on several platforms, which serves as a strong benchmark for further research.

翻译：远程神经网络(TNN)由于具有比完全精准的对应方更快速的推论以及更高效的电力,因此受到高度重视,因为其潜在规模可能比完全精准的对应方的推导速度要快得多,但是,需要2位数才能将长期代表制编码成单数,只有3个四分制水平得到杠杆。因此,传统临时网络与标准2位模型的记忆消耗量和速度相似,但代表性能力更差。此外,TNN和全面精密网络之间在准确性方面仍然存在着巨大差距,阻碍了将其部署到真正的应用。为了应对这两项挑战,我们首先显示,在有些轻微的限制下,内部内部产品在计算上的复杂性可以降低2.倍,以缩小性差。第二,我们精心设计了一个与标准2位模型相适应的执行性永久化算法。拟议框架被称为快速和准确的Ternal Neur网络(FATNNNN) 。图像分类实验表明,我们的FATNNNN用一个显著的差幅超越了状态,在精确性上可以进一步分析。更重要的是,与各种精确性基准相比,对各种评估进行了进一步分析。

相关内容

Neural Networks

关注 1651

神经网络（Neural Networks）是世界上三个最古老的神经建模学会的档案期刊:国际神经网络学会(INNS)、欧洲神经网络学会(ENNS)和日本神经网络学会(JNNS)。神经网络提供了一个论坛，以发展和培育一个国际社会的学者和实践者感兴趣的所有方面的神经网络和相关方法的计算智能。神经网络欢迎高质量论文的提交，有助于全面的神经网络研究，从行为和大脑建模，学习算法，通过数学和计算分析，系统的工程和技术应用，大量使用神经网络的概念和技术。这一独特而广泛的范围促进了生物和技术研究之间的思想交流，并有助于促进对生物启发的计算智能感兴趣的跨学科社区的发展。因此，神经网络编委会代表的专家领域包括心理学，神经生物学，计算机科学，工程，数学，物理。该杂志发表文章、信件和评论以及给编辑的信件、社论、时事、软件调查和专利信息。文章发表在五个部分之一:认知科学，神经科学，学习系统，数学和计算分析、工程和应用。官网地址：http://dblp.uni-trier.de/db/journals/nn/

【NeurIPS2020-MIT】子图神经网络，Subgraph Neural Networks

专知会员服务

46+阅读 · 2020年9月28日

如何画出漂亮BERT模型图？这份10页PPT帮你快速搞定，来自Jimmy Lin

专知会员服务

88+阅读 · 2020年7月22日

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

专知会员服务

52+阅读 · 2020年6月1日

【阿尔托大学】图神经网络，Graph Neural Networks，附60页ppt

专知会员服务

184+阅读 · 2020年4月26日