最优化端对端有线电视新闻网加速使用全寿命成本 (Optimizing the Whole-life Cost in End-to-end CNN Acceleration)

The acceleration of CNNs has gained increasing atten-tion since their success in computer vision. With the heterogeneous functional layers that cannot be pro-cessed by the accelerators proposed for convolution layers only, modern end-to-end CNN acceleration so-lutions either transform the diverse computation into matrix/vector arithmetic, which loses data reuse op-portunities in convolution, or introduce dedicated functional unit to each kind of layer, which results in underutilization and high update expense. To enhance the whole-life cost efficiency, we need an acceleration solution that is efficient in processing CNN layers and has the generality to apply to all kinds of existing and emerging layers. To this end, we pro-pose GCONV Chain, a method to convert the entire CNN computation into a chain of standard general convolutions (GCONV) that can be efficiently pro-cessed by the existing CNN accelerators. This paper comprehensively analyzes the GCONV Chain model and proposes a full-stack implementation to support GCONV Chain. On one hand, the results on seven var-ious CNNs demonstrate that GCONV Chain improves the performance and energy efficiency of existing CNN accelerators by an average of 3.4x and 3.2x re-spectively. On the other hand, we show that GCONV Chain provides low whole-life costs for CNN accelera-tion, including both developer efforts and total cost of ownership for the users.

翻译：CNN的加速率自计算机愿景成功以来不断提高。由于只为进化层提议的加速器无法推动不同功能层,现代端对端CNN加速度的解决方案要么将多种计算转换成矩阵/矢量计算,从而在进化中丧失数据再利用机会,要么向每种层引入专门的功能单位,从而导致利用不足和高更新费用。为了提高整个寿命成本效率,我们需要一种高效处理CNN层的加速解决方案,并具有适用于所有现有和新兴层的通用性。为此,我们将GCONV链作为将CNN整个计算转换成标准总体演算链(GCONSV)的一种方法,这一方法可以有效地被现有的CNNAA加速器加速器取代。本文全面分析GCONV链模式,并提议全面实施支持GCONV链。一方面,7个具有反向型的CNNCNCN的系统总成本和CONCAR(包括GCNCAR)的当前平均成本和成本效率。