We introduce Tuna, a static analysis approach to optimizing deep neural network programs. The optimization of tensor operations such as convolutions and matrix multiplications is the key to improving the performance of deep neural networks. Many deep learning model optimization mechanisms today use dynamic analysis, which relies on experimental execution on a target device to build a data-driven cost model of the program. The reliance on dynamic profiling not only requires access to the target hardware at compilation time but also incurs a significant cost in machine resources. We introduce an approach that profiles the program by constructing features based on the characteristics of the target hardware. We use static analysis of the relative performance of tensor operations to optimize the deep learning program. Experiments show that our approach can achieve up to 11x the performance of dynamic-profiling-based methods with the same compilation time.
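To make the idea concrete, the sketch below shows one way a static, hardware-feature-based cost model can rank candidate tensor operations without executing them: it predicts a roofline-style time bound from operator shapes and hardware parameters alone, and only the relative ordering of candidates matters. The `Hardware` and `MatMul` types, the cost formula, and `rank_candidates` are our own illustrative assumptions, not Tuna's actual feature set or model.

```python
# Hypothetical sketch of static, hardware-feature-based cost ranking;
# the types and formulas here are illustrative, not Tuna's actual model.
from dataclasses import dataclass

@dataclass
class Hardware:
    peak_flops: float      # peak compute throughput, FLOP/s
    mem_bandwidth: float   # memory bandwidth, bytes/s

@dataclass(frozen=True)
class MatMul:
    m: int
    n: int
    k: int
    dtype_bytes: int = 4   # e.g. fp32 operands

def static_cost(op: MatMul, hw: Hardware) -> float:
    """Roofline-style time estimate derived purely from shapes (no execution)."""
    flops = 2.0 * op.m * op.n * op.k
    # Bytes moved for the two inputs and the output, ignoring cache reuse.
    traffic = op.dtype_bytes * (op.m * op.k + op.k * op.n + op.m * op.n)
    # The op is bound by whichever resource it saturates first.
    return max(flops / hw.peak_flops, traffic / hw.mem_bandwidth)

def rank_candidates(candidates, hw):
    """Order candidate operators by predicted relative cost, cheapest first."""
    return sorted(candidates, key=lambda op: static_cost(op, hw))

if __name__ == "__main__":
    hw = Hardware(peak_flops=10e12, mem_bandwidth=500e9)
    candidates = [MatMul(1024, 1024, 1024), MatMul(2048, 2048, 256)]
    best = rank_candidates(candidates, hw)[0]
    print(best, static_cost(best, hw))
```

Because the estimate never touches the device, the ranking can run at compile time without target hardware, which is the property the abstract contrasts with dynamic-profiling-based cost models.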