Many deep learning applications are intended to run on mobile devices, where both accuracy and inference time matter. Although the number of FLOPs is commonly used as a proxy for neural network latency, it is often a poor one. To obtain a better approximation, the research community uses look-up tables containing the latency of every possible layer, so that the total inference time on a mobile CPU can be predicted from only a small number of measurements. Unfortunately, this method is not directly applicable to mobile GPUs and shows low precision there. In this work, we treat latency approximation on mobile GPUs as a data- and hardware-specific problem. Our main goal is to construct a convenient Latency Estimation Tool for Investigation (LETI) of neural network inference, and to build robust and accurate latency prediction models for each specific task. To achieve this goal, we build open-source tools that provide a convenient way to conduct massive experiments on different target devices, with a focus on mobile GPUs. After collecting a dataset of measurements, we fit a regression model on the experimental data and use it for subsequent latency prediction and analysis. We experimentally demonstrate the applicability of this approach on a subset of the popular NAS-Bench-101 dataset and also evaluate the most popular neural network architectures on two mobile GPUs. As a result, we construct a latency prediction model with good precision on the target evaluation subset. We consider LETI a useful tool for neural architecture search and massive latency evaluation. The project is available at https://github.com/leti-ai
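The two estimation strategies contrasted above can be sketched as follows. This is a minimal illustrative sketch, not LETI's actual API: the table keys, layer features, and function names are all assumptions made for the example. It shows (a) the CPU-style per-layer look-up table, which sums pre-measured layer latencies, and (b) a simple learned regression that maps layer features to measured latency, in the spirit of the data-driven approach the work proposes for mobile GPUs.

```python
# Hypothetical sketch of two latency-estimation approaches:
# (a) a per-layer look-up table, as commonly used for mobile CPUs, and
# (b) a regression model fit on measured data, as proposed for mobile GPUs.
# All names, keys, and features here are illustrative assumptions.
import numpy as np

# (a) Look-up table: measured latency (ms) per layer configuration.
# Key format (assumed): (op type, channels, spatial size).
LATENCY_TABLE = {
    ("conv3x3", 32, 112): 1.8,
    ("conv1x1", 64, 56): 0.6,
    ("pool", 64, 56): 0.2,
}

def table_latency(layers):
    """Predict network latency as the sum of per-layer table entries."""
    return sum(LATENCY_TABLE[layer] for layer in layers)

# (b) Regression: fit a least-squares linear model from layer features
# (e.g. FLOPs, parameter count) to measured end-to-end latency.
def fit_latency_regressor(features, measured_ms):
    X = np.asarray(features, dtype=float)
    X = np.hstack([X, np.ones((len(X), 1))])  # append bias column
    w, *_ = np.linalg.lstsq(X, np.asarray(measured_ms, dtype=float), rcond=None)
    return w

def predict_latency(w, feature_row):
    x = np.append(np.asarray(feature_row, dtype=float), 1.0)
    return float(x @ w)

# Toy measurements: features = (MFLOPs, parameters in thousands).
feats = [(10, 5), (20, 9), (40, 20), (80, 41)]
times = [1.1, 2.0, 4.1, 8.2]
w = fit_latency_regressor(feats, times)
```

In practice the regression would be trained per target device on a large set of measured architectures, which is exactly what makes it hardware-specific.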