With the surge of inexpensive computational and memory resources, neural networks (NNs) have experienced unprecedented growth in architectural and computational complexity. Bringing NNs to resource-constrained devices enables cost-efficient deployments, widespread availability, and the preservation of sensitive data. This work addresses the challenges of bringing Machine Learning to microcontroller units (MCUs), where we focus on the ubiquitous ARM Cortex-M architecture. The detailed effects and trade-offs that optimization methods, software frameworks, and MCU hardware architecture have on key performance metrics such as inference latency and energy consumption have not previously been studied in depth for state-of-the-art frameworks such as TensorFlow Lite Micro. We find that empirical investigations which measure the perceptible metrics, i.e., performance as experienced by the user, are indispensable, as the impact of specialized instructions and layer types can be subtle. To this end, we propose implementation-aware design as a cost-effective method for verification and benchmarking. Employing our developed toolchain, we demonstrate how existing NN deployments on resource-constrained devices can be improved by systematically optimizing NNs for their targeted application scenario.