With the emergence of edge computing, the problem of offloading jobs between an Edge Device (ED) and an Edge Server (ES) has received significant attention. Motivated by the fact that an increasing number of applications use Machine Learning (ML) inference, we study the problem of offloading inference jobs by considering the following novel aspects: 1) in contrast to a typical computational job, the processing time of an inference job depends on the size of the ML model, and 2) recently proposed Deep Neural Networks (DNNs) for resource-constrained devices provide the choice of scaling the model size. We formulate an assignment problem with the aim of maximizing the total inference accuracy of n data samples available at the ED, subject to a time constraint T on the makespan. We propose an approximation algorithm, AMR2, and prove that it results in a makespan of at most 2T and achieves a total accuracy that is lower than the optimal total accuracy by a small constant. As a proof of concept, we implemented AMR2 on a Raspberry Pi equipped with MobileNet, connected to a server equipped with ResNet, and studied the total accuracy and makespan performance of AMR2 for an image classification application.
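To make the formulation concrete, the following is a minimal sketch of one plausible statement of the assignment problem; the notation ($x_{ij}$, $a_j$, $t_j$, $t^{\mathrm{tx}}$, $\mathcal{M}_{\mathrm{ED}}$, $\mathcal{M}_{\mathrm{ES}}$) is assumed here for illustration, and the paper's exact variables and constraints may differ.

\[
\begin{aligned}
\max_{x}\quad & \sum_{i=1}^{n} \sum_{j \in \mathcal{M}} a_j \, x_{ij} \\
\text{s.t.}\quad & \sum_{j \in \mathcal{M}} x_{ij} = 1, \qquad i = 1, \dots, n, \\
& \max\Bigl\{ \textstyle\sum_{i}\sum_{j \in \mathcal{M}_{\mathrm{ED}}} t_j \, x_{ij},\ \ \textstyle\sum_{i}\sum_{j \in \mathcal{M}_{\mathrm{ES}}} \bigl(t^{\mathrm{tx}} + t_j\bigr) x_{ij} \Bigr\} \le T, \\
& x_{ij} \in \{0, 1\},
\end{aligned}
\]

where $x_{ij} = 1$ assigns sample $i$ to model $j$, $a_j$ and $t_j$ are the accuracy and per-sample processing time of model $j$ (which grow with model size), $\mathcal{M} = \mathcal{M}_{\mathrm{ED}} \cup \mathcal{M}_{\mathrm{ES}}$ partitions the model choices between the device and the server, $t^{\mathrm{tx}}$ is the per-sample transmission time, and the outer $\max$ reflects that the ED and ES process their assigned samples in parallel, so the makespan is the later of the two completion times.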