This paper considers a setting where embedded devices are used to acquire and classify images. Because of limited computing capacity, embedded devices rely on a parsimonious classification model with uneven accuracy. When local classification is deemed inaccurate, devices can decide to offload the image to an edge server with a more accurate but resource-intensive model. Resource constraints, e.g., network bandwidth, however, require regulating such transmissions to avoid congestion and high latency. The paper investigates this offloading problem when transmissions regulation is through a token bucket, a mechanism commonly used for such purposes. The goal is to devise a lightweight, online offloading policy that optimizes an application-specific metric (e.g., classification accuracy) under the constraints of the token bucket. The paper develops a policy based on a Deep Q-Network (DQN), and demonstrates both its efficacy and the feasibility of its deployment on embedded devices. Of note is the fact that the policy can handle complex input patterns, including correlation in image arrivals and classification accuracy. The evaluation is carried out by performing image classification over a local testbed using synthetic traces generated from the ImageNet image classification benchmark. Implementation of this work is available at https://github.com/qiujiaming315/edgeml-dqn.
翻译:由于计算能力有限,嵌入装置依赖于不均匀的分类模型。当本地分类被认为不准确时,装置可以决定用更准确但资源密集的模式将图像卸载到边缘服务器上。例如,网络带宽要求管理这种传输,以避免拥堵和高悬浮。由于计算能力有限,嵌入装置依赖一种不均匀的分类模型。当传输监管是通过一个象征性桶(一种通常用于此类目的的机制)时,嵌入装置会调查这种卸载问题。目标是在象征性桶的限制下设计一种轻量、在线卸载政策,优化具体应用指标(如分类精度)。该文件根据深Q-Network(DQN)制定了政策,并表明其效力和在嵌入装置上部署的可行性。值得注意的是,该政策可以处理复杂的输入模式,包括图像到达和分类准确性方面的关联性。评估是通过利用图像网图像基准生成的合成痕迹对本地测试床进行图像分类(例如分类精度)。