Emerging Internet of Things (IoT) and mobile computing applications are expected to support latency-sensitive deep neural network (DNN) workloads. To realize this vision, the Internet is evolving towards an edge-computing architecture, where computing infrastructure is located closer to the end device to help achieve low latency. However, edge computing may have limited resources compared to cloud environments and thus, cannot run large DNN models that often have high accuracy. In this work, we develop REACT, a framework that leverages cloud resources to execute large DNN models with higher accuracy to improve the accuracy of models running on edge devices. To do so, we propose a novel edge-cloud fusion algorithm that fuses edge and cloud predictions, achieving low latency and high accuracy. We extensively evaluate our approach and show that our approach can significantly improve the accuracy compared to baseline approaches. We focus specifically on object detection in videos (applicable in many video analytics scenarios) and show that the fused edge-cloud predictions can outperform the accuracy of edge-only and cloud-only scenarios by as much as 50%. We also show that REACT can achieve good performance across tradeoff points by choosing a wide range of system parameters to satisfy use-case specific constraints, such as limited network bandwidth or GPU cycles.
翻译:新兴事物互联网( IoT) 和移动计算应用程序预计将支持对长期敏感的深神经网络( DNN)工作量。 为了实现这一愿景,互联网正在向边缘计算结构演变,即计算机基础设施离终端设备更近的边缘计算结构,以帮助实现低潜伏。然而,与云层环境相比,边缘计算可能资源有限,因此无法运行通常具有高度准确性的大型 DNN模型。在这项工作中,我们开发了REACT,这是一个利用云源资源实施大型 DNN模型的框架,其精度更高,以提高边缘设备运行模型的准确性。为此,我们提出了一个新的边缘组合算法,它将连接边缘和云预测,实现低潜伏和高准确性。我们广泛评价了我们的方法,并表明我们的方法能够大大提高与基线方法相比的准确性。我们特别侧重于视频中的物体探测( 在许多视频分析假设中适用 ), 并显示电磁极预测会超过边缘和云度假设的准确性。为了提高边缘和云度模型在边缘设备上的精确性。为了做到这一点,我们提出了一个新的边缘和云状组合组合拼拼拼拼拼拼组合算算算算算算算算算算算算算算法,我们还选择了宽度范围范围的系统可以实现特定的系统, 。