With the popularity of Internet of Things (IoT), edge computing and cloud computing, more and more stream analytics applications are being developed including real-time trend prediction and object detection on top of IoT sensing data. One popular type of stream analytics is the recurrent neural network (RNN) deep learning model based time series or sequence data prediction and forecasting. Different from traditional analytics that assumes data to be processed are available ahead of time and will not change, stream analytics deals with data that are being generated continuously and data trend/distribution could change (aka concept drift), which will cause prediction/forecasting accuracy to drop over time. One other challenge is to find the best resource provisioning for stream analytics to achieve good overall latency. In this paper, we study how to best leverage edge and cloud resources to achieve better accuracy and latency for RNN-based stream analytics. We propose a novel edge-cloud integrated framework for hybrid stream analytics that support low latency inference on the edge and high capacity training on the cloud. We study the flexible deployment of our hybrid learning framework, namely edge-centric, cloud-centric and edge-cloud integrated. Further, our hybrid learning framework can dynamically combine inference results from an RNN model pre-trained based on historical data and another RNN model re-trained periodically based on the most recent data. Using real-world and simulated stream datasets, our experiments show the proposed edge-cloud deployment is the best among all three deployment types in terms of latency. For accuracy, the experiments show our dynamic learning approach performs the best among all learning approaches for all three concept drift scenarios.
翻译:随着Tings(IoT)、边缘计算和云计算互联网的普及,越来越多的流式分析应用正在开发,包括实时趋势预测和在IoT感知数据顶部检测目标,流式分析的一种流行类型是经常性神经网络(RNN)深学习模型基于时间序列或序列数据预测和预测。与假定数据将处理的传统分析方法不同,可以提前且不会改变,流式分析处理正在不断生成的数据,数据趋势/分布可能改变(Aka概念漂移),这将导致预测/预测边缘准确性逐渐下降。另一个挑战是为流式分析找到最佳资源供给,以达到良好的总体延缓度。在本论文中,我们研究如何最佳利用边缘和云性资源,提高RNNT的流前导分析的准确度和耐久度。我们提出了一个新的低位模型化综合框架,为混合流分析所有混合分析方法提供了一个新的边缘值综合框架,支持边端和高能力培训中的所有动态潜值。我们从最近的云层和高空级中学习了最深层的周期性、最核心数据框架的灵活性,我们学习框架将展示。