The ubiquitous use of IoT and machine learning applications is creating large amounts of data that require accurate and real-time processing. Although edge-based smart data processing can be enabled by deploying pretrained models, the energy and memory constraints of edge devices necessitate distributed deep learning between the edge and the cloud for complex data. In this paper, we propose a distributed AI system to exploit both the edge and the cloud for training and inference. We propose a new architecture, MEANet, with a main block, an extension block, and an adaptive block for the edge. The inference process can terminate at either the main block, the extension block, or the cloud. The MEANet is trained to categorize inputs into easy/hard/complex classes. The main block identifies instances of easy/hard classes and classifies easy classes with high confidence. Only data with high probabilities of belonging to hard classes would be sent to the extension block for prediction. Further, only if the neural network at the edge shows low confidence in the prediction, the instance is considered complex and sent to the cloud for further processing. The training technique lends to the majority of inference on edge devices while going to the cloud only for a small set of complex jobs, as determined by the edge. The performance of the proposed system is evaluated via extensive experiments using modified models of ResNets and MobileNetV2 on CIFAR-100 and ImageNet datasets. The results show that the proposed distributed model has improved accuracy and energy consumption, indicating its capacity to adapt.
翻译:虽然通过部署预先训练的模型可以实现基于边缘的智能数据处理,但边缘设备的能量和记忆限制要求边缘和云层之间对复杂数据进行分散的深度学习。在本文件中,我们提议一个分布式的人工智能系统,以利用边缘和云层进行培训和推断。我们提议一个新的结构,即Soulet,拥有主块、扩展块和边缘的适应区块。推断过程可以在主块、扩展区块或云层中结束。原始设备经过培训,将输入分为容易/硬/复杂/复杂类。主块确定容易/硬类和容易分类的类别。只有属于硬类的高度概率数据才会被送到扩展区进行预测。此外,只有在边缘的神经网络显示对预测信心低的情况下,才认为这个实例复杂,并被发送到云层中进行进一步处理。培训技术将输入成容易/硬/硬/复合类/复合类。主块确定了容易的类别,并且非常自信地分类的分类。在对模型中,通过模型的精细度进行模拟,只能将模型的精细的图像进行修改。