The main challenge to deploy deep neural network (DNN) over a mobile edge network is how to split the DNN model so as to match the network architecture as well as all the nodes' computation and communication capacity. This essentially involves two highly coupled procedures: model generating and model splitting. In this paper, a joint model split and neural architecture search (JMSNAS) framework is proposed to automatically generate and deploy a DNN model over a mobile edge network. Considering both the computing and communication resource constraints, a computational graph search problem is formulated to find the multi-split points of the DNN model, and then the model is trained to meet some accuracy requirements. Moreover, the trade-off between model accuracy and completion latency is achieved through the proper design of the objective function. The experiment results confirm the superiority of the proposed framework over the state-of-the-art split machine learning design methods.
翻译:在移动边缘网络上部署深神经网络(DNN)的主要挑战是如何将DNN模型分割开来,以便与网络结构以及所有节点计算和通信能力相匹配,这基本上涉及两个高度配合的程序:模型生成和模型分割。在本文件中,提议建立一个联合模型分裂和神经结构搜索框架,以便在移动边缘网络上自动生成和部署一个DNN模型。考虑到计算和通信资源的限制,设计了一个计算图搜索问题,以找到DNN模型的多点,然后对模型进行培训,以满足某些准确性要求。此外,模型准确性和完成时间的平衡是通过正确设计客观功能来实现的。实验结果证实拟议框架优于最先进的分离机器学习方法。