Artificial Intelligence has now taken centre stage in the smartphone industry owing to the need of bringing all processing close to the user and addressing privacy concerns. Convolution Neural Networks (CNNs), which are used by several AI applications, are highly resource and computation intensive. Although new generation smartphones come with AI-enabled chips, minimal memory and energy utilisation is essential as many applications are run concurrently on a smartphone. In light of this, optimising the workload on the smartphone by offloading a part of the processing to a cloud server is an important direction of research. In this paper, we analyse the feasibility of splitting CNNs between smartphones and cloud server by formulating a multi-objective optimisation problem that optimises the end-to-end latency, memory utilisation, and energy consumption. We design SmartSplit, a Genetic Algorithm with decision analysis based approach to solve the optimisation problem. Our experiments run with multiple CNN models show that splitting a CNN between a smartphone and a cloud server is feasible. The proposed approach, SmartSplit fares better when compared to other state-of-the-art approaches.
翻译:由于需要使所有处理都接近用户并解决隐私问题,人工智能产业现已成为人造智能智能产业的核心。由多个AI应用程序使用的进化神经网络(CNNs)具有高度的资源和计算密集性。尽管新一代智能手机带有AI驱动芯片,但最小记忆和能源利用至关重要,因为许多应用程序同时在智能手机上运行。鉴于此,通过将部分处理程序卸到云服务器来优化智能手机的工作量是一个重要的研究方向。在本文中,我们分析将智能手机和云服务器分割为智能手机和云服务器的可行性,方法是提出多目标的优化问题,使终端到终端的拉通性、记忆利用和能源消耗更加优化。我们设计了基于决策分析的SmartSplit,一个基于决定分析方法的遗传Algorithm,以解决优化问题。我们用多个CNN模型进行的实验显示,将CNN在智能手机和云服务器之间分裂是可行的。拟议的方法是SmartSplit farfore,与其他状态方法相比,智能SmartSplit freeflets fores forth。