Leveraging recent advances on mobile edge computing (MEC), edge intelligence has emerged as a promising paradigm to support mobile artificial intelligent (AI) applications at the network edge. In this paper, we consider the AI service placement problem in a multi-user MEC system, where the access point (AP) places the most up-to-date AI program at user devices to enable local computing/task execution at the user side. To fully utilize the stringent wireless spectrum and edge computing resources, the AP sends the AI service program to a user only when enabling local computing at the user yields a better system performance. We formulate a mixed-integer non-linear programming (MINLP) problem to minimize the total computation time and energy consumption of all users by jointly optimizing the service placement (i.e., which users to receive the program) and resource allocation (on local CPU frequencies, uplink bandwidth, and edge CPU frequency). To tackle the MINLP problem, we derive analytical expressions to calculate the optimal resource allocation decisions with low complexity. This allows us to efficiently obtain the optimal service placement solution by search-based algorithms such as meta-heuristic or greedy search algorithms. To enhance the algorithm scalability in large-sized networks, we further propose an ADMM (alternating direction method of multipliers) based method to decompose the optimization problem into parallel tractable MINLP subproblems. The ADMM method eliminates the need of searching in a high-dimensional space for service placement decisions and thus has a low computational complexity that grows linearly with the number of users. Simulation results show that the proposed algorithms perform extremely close to the optimum and significantly outperform the other representative benchmark algorithms.
翻译:利用移动边缘计算(MEC)的最新进步, 边缘情报已成为支持网络边缘移动人工智能(AI)应用的有希望的范例。 在本文中, 我们考虑在多用户 MEC 系统中的AI 服务配置问题, 进入点( AP) 将最新的 AI 程序放在用户设备上, 以便在用户方面进行本地计算/ 任务执行。 为了充分利用严格的无线频谱和边端计算资源, AP 只有当用户在本地计算能够产生更好的系统性能时, 才会将 AI 服务程序发送给用户。 我们设计了一个混合的 Interger 非线性程序( MINLP) 问题, 以便通过联合优化服务配置( AP) ( 用户接收程序) 和资源分配( 本地CPU 频率、 上线性带宽带宽和边端 CPU 频率) 。 为了解决 MINLP 问题, 我们从分析表达出最合适的服务配置解决方案, 通过基于搜索的算法( 例如, 内空基的内置低精度( MIM ) 的精度的精度计算方法的精度的精度计算方法), 将更细的精度排序的精度的精度排序的精度搜索系统进行更细的精度排序的精度排序的精度排序的精度排序的精度排序的精度的精度的精度排序的精度, 。