In this paper, we propose a joint radio and core resource allocation framework for NFV-enabled networks. In the proposed system model, the goal is to maximize energy efficiency (EE), by guaranteeing end-to-end (E2E) quality of service (QoS) for different service types. To this end, we formulate an optimization problem in which power and spectrum resources are allocated in the radio part. In the core part, the chaining, placement, and scheduling of functions are performed to ensure the QoS of all users. This joint optimization problem is modeled as a Markov decision process (MDP), considering time-varying characteristics of the available resources and wireless channels. A soft actor-critic deep reinforcement learning (SAC-DRL) algorithm based on the maximum entropy framework is subsequently utilized to solve the above MDP. Numerical results reveal that the proposed joint approach based on the SAC-DRL algorithm could significantly reduce energy consumption compared to the case in which R-RA and NFV-RA problems are optimized separately.
翻译:在本文件中,我们提议为NFV驱动的网络建立一个联合无线电和核心资源分配框架;在拟议的系统模型中,目标是通过保证不同服务类型的服务端到端质量(E2E),最大限度地提高能源效率(EEE);为此,我们提出了一个优化问题,在无线电部分分配电力和频谱资源;在核心部分,将功能的链条、定位和时间安排用于确保所有用户的QOS;这一联合优化问题以Markov决定程序(MDP)为模型,考虑到现有资源和无线频道的时间变化特点;随后,利用基于最大通气框架的软性行为者-丙型深度强化学习(SAC-DRL)算法(SAC-DRL)算法(SAC-DRL算法)来解决上述MDP。