Cell-free network is considered as a promising architecture for satisfying more demands of future wireless networks, where distributed access points coordinate with an edge cloud processor to jointly provide service to a smaller number of user equipments in a compact area. In this paper, the problem of uplink beamforming design is investigated for maximizing the long-term energy efficiency (EE) with the aid of deep reinforcement learning (DRL) in the cell-free network. Firstly, based on the minimum mean square error channel estimation and exploiting successive interference cancellation for signal detection, the expression of signal to interference plus noise ratio (SINR) is derived. Secondly, according to the formulation of SINR, we define the long-term EE, which is a function of beamforming matrix. Thirdly, to address the dynamic beamforming design with continuous state and action space, a DRL-enabled beamforming design is proposed based on deep deterministic policy gradient (DDPG) algorithm by taking the advantage of its double-network architecture. Finally, the results of simulation indicate that the DDPG-based beamforming design is capable of converging to the optimal EE performance. Furthermore, the influence of hyper-parameters on the EE performance of the DDPG-based beamforming design is investigated, and it is demonstrated that an appropriate discount factor and hidden layers size can facilitate the EE performance.
翻译:没有细胞的网络被认为是满足未来无线网络更多需求的有希望的架构,在无线网络中,分布式接入点与边缘云处理器协调,共同为紧凑区数目较少的用户设备提供服务;在本文中,在无细胞网络中,利用深度强化学习(DRL),对上链接波形设计问题进行了调查,以最大限度地实现长期能源效率;首先,根据最低平均差错频道估计,利用连续干扰取消信号检测,生成干扰加噪音比率信号;其次,根据SINR的配方,我们界定长期电子电子E,这是波形矩阵的函数。 第三,为解决动态波形设计与连续状态和行动空间相结合的问题,根据深度强化学习(DRL),提议基于深度确定性政策梯度(DPG)算法,利用其双网络结构,对干扰加噪音比率(SINR)的信号表示;第二,根据SINR的信号表示,我们定义长期电子EEE,这是一个功能组合的函数。 第三,为了解决动态波状设计设计设计设计与持续状态和行动空间空间空间空间空间空间,DL,利用基于双网络结构的模拟显示的DG的高级电子磁度,可以对EDDP进行最佳模拟设计影响。