We consider a radio resource management (RRM) problem in a multi-user wireless network, where the goal is to optimize a network-wide utility function subject to constraints on the ergodic average performance of users. We propose a state-augmented parameterization for the RRM policy, where alongside the instantaneous network states, the RRM policy takes as input the set of dual variables corresponding to the constraints. We provide theoretical justification for the feasibility and near-optimality of the RRM decisions generated by the proposed state-augmented algorithm. Focusing on the power allocation problem with RRM policies parameterized by a graph neural network (GNN) and dual variables sampled from the dual descent dynamics, we numerically demonstrate that the proposed approach achieves a superior trade-off between mean, minimum, and 5th percentile rates than baseline methods.
翻译:我们认为,在多用户无线网络中,无线电资源管理是一个问题,其目标是优化全网络的公用事业功能,但受用户平均性能的限制。 我们提出了RRM政策的国家强化参数化,与即时网络国家一道,RRM政策将一系列与限制相对应的双重变量作为投入。 我们为拟议国家启动算法产生的RRM决定的可行性和接近最佳性提供了理论依据。 我们侧重于由图形神经网络(GNN)和从双源动态中抽样的双重变量参数化的RRM政策的权力分配问题,我们从数字上证明,拟议方法在平均、最低和5%比率之间实现了优于基线方法的权衡。