In this paper, the minimization of the weighted sum average age of information (AoI) in a multi-source status update communication system is studied. Multiple independent sources send update packets to a common destination node in a time-slotted manner under the limit of maximum retransmission rounds. Different multiple access schemes, i.e., orthogonal multiple access (OMA) and non-orthogonal multiple access (NOMA) are exploited here over a block-fading multiple access channel (MAC). Constrained Markov decision process (CMDP) problems are formulated to describe the AoI minimization problems considering both transmission schemes. The Lagrangian method is utilised to convert CMDP problems to unconstraint Markov decision process (MDP) problems and corresponding algorithms to derive the power allocation policies are obtained. On the other hand, for the case of unknown environments, two online reinforcement learning approaches considering both multiple access schemes are proposed to achieve near-optimal age performance. Numerical simulations validate the improvement of the proposed policy in terms of weighted sum AoI compared to the fixed power transmission policy, and illustrate that NOMA is more favorable in case of larger packet size.
翻译:在本文中,研究了将多源状态更新通信系统中的信息平均年龄加权和平均年龄(AoI)的最小化问题。多独立来源在最大再传输回合的限制下,以时间分期的方式向共同目的地节点发送更新包。获得不同的多重访问计划,即正方形多重访问(OMA)和非正方形多重访问(NOMA),在这里通过一个块状式多存取通道(MAC)加以利用。在两种传输计划都考虑的情况下,制定了控制式马尔科夫决定程序(CMDP)来描述AoI最小化问题。拉格朗加法用于将CMDP问题转换为非约束性马尔科夫决策过程(MDP)问题和相应的算法,以得出权力分配政策。另一方面,对于未知的环境,提出了两种考虑两种多重访问计划的在线强化学习方法,以达到接近最理想的年龄性能。Numericalimical 模拟证实了在加权AoI与固定电源传输政策相比,拟议的政策在加权总和最大性AMA 中更为有利的规模。