Random access (RA) schemes are a topic of high interest in machine-type communication (MTC). In RA protocols, backoff techniques such as exponential backoff (EB) are used to stabilize the system to avoid low throughput and excessive delays. However, these backoff techniques show varying performance for different underlying assumptions and analytical models. Therefore, finding a better transmission policy for slotted ALOHA RA is still a challenge. In this paper, we show the potential of deep reinforcement learning (DRL) for RA. We learn a transmission policy that balances between throughput and fairness. The proposed algorithm learns transmission probabilities using previous action and binary feedback signal, and it is adaptive to different traffic arrival rates. Moreover, we propose average age of packet (AoP) as a metric to measure fairness among users. Our results show that the proposed policy outperforms the baseline EB transmission schemes in terms of throughput and fairness.
翻译:随机访问(RA)计划是一个对机器类型通信(MTC)非常感兴趣的话题。在RA协议中,使用指数回流(EB)等后退技术稳定系统,以避免低吞量和过度拖延。然而,这些回流技术显示不同基本假设和分析模型的性能不同。因此,为已排定档期的ALOHA RA寻找更好的传输政策仍是一项挑战。在本文件中,我们展示了深入强化学习(DRL)对RA的潜力。我们学习了一种平衡吞吐量和公平之间的传输政策。拟议的算法利用先前的行动和二进制反馈信号来学习传输概率,并且适应不同的交通抵达率。此外,我们提出了衡量用户公平性的标准是包的平均年龄。我们的结果显示,拟议的政策在吞吐量和公平性方面超过了基线的EB传输计划。