Motivated by the proliferation of real-time applications in multimedia communication systems, tactile Internet, and cyber-physical systems, supporting delay-constrained traffic becomes critical for such systems. In delay-constrained traffic, each packet has a hard deadline; when it is not delivered before its deadline is up, it becomes useless and will be removed from the system. In this work, we focus on designing random access schemes for delay-constrained wireless communications. We first investigate three ALOHA-based schemes and prove that the system timely throughput of all three schemes under corresponding optimal transmission probabilities asymptotically converges to $1/e$, same as the well-known throughput limit for delay-unconstrained ALOHA systems. The fundamental reason why ALOHA-based schemes cannot achieve asymptotical system timely throughput beyond $1/e$ is that all active ALOHA stations access the channel with the same probability in any slot. To go beyond $1/e$, we propose a reinforcement-learning-based scheme for delay-constrained wireless communications, called RLRA-DC, under which different stations collaboratively attain different transmission probabilities by only interacting with the access point. Our numerical result shows that the system timely throughput of RLRA-DC can be as high as 0.8 for tens of stations and can still reach 0.6 even for thousands of stations, much larger than $1/e$.
翻译:由于多媒体通信系统、触动式互联网和网络物理系统中实时应用的激增,支持受延迟限制的通信系统成为这类系统的关键。在受延迟限制的通信中,每包都有一个困难的最后期限;在最后期限到期之前未交付时,每包就会变得无用,并将从系统中删除。在这项工作中,我们侧重于为受延迟限制的无线通信设计随机访问计划。我们首先调查三个基于ALOHA的计划,并证明该系统在相应的最佳传输概率下,所有三种计划都及时通过投入,其最佳传输概率必然达到1美元/美元。在延迟不受限制的ALOHA系统中,每包都有众所周知的截量限制。为什么ALOHA计划不能在最后期限到期前交付到时,它将失去效用,而将失去作用,因为所有活跃ALOHA的台站在任何时间段都有可能以同样的可能性进入频道。要超过1美元/e美元,我们提议一个基于强化学习的延迟受限制的无线通信计划,甚至称为RIRA-DC,在这个系统下,不同站通过高额的交付率,通过我们高额的系统能够通过高额交付。