1美元/e美元的故事:1美元/e美元:ALOHA基于ALOHA的和基于加强-学习-基于学习-随机访问,以便进行延迟受限制的通信 (The Story of $1/e$: ALOHA-based and Reinforcement-Learning-based Random Access for Delay-Constrained Communications)

Motivated by the proliferation of real-time applications in multimedia communication systems, tactile Internet, networked controlled systems, and cyber-physical systems, supporting delay-constrained traffic becomes critical for such systems. In delay-constrained traffic, each packet has a hard deadline; when it is not delivered before its deadline is up, it becomes useless and will be removed from the system. In this work, we focus on designing the random access schemes for delay-constrained wireless communications. We first investigate three ALOHA-based random access schemes and prove that the system timely throughput of all three schemes under corresponding optimal transmission probabilities asymptotically converges to $1/e$, same as the well-know throughput limit for delay-unconstrained ALOHA systems. The fundamental reason why ALOHA-based schemes cannot achieve asymptotical system timely throughput beyond $1/e$ is that all active ALOHA stations access the channel with the same probability in any slot. Therefore, to go beyond $1/e$, we have to differentiate active stations' transmission probabilities to reduce the competition level. However, since all stations work in a distributed manner without any coordination, we should deploy the same policy (i.e., same piece of codes in practice) to all stations under which they will automatically differentiate their transmission probabilities. Toward that end, we propose a Reinforcement-Learning-based Random Access scheme for Delay-Constrained communications, called RLRA-DC, under which different stations collaboratively attain different transmission probabilities by only interacting with the access point. Our numerical result shows that the system timely throughput of RLRA-DC can be as high as 0.8 for tens of stations and can still reach 0.6 even for thousands of stations, much larger than $1/e$.

翻译：由于多媒体通信系统、触摸式互联网、网络控制系统和网络物理系统中实时应用的激增,支持受延迟限制的流量对于这些系统变得至关重要。在受延迟限制的流量中,每包都有一个困难的最后期限;当在最后期限到期之前未交付时,每包将变得毫无用处,并将从系统中删除。在这项工作中,我们侧重于设计随机访问计划,以延迟限制的无线通信。我们首先调查三个基于ALOHA的随机访问计划,并证明该系统在相应的最佳传输概率下及时通过所有三种计划。在同样的最佳传输概率下,我们不得不及时通过1美元/美元来完成所有三个计划的传输速度。在同步的流量中,我们不得不将活动站的传输概率与1美元/美元相近,同时在透明性汇率下,所有基于ALOHA的系统无法及时实现无干扰的流量,因此所有运行站都会在透明性政策下,在透明性规则下进行部署。我们所有运行的所有运行站都将在透明性政策下,所有运行一个相同的系统。