A novel LEarning-based Spectrum Sensing and Access (LESSA) framework is proposed, wherein a cognitive radio (CR) learns a time-frequency correlation model underlying spectrum occupancy of licensed users (LUs) in a radio ecosystem; concurrently, it devises an approximately optimal spectrum sensing and access policy under sensing constraints. A Baum-Welch algorithm is proposed to learn a parametric Markov transition model of LU spectrum occupancy based on noisy spectrum measurements. Spectrum sensing and access are cast as a Partially-Observable Markov Decision Process, approximately optimized via randomized point-based value iteration. Fragmentation, Hamming-distance state filters and Monte-Carlo methods are proposed to alleviate the inherent computational complexity, and a weighted reward metric to regulate the trade-off between CR throughput and LU interference. Numerical evaluations demonstrate that LESSA performs within 5 percent of a genie-aided upper bound with foreknowledge of LU spectrum occupancy, and outperforms state-of-the-art algorithms across the entire trade-off region: 71 percent over correlation-based clustering, 26 percent over Neyman-Pearson detection, 6 percent over the Viterbi algorithm, and 9 percent over an adaptive Deep Q-Network. LESSA is then extended to a distributed Multi-Agent setting (MA-LESSA), by proposing novel neighbor discovery and channel access rank allocation. MA-LESSA improves CR throughput by 43 percent over cooperative TD-SARSA, 84 percent over cooperative greedy distributed learning, and 3x over non-cooperative learning via g-statistics and ACKs. Finally, MA-LESSA is implemented on the DARPA SC2 platform, manifesting superior performance over competitors in a real-world TDWR-UNII WLAN emulation; its implementation feasibility is further validated on a testbed of ESP32 radios, exhibiting 96 percent success probability.
翻译:提议了一个基于LEARIN的新型LEARIN Spectration and Access(LESSA)框架,在这个框架中,一个认知性无线电(CR)学习一个时间-频率相关模型,该模型是无线电生态系统中许可用户(LUs)的频谱占用率基础;同时,它设计了一个在感应限制下,大致最佳的频谱遥感和访问政策。一个 Baum-Welch算法,以学习基于噪音频谱测量的LU频谱占用率参数Markov过渡模型。 光谱感和访问被作为部分可观测的Markov 决策程序,通过随机化点基基点的值复制值复制,大约优化了时间-时间-时间关系关系模型模型(CRCRR)在光谱占用率上的比例。 提议进行分解、哈明州级(Hamminginginginging)和蒙特卡尔-Carload-Carloool-SALs-SALs-SAL-SAL-SAL-SAL-SAL-SAL-SAL-SAL-SAL-SAL-SAL-SAL-SAL-SAL-SAL-SAL-SAL-SAL-SAL-SAL-SAL-SAL-SAL-SAL-SAL-SAL-SAL-SAL-SAL-SA-SA-SA-S-S-SA-S-S-I-I-I-SD-SB-I-SD-I-I-SD-I-S-SLVI-SD-I-S-S-S-S-S-I-I-I-I-I-I-SD-I-I-I-I-I-I-I-I-I-I-I-I-I-I-SD-I-I-I-I-I-SD-SD-SD-I-I-SD-SD-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-