A multi-armed bandit (MAB)-based decentralized channel exploration framework both adapting unknown traffics of neighboring access points (APs) and ensuring convergence is proposed. As the throughput provided by a typical AP in wireless local area network (WLAN) is significantly affected by neighboring APs' channels due to carrier sense operations, the neighbor awareness, i.e., being aware of channels of neighboring APs, is valuable. The main scope of this paper is to incorporate this neighbor awareness into an MAB-based channel exploration as conventional MAB-based WLAN channel exploration schemes lacks this perspective. To this end, we propose contention-driven feature extraction (CDFE), which extracts the adjacency relation of a contention graph. This allows to formulate the traffic-adaptive channel exploration as contextual MAB (CMAB) problem with joint linear upper confidence bound (JLinUCB) exploration where the graph edge of the feature is leveraged as the weights of a linear throughput estimator. Moreover, we address the problem of non-convergence -- the channel exploration cycle -- which is an inherent difficulty in selfish decentralized learning. To prevent such a cycle, we propose a penalized JLinUCB (P-JLinUCB) based on the key idea of introducing a discount parameter to the reward for exploiting a different channel before and after the learning round.
翻译:多武装土匪(MAB)基于多武装的分散式信道勘探框架(MAB)基于多武装的河道勘探框架(MAB)基于多武装的河道勘探框架(MAB)既适应邻近接入点的未知流量,又确保汇合。由于承运人的感知操作,一个典型的AP在无线局域网网络中提供的投入因邻近的通道而受到邻近的通道的影响很大,邻居的认识,即了解邻近的AP的渠道,即了解邻近的AP的渠道,是宝贵的。本文件的主要范围是将这一认识纳入以MAB为基础的河道勘探,作为传统的MAB基于WLAN的频道勘探计划而缺乏这一视角。为此,我们建议采用争议驱动的地段特征提取(CDFE CDFE ), 从而可以将交通适应性通道勘探作为相关的MAB(CAB) 渠道, 联合线性高层信任(JinUB) 探索, 将地图边缘作为线性通过估计的重量。此外,我们讨论了非兼容性的问题 -- 频道勘探周期 -- 这是自私分散式分化学习的内在困难。我们建议了基于低级的循环学习。