在无人驾驶航空飞行器通信网络中为部分重叠的通道分配而大力学习Fuzzy-Larch (Robust Fuzzy-Learning For Partially Overlapping Channels Allocation In UAV Communication Networks)

In this paper, we consider a mesh-structured unmanned aerial vehicle (UAV) networks exploiting partially overlapping channels (POCs). For general data-collection tasks in UAV networks, we aim to optimize the network throughput with constraints on transmission power and quality of service (QoS). As far as the highly mobile and constantly changing UAV networks are concerned, unfortunately, most existing methods rely on definite information which is vulnerable to the dynamic environment, rendering system performance to be less effective. In order to combat dynamic topology and varying interference of UAV networks, a robust and distributed learning scheme is proposed. Rather than the perfect channel state information (CSI), we introduce uncertainties to characterize the dynamic channel gains among UAV nodes, which are then interpreted with fuzzy numbers. Instead of the traditional observation space where the channel capacity is a crisp reward, we implement the learning and decision process in a mapped fuzzy space. This allows the system to achieve a smoother and more robust performance by optimizing in an alternate space. To this end, we design a fuzzy payoffs function (FPF) to describe the fluctuated utility, and the problem of POCs assignment is formulated as a fuzzy payoffs game (FPG). Assisted by an attractive property of fuzzy bi-matrix games, the existence of fuzzy Nash equilibrium (FNE) for our formulated FPG is proved. Our robust fuzzy-learning algorithm could reach the equilibrium solution via a least-deviation method. Finally, numerical simulations are provided to demonstrate the advantages of our new scheme over the existing scheme.

翻译：在本文中,我们考虑的是利用部分重叠渠道的网状结构型无人驾驶飞行器(UAV)网络。关于UAV网络的一般数据收集任务,我们的目标是在传输动力和服务质量的限制下优化网络输送量。不幸的是,对于高度移动和不断变化的UAV网络而言,大多数现有方法都依赖易于受动态环境影响的明确信息,使系统性能降低效力。为了打击动态地形学和对UAV网络的强力干扰,我们提出了一个强有力和分布式的学习计划。与完美的频道状态信息(CSI)相比,我们引入了不确定性,将UAV节点的动态频道收益定性,然后用模糊的数字加以解释。对于传统的观测空间,即频道能力是精确的奖励,我们大多数现有方法依靠模糊空间执行学习和决策过程。这样,系统就可以通过优化替代性的学习空间实现更平稳和更稳健的业绩。为此,我们设计了一个模糊的付款功能(FPFFF),用来描述波动的节流效用,然后用模糊数字式的节算方法来说明我们现有的平价规则的存在。