Unmanned aerial vehicles (UAVs) are expected to be an integral part of wireless networks. In this paper, we aim to find collision-free paths for multiple cellular-connected UAVs, while satisfying requirements of connectivity with ground base stations (GBSs) in the presence of a dynamic jammer. We first formulate the problem as a sequential decision making problem in discrete domain, with connectivity, collision avoidance, and kinematic constraints. We, then, propose an offline temporal difference (TD) learning algorithm with online signal-to-interference-plus-noise ratio (SINR) mapping to solve the problem. More specifically, a value network is constructed and trained offline by TD method to encode the interactions among the UAVs and between the UAVs and the environment; and an online SINR mapping deep neural network (DNN) is designed and trained by supervised learning, to encode the influence and changes due to the jammer. Numerical results show that, without any information on the jammer, the proposed algorithm can achieve performance levels close to that of the ideal scenario with the perfect SINR-map. Real-time navigation for multi-UAVs can be efficiently performed with high success rates, and collisions are avoided.
翻译:无人驾驶航空飞行器(UAVs)预计将成为无线网络的一个组成部分。 在本文中,我们的目标是寻找多个蜂窝连接的无人驾驶航空器的无碰撞路径,同时满足在动态干扰器面前与地面基地站(GBS)连接的要求。我们首先将这一问题作为离散域的相继决策问题,提出连通、避免碰撞和运动限制。然后,我们提出一个离线时间差异(TD)学习算法,以在线信号到干涉加噪音比率(SINR)绘图解决问题。更具体地说,一个价值网络是用TD方法建造和培训离线的,以编码无人驾驶飞行器之间以及无人驾驶飞行器与环境之间的相互作用;一个在线SIRNR测绘深神经网络(DNNN)的设计和培训,通过有监督的学习来说明干扰的影响和变化。 数字结果显示,在没有关于干扰器的任何信息的情况下,拟议的算法可以达到接近理想情景的性能水平,而SINR-M-map是完美的,通过实时导航和高度避免的多式导航成功率进行。