Unmanned aerial vehicles (UAVs) are now beginning to be deployed for enhancing the network performance and coverage in wireless communication. However, due to the limitation of their on-board power and flight time, it is challenging to obtain an optimal resource allocation scheme for the UAV-assisted Internet of Things (IoT). In this paper, we design a new UAV-assisted IoT systems relying on the shortest flight path of the UAVs while maximising the amount of data collected from IoT devices. Then, a deep reinforcement learning-based technique is conceived for finding the optimal trajectory and throughput in a specific coverage area. After training, the UAV has the ability to autonomously collect all the data from user nodes at a significant total sum-rate improvement while minimising the associated resources used. Numerical results are provided to highlight how our techniques strike a balance between the throughput attained, trajectory, and the time spent. More explicitly, we characterise the attainable performance in terms of the UAV trajectory, the expected reward and the total sum-rate.
翻译:为加强无线通信的网络性能和覆盖范围,现在开始部署无人驾驶飞行器(无人驾驶飞行器),以提高无线通信的网络性能和覆盖范围;然而,由于无人驾驶飞行器的机载动力和飞行时间有限,要为无人驾驶飞行器协助的物联网(IoT)获得最佳资源分配计划是困难的。在本文件中,我们设计了新的无人驾驶飞行器协助的IoT系统,依靠无人驾驶飞行器最短的飞行路径,同时尽量扩大从IoT装置收集的数据量。然后,设计了一种深入强化的学习技术,以便在特定覆盖区找到最佳轨迹和吞吐。经过培训后,无人驾驶飞行器有能力从用户节点自动收集所有数据,但总和率要大幅度改进,同时尽量减少使用的相关资源。提供了数字结果,以突出我们的技术如何在无人驾驶飞行器的完成量、轨迹和所花费的时间之间取得平衡。我们更明确地说明从无人驾驶飞行器轨迹、预期的奖励和总和总和总速率方面可以实现的业绩。