Deep learning (DL) has seen great success in the computer vision (CV) field, and related techniques have been used in security, healthcare, remote sensing, and many other fields. As a parallel development, visual data has become universal in daily life, easily generated by ubiquitous low-cost cameras. Therefore, exploring DL-based CV may yield useful information about objects, such as their number, locations, distribution, motion, etc. Intuitively, DL-based CV can also facilitate and improve the designs of wireless communications, especially in dynamic network scenarios. However, so far, such work is rare in the literature. The primary purpose of this article, then, is to introduce ideas about applying DL-based CV in wireless communications to bring some novel degrees of freedom to both theoretical research and engineering applications. To illustrate how DL-based CV can be applied in wireless communications, an example of using a DL-based CV with a millimeter-wave (mmWave) system is given to realize optimal mmWave multiple-input and multiple-output (MIMO) beamforming in mobile scenarios. In this example, we propose a framework to predict future beam indices from previously observed beam indices and images of street views using ResNet, 3-dimensional ResNext, and a long short-term memory network. The experimental results show that our frameworks achieve much higher accuracy than the baseline method, and that visual data can significantly improve the performance of the MIMO beamforming system. Finally, we discuss the opportunities and challenges of applying DL-based CV in wireless communications.
翻译:深入的学习(DL)在计算机视觉(CV)领域取得了巨大成功,相关技术在安全、保健、遥感和其他许多领域也得到了应用。作为平行发展,视觉数据在日常生活中已经普及,容易由无处不在的低成本相机产生。因此,探索基于DL的CV可能会产生关于对象的有用信息,如其数量、地点、分布、运动等。直观地,基于DL的CV还可以促进和改进无线通信的设计,特别是在动态网络情景中。然而,迄今为止,这类工作在文献中是少见的。因此,这一文章的主要目的是在无线通信中引入应用基于DL的CV的理念,为理论研究和工程应用带来一些新程度的自由。为了说明基于DL的CV如何应用于无线通信,一个使用毫米波(mmWave)的基于DL的 CV系统实现最佳的多输出和多输出(MIMO),在移动图像中大大地应用基于移动图像的模型和M-RL的模型模型,我们用一个长期的模型来显示我们所观测的模型和M-RFM的模型。