Millimeter-wave (mmWave) and terahertz (THz) communication systems adopt large antenna arrays to ensure adequate receive signal power. However, adjusting the narrow beams of these antenna arrays typically incurs high beam training overhead that scales with the number of antennas. Recently proposed vision-aided beam prediction solutions, which utilize \textit{raw RGB images} captured at the basestation to predict the optimal beams, have shown initial promising results. However, they still have a considerable computational complexity, limiting their adoption in the real world. To address these challenges, this paper focuses on developing and comparing various approaches that extract lightweight semantic information from the visual data. The results show that the proposed solutions can significantly decrease the computational requirements while achieving similar beam prediction accuracy compared to the previously proposed vision-aided solutions.
翻译:毫米波(mmWave)和泰赫兹(THz)通信系统采用大型天线阵列以确保充分接收信号电能;然而,调整这些天线阵列的窄射束通常需要用天线数量进行高射束训练。最近提出的利用基地台拍摄的光束预测方案,利用光束图像来预测最佳光束,显示了初步的有希望的结果。然而,这些天线阵列仍然具有相当大的计算复杂性,限制了它们在现实世界中的采用。为应对这些挑战,本文件侧重于制定和比较从视觉数据中提取轻量语义信息的各种方法。结果显示,拟议的解决方案可以大大减少计算要求,同时实现与先前提出的光量辅助解决方案相似的光束预测准确性。