The beam alignment (BA) problem consists in accurately aligning the transmitter and receiver beams to establish a reliable communication link in wireless communication systems. Existing BA methods search the entire beam space to identify the optimal transmit-receive beam pair. This incurs a significant latency when the number of antennas is large. In this work, we develop a bandit-based fast BA algorithm to reduce BA latency for millimeter-wave (mmWave) communications. Our algorithm is named Two-Phase Heteroscedastic Track-and-Stop (2PHT\&S). We first formulate the BA problem as a pure exploration problem in multi-armed bandits in which the objective is to minimize the required number of time steps given a certain fixed confidence level. By taking advantage of the correlation structure among beams that the information from nearby beams is similar and the heteroscedastic property that the variance of the reward of an arm (beam) is related to its mean, the proposed algorithm groups all beams into several beam sets such that the optimal beam set is first selected and the optimal beam is identified in this set after that. Theoretical analysis and simulation results on synthetic and semi-practical channel data demonstrate the clear superiority of the proposed algorithm vis-\`a-vis other baseline competitors.
翻译:光束对齐问题在于精确地对发射机和接收器束进行对齐,以便在无线通信系统中建立可靠的通信连接。 BA现有方法搜索整个光束空间,以确定最理想的发射接收光束对比。当天线数量巨大时,这会产生显著的悬浮。在这项工作中,我们开发了以强盗为基础的快速BA算法,以减少毫米波通信(mmWave)的BA延缓度。我们的算法名为“双阶段超分轨道和停止”(2PHT ⁇ S) 。我们首先将BA问题描述为多臂强盗的一个纯粹的探索问题,目的是在某种固定信任水平下最大限度地减少所需时间步骤的数量。我们利用了相邻的光束之间的相关结构,以及一个手臂(bembam)的奖赏差异与其平均值有关,拟议的算法组全部组成若干波束组,以便首先选择最佳波束,并在这一系统中确定了最佳波束,然后在这一合成模型分析之后,还查明了其他高级实验室的模拟和模拟结果。