Visible Light Communication (VLC) has been widely investigated during the last decade due to its ability to provide high data rates with low power consumption. In general, resource management is an important issue in cellular networks that can highly effect their performance. In this paper, an optimisation problem is formulated to assign each user to an optimal access point and a wavelength at a given time. This problem can be solved using mixed integer linear programming (MILP). However, using MILP is not considered a practical solution due to its complexity and memory requirements. In addition, accurate information must be provided to perform the resource allocation. Therefore, the optimisation problem is reformulated using reinforcement learning (RL), which has recently received tremendous interest due to its ability to interact with any environment without prior knowledge. In this paper, we investigate solving the resource allocation optimisation problem in VLC systems using the basic Q-learning algorithm. Two scenarios are simulated to compare the results with the previously proposed MILP model. The results demonstrate the ability of the Q-learning algorithm to provide optimal solutions close to the MILP model without prior knowledge of the system.
翻译:过去十年来,人们广泛调查了可见光通信(VLC),因为它能够提供高数据率低电耗,总体而言,资源管理是细胞网络中的一个重要问题,能够对其性能产生高度影响。在本文件中,优化问题被提出来,让每个用户在特定时间达到最佳接入点和波长。这个问题可以通过混合整数线性编程(MILP)来解决。但是,由于MILP的复杂性和记忆要求,使用MILP并不被视为一个实际解决办法。此外,必须提供准确的信息来进行资源分配。因此,优化问题是利用强化学习(RL)重新形成的,因为后者最近由于能够在没有事先了解的情况下与任何环境进行互动而引起极大兴趣。在本文件中,我们研究如何利用基本的Q-学习算法解决VLC系统中的资源分配优化问题。模拟了两种假设,将结果与先前提议的MILP模型进行比较。结果显示Q学习算法有能力提供接近MILP模型的最佳解决方案,而没有事先对系统的了解。