The next generation of networks will actively embrace artificial intelligence (AI) and machine learning (ML) technologies for automation networks and optimal network operation strategies. The emerging network structure represented by Open RAN (O-RAN) conforms to this trend, and the radio intelligent controller (RIC) at the centre of its specification serves as an ML applications host. Various ML models, especially Reinforcement Learning (RL) models, are regarded as the key to solving RAN-related multi-objective optimization problems. However, it should be recognized that most of the current RL successes are confined to abstract and simplified simulation environments, which may not directly translate to high performance in complex real environments. One of the main reasons is the modelling gap between the simulation and the real environment, which could make the RL agent trained by simulation ill-equipped for the real environment. This issue is termed as the sim2real gap. This article brings to the fore the sim2real challenge within the context of O-RAN. Specifically, it emphasizes the characteristics, and benefits that the digital twins (DT) could have as a place for model development and verification. Several use cases are presented to exemplify and demonstrate failure modes of the simulations trained RL model in real environments. The effectiveness of DT in assisting the development of RL algorithms is discussed. Then the current state of the art learning-based methods commonly used to overcome the sim2real challenge are presented. Finally, the development and deployment concerns for the RL applications realisation in O-RAN are discussed from the view of the potential issues like data interaction, environment bottlenecks, and algorithm design.
翻译:下一代网络将积极包括人工智能(AI)和机器学习(ML)技术,用于自动化网络和最佳网络运作战略。Open RAN(O-RAN)所代表的新兴网络结构符合这一趋势,其规格中心无线电智能控制器(RIC)作为ML应用程序的东道主。各种ML模型,特别是Servement Learning(RL)模型,被视为解决与RAN有关的多目标优化问题的关键。然而,应当认识到,目前的RL成功大多局限于抽象和简化的模拟环境,这也许不会直接转化为复杂的真实环境中的高性互动。其中一个主要原因是模拟与真实环境之间的建模差距,使模拟为实际环境设备不足而培训的RL代理器。这个问题被称为Sim2现实差距。这篇文章在O-RA的背景下提出了Sim2现实挑战性的挑战。它强调数字双胞(DT)作为模型开发和核查的场所可能具有的特性和好处。在RL应用中,一些经过培训的模型应用软件的使用案例正在演示当前发展模式的模拟中,在RDL的模拟中演示了当前发展方法的可靠性。