Radio access network (RAN) technologies continue to evolve, with Open RAN gaining the most recent momentum. In the O-RAN specifications, the RAN intelligent controllers (RICs) are software-defined orchestration and automation functions for the intelligent management of RAN. This article introduces principles for machine learning (ML), in particular, reinforcement learning (RL) applications in the O-RAN stack. Furthermore, we review the state-of-the-art research in wireless networks and cast it onto the RAN framework and the hierarchy of the O-RAN architecture. We provide a taxonomy for the challenges faced by ML/RL models throughout the development life-cycle: from the system specification to production deployment (data acquisition, model design, testing and management, etc.). To address the challenges, we integrate a set of existing MLOps principles with unique characteristics when RL agents are considered. This paper discusses a systematic model development, testing and validation life-cycle, termed: RLOps. We discuss fundamental parts of RLOps, which include: model specification, development, production environment serving, operations monitoring and safety/security. Based on these principles, we propose the best practices for RLOps to achieve an automated and reproducible model development process. At last, a holistic data analytics platform rooted in the O-RAN deployment is designed and implemented, aiming to embrace and fulfil the aforementioned principles and best practices of RLOps.
翻译:在O-RAN规格中,RAN智能控制器(RICs)是用于智能管理RAN的软件定义的调音和自动化功能。这一条提出了机器学习的原则(ML),特别是在O-RAN堆叠中引入了加强学习(RL)应用的原则。此外,我们审查了无线网络的最新研究,并将其纳入RAN框架和O-RAN结构的等级。我们为ML/RL模型在发展整个生命周期(从系统规格到生产部署(数据获取、模型设计、测试和管理等))所面临的挑战提供了一个分类。为了应对挑战,我们结合了一套现有MLOPs原则,在考虑RAN代理时具有独特的特点。此外,我们讨论了无线网络的最新模型开发、测试和验证生命周期。我们讨论了RLOPs的基本组成部分,其中包括:示范规格、开发、生产环境服务、操作、操作和安全及RLS/安全性部署的模型。我们根据这些原则,提出了一个最终实现的ROPS-ROA原则的自动化流程。