Recently, intent-based management is receiving good attention in telecom networks owing to stringent performance requirements for many of the use cases. Several approaches on the literature employ traditional methods in the telecom domain to fulfill intents on the KPIs, which can be defined as a closed loop. However, these methods consider every closed-loop independent of each other which degrades the combined closed-loop performance. Also, when many closed loops are needed, these methods are not easily scalable. Multi-agent reinforcement learning (MARL) techniques have shown significant promise in many areas in which traditional closed-loop control falls short, typically for complex coordination and conflict management among loops. In this work, we propose a method based on MARL to achieve intent-based management without the requirement of the model of the underlying system. Moreover, when there are conflicting intents, the MARL agents can implicitly incentivize the loops to cooperate, without human interaction, by prioritizing the important KPIs. Experiments have been performed on a network emulator on optimizing KPIs for three services and we observe the proposed system performs well and is able to fulfill all existing intents when there are enough resources or prioritize the KPIs when there are scarce resources.
翻译:最近,由于对许多使用案例的严格性能要求,基于意图的管理在电信网络中正受到很好的注意。关于文献的一些方法在电信领域采用传统方法来实现对KPI的意向,这些方法可以定义为一个闭路循环。但是,这些方法考虑到每个互不独立的闭路环,从而降低封闭环的功能。此外,当需要许多闭路环时,这些方法不容易推广。多试剂强化学习(MARL)技术在传统封闭环控制不足的许多领域显示出巨大的希望,这些领域的传统闭路控制通常难以实现各循环之间的复杂协调和冲突管理。在这项工作中,我们建议一种基于MARL的方法,在没有基本系统模式要求的情况下,实现基于意图的管理。此外,如果存在相互矛盾的意图,MARL代理可以隐含地鼓励在没有人类互动的情况下进行合作的循环,对重要的KPIs进行优先排序。已经对一个网络模拟器进行了实验,以优化三种服务的KPIs,我们观察了拟议的系统是否运行良好,并且能够在资源十分稀缺或优先的情况下,在资源十分稀少的情况下,能够满足所有现有的意图。