Recently, intent-based management has received good attention in telecom networks owing to stringent performance requirements for many of the use cases. Several approaches in the literature employ traditional closed-loop driven methods to fulfill the intents on the KPIs. However, these methods consider every closed-loop independent of each other which degrades the combined performance. Also, such existing methods are not easily scalable. Multi-agent reinforcement learning (MARL) techniques have shown significant promise in many areas in which traditional closed-loop control falls short, typically for complex coordination and conflict management among loops. In this work, we propose a method based on MARL to achieve intent-based management without the need for knowing a model of the underlying system. Moreover, when there are conflicting intents, the MARL agents can implicitly incentivize the loops to cooperate and promote trade-offs, without human interaction, by prioritizing the important KPIs. Experiments have been performed on a network emulator for optimizing KPIs of three services. Results obtained demonstrate that the proposed system performs quite well and is able to fulfill all existing intents when there are enough resources or prioritize the KPIs when resources are scarce.
翻译:最近,在电信网络中,由于许多使用案例的严格业绩要求,基于意图的管理在电信网络中得到了很好的注意。文献中的几种方法采用传统的封闭式环驱动方法来实现对KPI的意向。然而,这些方法考虑到每个相互独立的封闭式环流,从而降低联合性能。此外,这些现有方法不容易推广。多试剂强化学习(MARL)技术在传统闭路控制不足的许多领域显示出很大的希望,通常是为了在循环中进行复杂的协调和冲突管理。在这项工作中,我们建议以MAL为基础,在不需要了解基本系统模式的情况下实现基于意图的管理。此外,如果存在相互矛盾的意图,MARL代理商可以不优先考虑重要的KPIs,暗含鼓励循环,在没有人际互动的情况下进行合作和促进交易。在优化三种服务的KPIs时,对网络模拟器进行了实验。获得的结果表明,拟议的系统运作良好,在资源充足或资源稀缺时能够满足所有现有意图。