Current trends in networking propose the use of Machine Learning (ML) for a wide variety of network optimization tasks. As such, many efforts have been made to produce ML-based solutions for Traffic Engineering (TE), which is a fundamental problem in ISP networks. Nowadays, state-of-the-art TE optimizers rely on traditional optimization techniques, such as Local search, Constraint Programming, or Linear programming. In this paper, we present MAGNNETO, a distributed ML-based framework that leverages Multi-Agent Reinforcement Learning and Graph Neural Networks for distributed TE optimization. MAGNNETO deploys a set of agents across the network that learn and communicate in a distributed fashion via message exchanges between neighboring agents. Particularly, we apply this framework to optimize link weights in OSPF, with the goal of minimizing network congestion. In our evaluation, we compare MAGNNETO against several state-of-the-art TE optimizers in more than 75 topologies (up to 153 nodes and 354 links), including realistic traffic loads. Our experimental results show that, thanks to its distributed nature, MAGNNETO achieves comparable performance to state-of-the-art TE optimizers with significantly lower execution times. Moreover, our ML-based solution demonstrates a strong generalization capability to successfully operate in new networks unseen during training.
翻译:网络优化的最新趋势是使用机器学习(ML)解决各种问题,交通工程(TE)是ISP网络中的一个基本问题。目前,最先进的TE优化器依赖于传统的优化技术,如局部搜索、约束编程或线性编程。在本文中,我们提出了MAGNNETO,这是一个分布式ML框架,利用多智能体强化学习和图神经网络进行分布式TE优化。 MAGNNETO在网络中部署了一组代理,它们通过邻居代理之间的消息交换以分布式方式学习和通信。特别地,我们将此框架应用于优化OSPF中的链路权重,以最小化网络拥塞。在评估中,我们将MAGNNETO与75多个拓扑(高达153个节点和354个链接),包括真实的流量负载,进行比较。我们的实验结果表明,由于其分布式性质,MAGNNETO的执行时间显着较低,同时实现了可比较的性能,超越了最先进的TE优化器。此外,我们的ML解决方案表现出强大的泛化能力,在训练过程中未曾见过的新网络中成功运行。