Packet routing is a fundamental problem in communication networks: it determines how packets are directed from their source nodes to their destination nodes through intermediate nodes. With increasingly complex network topologies and highly dynamic traffic demands, conventional model-based and rule-based routing schemes show significant limitations, owing to simplified and unrealistic model assumptions and a lack of flexibility and adaptability. Adding intelligence to network control is becoming a trend and a key to achieving highly efficient network operation. In this paper, we develop a model-free, data-driven routing strategy based on reinforcement learning (RL), in which routers interact with the network and learn from experience to produce good routing configurations for the future. Considering the graph nature of the network topology, we design a multi-agent RL framework combined with a Graph Neural Network (GNN), tailored to the routing problem. Three deployment paradigms, namely centralized, federated, and cooperative learning, are explored. Simulation results demonstrate that our algorithm outperforms existing benchmark algorithms in terms of packet transmission delay and affordable load.
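To make the combination of multi-agent RL and a GNN concrete, the following is a minimal sketch, assuming PyTorch, of a per-router agent that performs one round of neighborhood message passing and scores each candidate next hop with a Q-value head. The class name `GraphRoutingAgent`, the feature layout, and the single message-passing round are illustrative assumptions, not the paper's actual architecture.

```python
import torch
import torch.nn as nn


class GraphRoutingAgent(nn.Module):
    """Hypothetical per-router agent: one message-passing round over the
    topology, then a Q-value head that scores each candidate next hop."""

    def __init__(self, node_feat_dim: int, hidden_dim: int = 64):
        super().__init__()
        self.encode = nn.Linear(node_feat_dim, hidden_dim)
        self.message = nn.Linear(hidden_dim, hidden_dim)
        # Q-head scores the pair (current router state, neighbor state).
        self.q_head = nn.Linear(2 * hidden_dim, 1)

    def forward(
        self, node_feats: torch.Tensor, adj: torch.Tensor, router: int
    ) -> tuple[torch.Tensor, torch.Tensor]:
        # node_feats: [N, F] local observations (e.g. queue length, destination encoding)
        # adj:        [N, N] float adjacency matrix of the topology
        h = torch.relu(self.encode(node_feats))             # [N, H] node embeddings
        agg = adj @ self.message(h)                         # sum messages from neighbors
        h = torch.relu(h + agg)                             # simple GNN update
        neighbors = adj[router].nonzero(as_tuple=True)[0]   # candidate next hops
        own = h[router].expand(len(neighbors), -1)          # repeat own state per neighbor
        q = self.q_head(torch.cat([own, h[neighbors]], dim=-1)).squeeze(-1)
        return q, neighbors  # next hop = neighbors[q.argmax()]


# Usage on a toy 4-node topology with 3 features per node (illustrative only).
N, F = 4, 3
adj = torch.tensor(
    [[0, 1, 1, 0], [1, 0, 0, 1], [1, 0, 0, 1], [0, 1, 1, 0]], dtype=torch.float32
)
agent = GraphRoutingAgent(node_feat_dim=F)
q, neighbors = agent(torch.randn(N, F), adj, router=0)
next_hop = neighbors[q.argmax()].item()
```

In a multi-agent setting, each router would hold such an agent and train it from its own experience; the centralized, federated, and cooperative paradigms then differ only in where the parameters are stored and how gradients or models are shared.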