Traditional multicast routing methods have some problems in constructing a multicast tree, such as limited access to network state information, poor adaptability to dynamic and complex changes in the network, and inflexible data forwarding. To address these defects, the optimal multicast routing problem in software-defined networking (SDN) is tailored as a multi-objective optimization problem, and an intelligent multicast routing algorithm DRL-M4MR based on the deep Q network (DQN) deep reinforcement learning (DRL) method is designed to construct a multicast tree in SDN. First, the multicast tree state matrix, link bandwidth matrix, link delay matrix, and link packet loss rate matrix are designed as the state space of the DRL agent by combining the global view and control of the SDN. Second, the action space of the agent is all the links in the network, and the action selection strategy is designed to add the links to the current multicast tree under four cases. Third, single-step and final reward function forms are designed to guide the intelligence to make decisions to construct the optimal multicast tree. The experimental results show that, compared with existing algorithms, the multicast tree construct by DRL-M4MR can obtain better bandwidth, delay, and packet loss rate performance after training, and it can make more intelligent multicast routing decisions in a dynamic network environment.
翻译:传统的多播路由方法在建设多播树方面有一些问题,例如网络状态信息的获取有限,对网络动态和复杂变化的适应性差,以及不灵活的数据传输。为了解决这些缺陷,软件定义网络(SDN)中的最佳多播路由问题被专门设计为一个多目标优化问题,基于深Q网络(DQN)深层强化学习(DRL-M4MR)的智能多播路运算算法DRL-M4MR(DRL)旨在在SDN构建多播种树。第一,多播树状态矩阵、链接带宽矩阵、链接延迟矩阵和链接包损失率矩阵设计为DRL代理的状态空间。第二,该代理的动作空间是网络中的所有链接,而行动选择战略的目的是在四个案例下增加当前多播种树的链接。第三,单级和最后奖励函数设计为指导智能决策,以构建最佳多播种树。实验结果显示,与现有智能网络相比,通过智能网络,可以构建更动态的升级, 能够构建更动态的磁带宽度环境, 能够构建一个更好的磁带宽化网络。