The transport capacity of a communication network can be characterized by the transition from a free-flow state to a congested state. Here, we propose a dynamic routing strategy in complex networks based on hierarchical bypass selections. The routing decisions are made by the reinforcement learning agents implemented at selected nodes with high betweenness centrality. The learning processes of the agents are decoupled from each other due to the degeneracy of their bypasses. Through interactions mediated by the underlying traffic dynamics, the agents act cooperatively, and coherent actions arise spontaneously. With only a small number of agents, the transport capacities are significantly improved, including in real-world Internet networks at the router level and the autonomous system level. Our strategy is also resilient to link removals.
翻译:通信网络的运输能力特点可以是从自由流通状态向拥挤状态的过渡。 我们在此提出基于等级绕行选择的复杂网络动态路线战略。 路线决定由在选定的交错中心点执行的强化学习代理商作出。 代理商的学习过程由于绕道的退化而相互脱钩。 通过以基本交通动态为媒介的相互作用,代理商采取合作行动,并自发地采取一致的行动。 只有少数代理商,运输能力得到显著改善,包括在路由器一级和自主系统一级的实际世界互联网网络。 我们的战略还具有连接清除的复原力。