In this paper, we explore the use of multi-agent deep learning as well as learning to cooperate principles to meet stringent service level agreements, in terms of throughput and end-to-end delay, for a set of classified network flows. We consider agents built on top of a weighted fair queuing algorithm that continuously set weights for three flow groups: gold, silver, and bronze. We rely on a novel graph-convolution based, multi-agent reinforcement learning approach known as DGN. As benchmarks, we propose centralized and distributed deep Q-network approaches and evaluate their performances in different network, traffic, and routing scenarios, highlighting the effectiveness of our proposals and the importance of agent cooperation. We show that our DGN-based approach meets stringent throughput and delay requirements across all scenarios.
翻译:在本文中,我们探索如何利用多试剂深层学习以及学习合作原则,以达到严格的服务级协议,即一系列保密网络流动的输送量和端到端的延迟。我们考虑在加权公平排队算法之上建起的代理商,该算法不断为金、银和铜三个流动组设定权重。我们依靠一种以图表为基础的、多试剂强化学习方法,即DGN。作为基准,我们提出集中和分散的深度Q网络方法,并评价它们在不同的网络、交通和路由情景中的绩效,突出我们的建议的有效性和代理人合作的重要性。我们表明,我们基于DGN的方法满足了各种情景的严格吞吐量和延迟要求。