学习使用多路阻塞控制和排程的 Harness Bandwidth (Learning to Harness Bandwidth with Multipath Congestion Control and Scheduling) - 专知论文

会员服务 ·

0

控制器 · 学成 · Extensibility · Networking · Continuity ·

2021 年 5 月 29 日

Learning to Harness Bandwidth with Multipath Congestion Control and Scheduling

翻译：学习使用多路阻塞控制和排程的 Harness Bandwidth

Shiva Raj Pokhrel,Anwar Walid

from arxiv, 14 pages

Multipath TCP (MPTCP) has emerged as a facilitator for harnessing and pooling available bandwidth in wireless/wireline communication networks and in data centers. Existing implementations of MPTCP such as, Linked Increase Algorithm (LIA), Opportunistic LIA (OLIA) and BAlanced LInked Adaptation (BALIA) include separate algorithms for congestion control and packet scheduling, with pre-selected control parameters. We propose a Deep Q-Learning (DQL) based framework for joint congestion control and packet scheduling for MPTCP. At the heart of the solution is an intelligent agent for interface, learning and actuation, which learns from experience optimal congestion control and scheduling mechanism using DQL techniques with policy gradients. We provide a rigorous stability analysis of system dynamics which provides important practical design insights. In addition, the proposed DQL-MPTCP algorithm utilizes the `recurrent neural network' and integrates it with `long short-term memory' for continuously i) learning dynamic behavior of subflows (paths) and ii) responding promptly to their behavior using prioritized experience replay. With extensive emulations, we show that the proposed DQL-based MPTCP algorithm outperforms MPTCP LIA, OLIA and BALIA algorithms. Moreover, the DQL-MPTCP algorithm is robust to time-varying network characteristics, and provides dynamic exploration and exploitation of paths.

翻译：多路TCP(MPTCP)已成为在无线/网络通信网络和数据中心使用和集中现有带宽的促进者,目前实施MPTPCP(LIA)、机会性LIA(OLIA)和BAlanced LInked适应(BALIA)等MPTCP(MPTCP),包括了分别用于控制拥堵和包装时间安排的算法,并附有预先选定的控制参数。我们提议了一个基于深QL学习(DQL)的框架,用于无线/线通信网络和数据中心的联合拥堵控制(DQL) 。解决方案的核心是界面、学习和动作的智能剂,它利用政策梯度的DQL技术,从最优化的拥堵控制经验和时间安排机制中学习。我们对系统动态进行严格的稳定分析,提供重要的实用设计见解。此外,拟议的DQL-MPTCP算法利用“经常性神经网络”并将其与“基于短期的动态记忆”结合起来,用于不断学习子流动态动作(路径)和DLIA快速应对其行为,我们利用优先经验重新展示其网络。

0

相关内容

控制器

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【google】监督对比学习，Supervised Contrastive Learning

【google】监督对比学习，Supervised Contrastive Learning

专知会员服务

32+阅读 · 2020年4月23日

【Manning2020新书】深度强化学习实战，351页pdf，Deep Reinforcement Learning

【Manning2020新书】深度强化学习实战，351页pdf，Deep Reinforcement Learning

专知会员服务

292+阅读 · 2020年3月10日

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

专知会员服务

84+阅读 · 2020年2月18日

【新墨西哥大学】深度学习的局限性和缺陷，10页pdf，Deep Learning Limitations and Flaws

【新墨西哥大学】深度学习的局限性和缺陷，10页pdf，Deep Learning Limitations and Flaws

专知会员服务

54+阅读 · 2020年2月5日

【课程推荐】斯坦福课程：图机器学习《CS224W: Machine Learning with Graphs(Stanford / Fall 2019)》by Jurij Leskovec

【课程推荐】斯坦福课程：图机器学习《CS224W: Machine Learning with Graphs(Stanford / Fall 2019)》by Jurij Leskovec

专知会员服务

146+阅读 · 2019年12月10日

【AAAI 2019 Tutorial】城市交通控制的规划与调度方法（Planning and Scheduling Approaches for Urban Traffic Control），Scott Sanner，Mauro Vallati，Stephen F. Smith

【AAAI 2019 Tutorial】城市交通控制的规划与调度方法（Planning and Scheduling Approaches for Urban Traffic Control），Scott Sanner，Mauro Vallati，Stephen F. Smith

专知会员服务

8+阅读 · 2019年11月18日

【课程】Andrew Ng与Google Brain团队联合出品《TensorFlow in Practice 》

【课程】Andrew Ng与Google Brain团队联合出品《TensorFlow in Practice 》

专知会员服务

13+阅读 · 2019年10月29日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Call for Participation: Shared Tasks in NLPCC 2019

Call for Participation: Shared Tasks in NLPCC 2019

中国计算机学会

5+阅读 · 2019年3月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

已删除

生物探索

3+阅读 · 2018年2月10日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

Group Contrastive Self-Supervised Learning on Graphs

Arxiv

0+阅读 · 2021年7月20日

An Analysis of Reinforcement Learning for Malaria Control

An Analysis of Reinforcement Learning for Malaria Control

Arxiv

0+阅读 · 2021年7月19日

AoI-minimizing Scheduling in UAV-relayed IoT Networks

Arxiv

0+阅读 · 2021年7月19日

DeepCC: Bridging the Gap Between Congestion Control and Applications via Multi-Objective Optimization

Arxiv

0+阅读 · 2021年7月19日

Predictable Bandwidth Slicing with Open vSwitch

Arxiv

0+阅读 · 2021年7月18日

Reliability and User-Plane Latency Analysis of mmWave Massive MIMO for Grant-Free URLLC Applications

Arxiv

0+阅读 · 2021年7月17日

Achievable Rate with Antenna Size Constraint: Shannon meets Chu and Bode

Arxiv

0+阅读 · 2021年7月16日

Imitation Learning: Progress, Taxonomies and Opportunities

Arxiv

12+阅读 · 2021年6月23日

Deep Robust Clustering by Contrastive Learning

Arxiv

7+阅读 · 2020年8月7日

Q-value Path Decomposition for Deep Multiagent Reinforcement Learning

Q-value Path Decomposition for Deep Multiagent Reinforcement Learning

Arxiv

26+阅读 · 2020年2月10日

VIP会员

文章信息

相关主题

相关VIP内容

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【google】监督对比学习，Supervised Contrastive Learning

【google】监督对比学习，Supervised Contrastive Learning

专知会员服务

32+阅读 · 2020年4月23日

【Manning2020新书】深度强化学习实战，351页pdf，Deep Reinforcement Learning

【Manning2020新书】深度强化学习实战，351页pdf，Deep Reinforcement Learning

专知会员服务

292+阅读 · 2020年3月10日

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

专知会员服务

84+阅读 · 2020年2月18日

【新墨西哥大学】深度学习的局限性和缺陷，10页pdf，Deep Learning Limitations and Flaws

【新墨西哥大学】深度学习的局限性和缺陷，10页pdf，Deep Learning Limitations and Flaws

专知会员服务

54+阅读 · 2020年2月5日

【课程推荐】斯坦福课程：图机器学习《CS224W: Machine Learning with Graphs(Stanford / Fall 2019)》by Jurij Leskovec

【课程推荐】斯坦福课程：图机器学习《CS224W: Machine Learning with Graphs(Stanford / Fall 2019)》by Jurij Leskovec

专知会员服务

146+阅读 · 2019年12月10日

【AAAI 2019 Tutorial】城市交通控制的规划与调度方法（Planning and Scheduling Approaches for Urban Traffic Control），Scott Sanner，Mauro Vallati，Stephen F. Smith

【AAAI 2019 Tutorial】城市交通控制的规划与调度方法（Planning and Scheduling Approaches for Urban Traffic Control），Scott Sanner，Mauro Vallati，Stephen F. Smith

专知会员服务

8+阅读 · 2019年11月18日

【课程】Andrew Ng与Google Brain团队联合出品《TensorFlow in Practice 》

【课程】Andrew Ng与Google Brain团队联合出品《TensorFlow in Practice 》

专知会员服务

13+阅读 · 2019年10月29日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

《太空边缘（临近空间）的武器化？军事高空平台的进展与前景》

《利用星基增强系统（SBAS）信号进行射频干扰（RFI）检测与特征分析》

美陆军在“艾布拉姆斯”坦克与“布拉德利”步战车上测试“牛蛙”反无人机炮塔

《军事领域特性及其对军事人工智能应用的影响》

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Call for Participation: Shared Tasks in NLPCC 2019

Call for Participation: Shared Tasks in NLPCC 2019

中国计算机学会

5+阅读 · 2019年3月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

已删除

生物探索

3+阅读 · 2018年2月10日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Group Contrastive Self-Supervised Learning on Graphs

Arxiv

0+阅读 · 2021年7月20日

An Analysis of Reinforcement Learning for Malaria Control

An Analysis of Reinforcement Learning for Malaria Control

Arxiv

0+阅读 · 2021年7月19日

AoI-minimizing Scheduling in UAV-relayed IoT Networks

Arxiv

0+阅读 · 2021年7月19日

DeepCC: Bridging the Gap Between Congestion Control and Applications via Multi-Objective Optimization

Arxiv

0+阅读 · 2021年7月19日

Predictable Bandwidth Slicing with Open vSwitch

Arxiv

0+阅读 · 2021年7月18日

Reliability and User-Plane Latency Analysis of mmWave Massive MIMO for Grant-Free URLLC Applications

Arxiv

0+阅读 · 2021年7月17日

Achievable Rate with Antenna Size Constraint: Shannon meets Chu and Bode

Arxiv

0+阅读 · 2021年7月16日

Imitation Learning: Progress, Taxonomies and Opportunities

Arxiv

12+阅读 · 2021年6月23日

Deep Robust Clustering by Contrastive Learning

Arxiv

7+阅读 · 2020年8月7日

Q-value Path Decomposition for Deep Multiagent Reinforcement Learning

Q-value Path Decomposition for Deep Multiagent Reinforcement Learning

Arxiv

26+阅读 · 2020年2月10日

微信扫码咨询专知VIP会员