In this paper, we propose a deep state-action-reward-state-action (SARSA) $\lambda$ learning approach for optimising the uplink resource allocation in non-orthogonal multiple access (NOMA) aided ultra-reliable low-latency communication (URLLC). To reduce the mean decoding error probability in time-varying network environments, this work designs a reliable learning algorithm for providing a long-term resource allocation policy, where the reward feedback is based on the instantaneous network performance. With the aid of the proposed algorithm, this paper addresses three main challenges of reliable resource sharing in NOMA-URLLC networks: 1) user clustering; 2) instantaneous feedback system; and 3) optimal resource allocation. All of these designs interact with the considered communication environment. Finally, we compare the performance of the proposed algorithm with conventional Q-learning and SARSA learning algorithms. The simulation outcomes show that: 1) compared with traditional Q-learning algorithms, the proposed solution is able to converge within 200 episodes while providing a long-term mean error probability as low as $10^{-2}$; 2) NOMA-assisted URLLC outperforms traditional OMA systems in terms of decoding error probabilities; and 3) the proposed feedback system is efficient for the long-term learning process.
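For context, the update rule underlying tabular SARSA($\lambda$) with accumulating eligibility traces is sketched below; this is the generic textbook form, not the paper's deep variant (which replaces the tabular value function with a neural-network approximator), and the symbols $\alpha$, $\gamma$, $e_t$ are standard notation rather than quantities defined in this abstract:
\begin{align*}
\delta_t &= r_t + \gamma\, Q(s_{t+1}, a_{t+1}) - Q(s_t, a_t), \\
e_t(s,a) &= \gamma \lambda\, e_{t-1}(s,a) + \mathbb{1}\{(s,a) = (s_t, a_t)\}, \\
Q(s,a) &\leftarrow Q(s,a) + \alpha\, \delta_t\, e_t(s,a) \quad \forall (s,a),
\end{align*}
where $\delta_t$ is the temporal-difference error, $\lambda$ controls how far credit from the instantaneous reward propagates back along the visited state-action trajectory, and $\alpha$ is the learning rate.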