学习资源调度与使用深度确定性策略梯度进行高优先级用户调度 (Learning Resource Scheduling with High Priority Users using Deep Deterministic Policy Gradients) - 专知论文

会员服务 ·

0

调度 · 确定性策略 · 资源调度 · 策略梯度 · 用户调度 ·

2023 年 4 月 19 日

Learning Resource Scheduling with High Priority Users using Deep Deterministic Policy Gradients

翻译：学习资源调度与使用深度确定性策略梯度进行高优先级用户调度

Steffen Gracla,Edgar Beck,Carsten Bockelmann,Armin Dekorsy

Advances in mobile communication capabilities open the door for closer integration of pre-hospital and in-hospital care processes. For example, medical specialists can be enabled to guide on-site paramedics and can, in turn, be supplied with live vitals or visuals. Consolidating such performance-critical applications with the highly complex workings of mobile communications requires solutions both reliable and efficient, yet easy to integrate with existing systems. This paper explores the application of Deep Deterministic Policy Gradient~(\ddpg) methods for learning a communications resource scheduling algorithm with special regards to priority users. Unlike the popular Deep-Q-Network methods, the \ddpg is able to produce continuous-valued output. With light post-processing, the resulting scheduler is able to achieve high performance on a flexible sum-utility goal.

翻译：移动通信能力的进步为院前和院内护理流程的更紧密集成打开了大门。例如，可以启用医学专家来指导现场医护人员，并随之获取实时生命体征或可视化图像。将这样的绩效关键应用程序与移动通信的高度复杂工作融合需要可靠而高效的解决方案，同时易于与现有系统集成。本文探讨了使用深度确定性策略梯度方法_\ddpg学习通信资源调度算法的应用，特别关注优先级用户。与广受欢迎的 Deep-Q Network 方法不同，\ddpg能够产生连续值输出。通过轻微后期处理，产生的调度程序能够在柔性总效用目标上取得高性能。

0

相关内容

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

【AI+军事】美国HRL实验室AAAI2020《基于强化学习的多智能体任务规划》，Multi-Agent Mission Planning with Reinforcement Learning

【AI+军事】美国HRL实验室AAAI2020《基于强化学习的多智能体任务规划》，Multi-Agent Mission Planning with Reinforcement Learning

专知会员服务

232+阅读 · 2022年4月10日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【2022新书】强化学习工业应用，408页pdf

【2022新书】强化学习工业应用，408页pdf

专知会员服务

231+阅读 · 2022年2月3日

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

324+阅读 · 2020年11月26日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

专知

27+阅读 · 2018年2月7日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

分布式有监督学习的学习理论

国家自然科学基金

17+阅读 · 2015年12月31日

协方差融合算法在时滞系统中的应用研究

国家自然科学基金

2+阅读 · 2015年12月31日

TCDD经SSeCKS/TRAF6通路诱导星形胶质细胞激活致神经毒性的机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

多核混合关键度实时系统中同步感知的调度方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

连续调谐波长变换在正交频分复用弹性光网络中的应用

国家自然科学基金

0+阅读 · 2013年12月31日

从调控星形胶质细胞活化异质性探讨益肾化浊通络法对多发性硬化髓鞘再生适应性保护效应机制

国家自然科学基金

0+阅读 · 2013年12月31日

嵌入式多核环境中分区操作系统关键技术研究

国家自然科学基金

0+阅读 · 2013年12月31日

物联网中面向应急响应的排队机制及QoS保证研究

国家自然科学基金

0+阅读 · 2012年12月31日

面向光子网格应用的上行多波长动态调度无源光网络

国家自然科学基金

0+阅读 · 2009年12月31日

缓存交换机确保时限调度的NP-C问题新解法

国家自然科学基金

0+阅读 · 2008年12月31日

Learning Failure-Inducing Models for Testing Software-Defined Networks

Arxiv

0+阅读 · 2023年6月5日

Streaming Task Graph Scheduling for Dataflow Architectures

Arxiv

0+阅读 · 2023年6月5日

Large-Scale Distributed Learning via Private On-Device Locality-Sensitive Hashing

Arxiv

0+阅读 · 2023年6月5日

ByzSecAgg: A Byzantine-Resistant Secure Aggregation Scheme for Federated Learning Based on Coded Computing and Vector Commitment

Arxiv

0+阅读 · 2023年6月2日

CSMAAFL: Client Scheduling and Model Aggregation in Asynchronous Federated Learning

Arxiv

0+阅读 · 2023年6月1日

Policy Optimization for Continuous Reinforcement Learning

Arxiv

0+阅读 · 2023年6月1日

CRS-FL: Conditional Random Sampling for Communication-Efficient and Privacy-Preserving Federated Learning

Arxiv

0+阅读 · 2023年6月1日

Progressive Learning for Physics-informed Neural Motion Planning

Arxiv

0+阅读 · 2023年6月1日

A Comprehensive Survey on Orbital Edge Computing: Systems, Applications, and Algorithms

Arxiv

0+阅读 · 2023年6月1日

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

Arxiv

19+阅读 · 2022年5月13日

VIP会员

文章信息

相关主题

确定性策略

相关VIP内容

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

【AI+军事】美国HRL实验室AAAI2020《基于强化学习的多智能体任务规划》，Multi-Agent Mission Planning with Reinforcement Learning

【AI+军事】美国HRL实验室AAAI2020《基于强化学习的多智能体任务规划》，Multi-Agent Mission Planning with Reinforcement Learning

专知会员服务

232+阅读 · 2022年4月10日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【2022新书】强化学习工业应用，408页pdf

【2022新书】强化学习工业应用，408页pdf

专知会员服务

231+阅读 · 2022年2月3日

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

324+阅读 · 2020年11月26日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】在低维和高维空间中分析、建模和转换潜在表征

从无人机到数据：揭示边缘计算作为新作战域

可解释人工智能的基础

大规模视觉模型中的基于提示的适应：综述

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

专知

27+阅读 · 2018年2月7日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Learning Failure-Inducing Models for Testing Software-Defined Networks

Arxiv

0+阅读 · 2023年6月5日

Streaming Task Graph Scheduling for Dataflow Architectures

Arxiv

0+阅读 · 2023年6月5日

Large-Scale Distributed Learning via Private On-Device Locality-Sensitive Hashing

Arxiv

0+阅读 · 2023年6月5日

ByzSecAgg: A Byzantine-Resistant Secure Aggregation Scheme for Federated Learning Based on Coded Computing and Vector Commitment

Arxiv

0+阅读 · 2023年6月2日

CSMAAFL: Client Scheduling and Model Aggregation in Asynchronous Federated Learning

Arxiv

0+阅读 · 2023年6月1日

Policy Optimization for Continuous Reinforcement Learning

Arxiv

0+阅读 · 2023年6月1日

CRS-FL: Conditional Random Sampling for Communication-Efficient and Privacy-Preserving Federated Learning

Arxiv

0+阅读 · 2023年6月1日

Progressive Learning for Physics-informed Neural Motion Planning

Arxiv

0+阅读 · 2023年6月1日

A Comprehensive Survey on Orbital Edge Computing: Systems, Applications, and Algorithms

Arxiv

0+阅读 · 2023年6月1日

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

Arxiv

19+阅读 · 2022年5月13日

相关基金

分布式有监督学习的学习理论

国家自然科学基金

17+阅读 · 2015年12月31日

协方差融合算法在时滞系统中的应用研究

国家自然科学基金

2+阅读 · 2015年12月31日

TCDD经SSeCKS/TRAF6通路诱导星形胶质细胞激活致神经毒性的机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

多核混合关键度实时系统中同步感知的调度方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

连续调谐波长变换在正交频分复用弹性光网络中的应用

国家自然科学基金

0+阅读 · 2013年12月31日

从调控星形胶质细胞活化异质性探讨益肾化浊通络法对多发性硬化髓鞘再生适应性保护效应机制

国家自然科学基金

0+阅读 · 2013年12月31日

嵌入式多核环境中分区操作系统关键技术研究

国家自然科学基金

0+阅读 · 2013年12月31日

物联网中面向应急响应的排队机制及QoS保证研究

国家自然科学基金

0+阅读 · 2012年12月31日

面向光子网格应用的上行多波长动态调度无源光网络

国家自然科学基金

0+阅读 · 2009年12月31日

缓存交换机确保时限调度的NP-C问题新解法

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员