无人驾驶航空器辅助网络中数据更新的 Muti- Agenti- proximus 政策优化</s> (Muti-Agent Proximal Policy Optimization For Data Freshness in UAV-assisted Networks) - 专知论文

会员服务 ·

0

优化器 · Learning · Networking · 强化学习 · 可约的 ·

2023 年 3 月 15 日

Muti-Agent Proximal Policy Optimization For Data Freshness in UAV-assisted Networks

翻译：无人驾驶航空器辅助网络中数据更新的 Muti- Agenti- proximus 政策优化

Mouhamed Naby Ndiaye,El Houcine Bergou,Hajar El Hammouti

Unmanned aerial vehicles (UAVs) are seen as a promising technology to perform a wide range of tasks in wireless communication networks. In this work, we consider the deployment of a group of UAVs to collect the data generated by IoT devices. Specifically, we focus on the case where the collected data is time-sensitive, and it is critical to maintain its timeliness. Our objective is to optimally design the UAVs' trajectories and the subsets of visited IoT devices such as the global Age-of-Updates (AoU) is minimized. To this end, we formulate the studied problem as a mixed-integer nonlinear programming (MINLP) under time and quality of service constraints. To efficiently solve the resulting optimization problem, we investigate the cooperative Multi-Agent Reinforcement Learning (MARL) framework and propose an RL approach based on the popular on-policy Reinforcement Learning (RL) algorithm: Policy Proximal Optimization (PPO). Our approach leverages the centralized training decentralized execution (CTDE) framework where the UAVs learn their optimal policies while training a centralized value function. Our simulation results show that the proposed MAPPO approach reduces the global AoU by at least a factor of 1/2 compared to conventional off-policy reinforcement learning approaches.

翻译：无人驾驶航空飞行器(UAVs)被视为在无线通信网络中执行广泛任务的一种大有希望的技术。在这项工作中,我们考虑部署一组无人驾驶航空器收集IOT设备产生的数据。具体地说,我们侧重于所收集数据具有时间敏感性、对保持其及时性至关重要的案例。我们的目标是以最佳方式设计无人驾驶航空器的轨迹和诸如全球更新时代(AoU)等已访问的IOT装置的子集。为此,我们在服务限制的时间和质量限制下将所研究的问题发展成混合式非线性编程(MINLP ) 。为了有效解决由此产生的优化问题,我们调查多点加强学习合作框架,并根据流行的加强政策学习算法(RL)算法(PPO):政策优度最佳最佳优化化(PPO) 。我们的方法利用集中化培训(CTDE) 框架,UAVS在其中学习最佳政策,同时培训中央强化A/2功能。我们用模拟结果显示在1PO的学习方法上降低最优化的升级。</s>

1

相关内容

优化器

【干货书】数据分析优化，Optimization for Modern Data Analysis，117页pdf

【干货书】数据分析优化，Optimization for Modern Data Analysis，117页pdf

专知会员服务

63+阅读 · 2023年2月15日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【变分推断课件】Lectures on Variational Inference： Approximate Bayesian Inference in Machine Learning（附带pdf）

【变分推断课件】Lectures on Variational Inference： Approximate Bayesian Inference in Machine Learning（附带pdf）

专知会员服务

35+阅读 · 2019年11月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

sRNA伴侣蛋白Hfq与sRNA RsmY对藤黄绿菌素合成途径转录激活子PltR表达的转录后调控机制

国家自然科学基金

0+阅读 · 2014年12月31日

胰岛素抵抗和Foxo信号对肝纤维化的调控

国家自然科学基金

0+阅读 · 2014年12月31日

数字电路双逻辑综合关键技术研究

国家自然科学基金

0+阅读 · 2014年12月31日

染料包埋的核壳结构YAG:Ce3+/SiO2荧光粉的制备与发光性能研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于A-Train卫星观测的沙尘暴数字重构技术研究

国家自然科学基金

0+阅读 · 2013年12月31日

植物病毒PVY与宿主表观遗传调控的互作研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于多Agent技术的舰船综合电力系统重构研究

国家自然科学基金

4+阅读 · 2011年12月31日

基于多Agent的通信交互式动态影响图研究及应用

国家自然科学基金

2+阅读 · 2009年12月31日

拋物奇异积分算子有界性及其应用

国家自然科学基金

0+阅读 · 2009年12月31日

Plug-In混合动力汽车能量管理及动力系统优化问题研究

国家自然科学基金

1+阅读 · 2008年12月31日

On Preimage Approximation for Neural Networks

Arxiv

0+阅读 · 2023年5月5日

Experimental Validation of Safe MPC for Autonomous Driving in Uncertain Environments

Arxiv

0+阅读 · 2023年5月5日

Rethinking Population-assisted Off-policy Reinforcement Learning

Arxiv

0+阅读 · 2023年5月4日

Unified Model Learning for Various Neural Machine Translation

Arxiv

0+阅读 · 2023年5月4日

Towards Hierarchical Policy Learning for Conversational Recommendation with Hypergraph-based Reinforcement Learning

Arxiv

0+阅读 · 2023年5月4日

Joint Graph Learning and Model Fitting in Laplacian Regularized Stratified Models

Arxiv

0+阅读 · 2023年5月4日

Exploration Policies for On-the-Fly Controller Synthesis: A Reinforcement Learning Approach

Arxiv

0+阅读 · 2023年5月3日

Training Efficient Controllers via Analytic Policy Gradient

Arxiv

0+阅读 · 2023年5月2日

Privacy-Enhanced Living: A Local Differential Privacy Approach to Secure Smart Home Data

Arxiv

0+阅读 · 2023年5月2日

Q-value Path Decomposition for Deep Multiagent Reinforcement Learning

Q-value Path Decomposition for Deep Multiagent Reinforcement Learning

Arxiv

26+阅读 · 2020年2月10日

VIP会员

文章信息

相关主题

相关VIP内容

【干货书】数据分析优化，Optimization for Modern Data Analysis，117页pdf

【干货书】数据分析优化，Optimization for Modern Data Analysis，117页pdf

专知会员服务

63+阅读 · 2023年2月15日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【变分推断课件】Lectures on Variational Inference： Approximate Bayesian Inference in Machine Learning（附带pdf）

【变分推断课件】Lectures on Variational Inference： Approximate Bayesian Inference in Machine Learning（附带pdf）

专知会员服务

35+阅读 · 2019年11月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【伯克利博士论文】通过真实世界实践赋能机器人自主性

军用无人机集群技术尚未成熟——但潜力可期

人工智能安全治理白皮书（2025）

AgentOps综述：分类、挑战与未来方向

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

On Preimage Approximation for Neural Networks

Arxiv

0+阅读 · 2023年5月5日

Experimental Validation of Safe MPC for Autonomous Driving in Uncertain Environments

Arxiv

0+阅读 · 2023年5月5日

Rethinking Population-assisted Off-policy Reinforcement Learning

Arxiv

0+阅读 · 2023年5月4日

Unified Model Learning for Various Neural Machine Translation

Arxiv

0+阅读 · 2023年5月4日

Towards Hierarchical Policy Learning for Conversational Recommendation with Hypergraph-based Reinforcement Learning

Arxiv

0+阅读 · 2023年5月4日

Joint Graph Learning and Model Fitting in Laplacian Regularized Stratified Models

Arxiv

0+阅读 · 2023年5月4日

Exploration Policies for On-the-Fly Controller Synthesis: A Reinforcement Learning Approach

Arxiv

0+阅读 · 2023年5月3日

Training Efficient Controllers via Analytic Policy Gradient

Arxiv

0+阅读 · 2023年5月2日

Privacy-Enhanced Living: A Local Differential Privacy Approach to Secure Smart Home Data

Arxiv

0+阅读 · 2023年5月2日

Q-value Path Decomposition for Deep Multiagent Reinforcement Learning

Q-value Path Decomposition for Deep Multiagent Reinforcement Learning

Arxiv

26+阅读 · 2020年2月10日

相关基金

sRNA伴侣蛋白Hfq与sRNA RsmY对藤黄绿菌素合成途径转录激活子PltR表达的转录后调控机制

国家自然科学基金

0+阅读 · 2014年12月31日

胰岛素抵抗和Foxo信号对肝纤维化的调控

国家自然科学基金

0+阅读 · 2014年12月31日

数字电路双逻辑综合关键技术研究

国家自然科学基金

0+阅读 · 2014年12月31日

染料包埋的核壳结构YAG:Ce3+/SiO2荧光粉的制备与发光性能研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于A-Train卫星观测的沙尘暴数字重构技术研究

国家自然科学基金

0+阅读 · 2013年12月31日

植物病毒PVY与宿主表观遗传调控的互作研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于多Agent技术的舰船综合电力系统重构研究

国家自然科学基金

4+阅读 · 2011年12月31日

基于多Agent的通信交互式动态影响图研究及应用

国家自然科学基金

2+阅读 · 2009年12月31日

拋物奇异积分算子有界性及其应用

国家自然科学基金

0+阅读 · 2009年12月31日

Plug-In混合动力汽车能量管理及动力系统优化问题研究

国家自然科学基金

1+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员