Electric vehicles (EVs) play a critical role in autonomous mobility-on-demand (AMoD) systems, but their unique charging patterns increase model uncertainty in AMoD systems (e.g., in the state transition probability). Since there usually exists a mismatch between the training and test (true) environments, incorporating model uncertainty into system design is of critical importance in real-world applications. However, model uncertainty has not yet been explicitly considered in EV AMoD system rebalancing in the existing literature, and accounting for it remains an urgent and challenging task. In this work, we design a robust and constrained multi-agent reinforcement learning (MARL) framework with transition kernel uncertainty for the EV rebalancing and charging problem. We then propose a robust and constrained MARL algorithm (ROCOMA) that trains a robust EV rebalancing policy to balance the supply-demand ratio and the charging utilization rate across the whole city under state transition uncertainty. Experiments show that ROCOMA learns an effective and robust rebalancing policy. It outperforms non-robust MARL methods in the presence of model uncertainty, increasing system fairness by 19.6% and decreasing rebalancing costs by 75.8%.
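To make the notion of transition kernel uncertainty concrete, the following is a minimal illustrative sketch (not the paper's ROCOMA algorithm) of robust value iteration on a toy single-agent MDP, where each backup takes the worst case over a small finite set of candidate transition kernels. The toy MDP, its dimensions, and all function names here are hypothetical.

```python
import numpy as np

def robust_value_iteration(kernels, rewards, gamma=0.9, iters=200):
    """Robust value iteration over a finite uncertainty set of kernels.

    kernels: array (K, S, A, S) of K candidate transition kernels.
    rewards: array (S, A) of immediate rewards.
    Returns the robust (worst-case) value function V of shape (S,).
    """
    K, S, A, _ = kernels.shape
    V = np.zeros(S)
    for _ in range(iters):
        # Q-values under each candidate kernel: shape (K, S, A)
        Q = rewards[None, :, :] + gamma * kernels @ V
        # Worst case over the uncertainty set, then greedy over actions
        V = Q.min(axis=0).max(axis=1)
    return V

# Toy example: 2 states, 2 actions, 2 candidate transition kernels.
rng = np.random.default_rng(0)
kernels = rng.random((2, 2, 2, 2))
kernels /= kernels.sum(axis=-1, keepdims=True)  # rows sum to 1 (distributions)
rewards = np.array([[1.0, 0.0], [0.0, 1.0]])
V = robust_value_iteration(kernels, rewards)
```

A policy that is greedy with respect to this worst-case value function hedges against the training/test kernel mismatch described above; the paper's setting additionally involves multiple agents and constraints, which this sketch omits.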