电力控制强化学习工具分布组合 (Distributed Ensembles of Reinforcement Learning Agents for Electricity Control) - 专知论文

会员服务 ·

0

Agent · Learning · 集成 · 控制器 · 强化学习 ·

2022 年 8 月 30 日

Distributed Ensembles of Reinforcement Learning Agents for Electricity Control

翻译：电力控制强化学习工具分布组合

Pierrick Pochelu,Serge G. Petiton,Bruno Conche

Deep Reinforcement Learning (or just "RL") is gaining popularity for industrial and research applications. However, it still suffers from some key limits slowing down its widespread adoption. Its performance is sensitive to initial conditions and non-determinism. To unlock those challenges, we propose a procedure for building ensembles of RL agents to efficiently build better local decisions toward long-term cumulated rewards. For the first time, hundreds of experiments have been done to compare different ensemble constructions procedures in 2 electricity control environments. We discovered an ensemble of 4 agents improves accumulated rewards by 46%, improves reproducibility by a factor of 3.6, and can naturally and efficiently train and predict in parallel on GPUs and CPUs.

翻译：深入强化学习(或仅仅是“RL”)在工业和研究应用方面越来越受欢迎。然而,它仍然受到一些关键限制,延缓其广泛采用的速度。它的性能对初始条件和非确定性十分敏感。为了解决这些挑战,我们建议建立一个程序,以建立RL代理机构群,高效地为长期累积的奖励制定更好的地方决策。第一次进行了数百次实验,比较了2个电力控制环境中的不同组合建筑程序。我们发现4个代理机构共增加了46%的累积收益,提高了3.6倍的可复制性,并且可以自然和有效地同时培训和预测GPU和CPU。

0

相关内容

Agent

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

专知会员服务

23+阅读 · 2022年3月19日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【新书】人工智能Python代码，227页pdf，Python code for Artificial Intelligence: Foundations of Computational Agents

【新书】人工智能Python代码，227页pdf，Python code for Artificial Intelligence: Foundations of Computational Agents

专知会员服务

102+阅读 · 2020年6月21日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

TRIM33在表观遗传水平上对TGF-β信号通路的调控

国家自然科学基金

0+阅读 · 2014年12月31日

智能SB-3CT-NPs 靶向抑制TBI后枢纽蛋白MMP-9的脑保护作用及机制

国家自然科学基金

0+阅读 · 2014年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

MADS-RIN下游基因的鉴定及功能分析

国家自然科学基金

0+阅读 · 2012年12月31日

特定lincRNA在体细胞重编程中的功能与机制

国家自然科学基金

0+阅读 · 2012年12月31日

IS6基因突变导致青少年特发性脊柱侧凸的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

海洋上非球形沙尘气溶胶散射和辐射特性的理论模拟及遥感应用

国家自然科学基金

0+阅读 · 2012年12月31日

琼玉膏延缓衰老的靶蛋白及代谢组学研究

国家自然科学基金

0+阅读 · 2011年12月31日

中红外新波段强场物理前沿开拓

国家自然科学基金

0+阅读 · 2011年12月31日

GmMADS1在大豆花发育中的调控机理研究

国家自然科学基金

0+阅读 · 2008年12月31日

Packed-Ensembles for Efficient Uncertainty Estimation

Arxiv

0+阅读 · 2022年10月17日

Data-Efficient Pipeline for Offline Reinforcement Learning with Limited Data

Arxiv

0+阅读 · 2022年10月16日

Distributional Reward Estimation for Effective Multi-Agent Deep Reinforcement Learning

Arxiv

0+阅读 · 2022年10月14日

Towards Multi-Agent Reinforcement Learning driven Over-The-Counter Market Simulations

Towards Multi-Agent Reinforcement Learning driven Over-The-Counter Market Simulations

Arxiv

0+阅读 · 2022年10月13日

Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics Belief

Arxiv

0+阅读 · 2022年10月13日

Deep Multiagent Reinforcement Learning: Challenges and Directions

Arxiv

0+阅读 · 2022年10月12日

Transformers are Meta-Reinforcement Learners

Arxiv

15+阅读 · 2022年6月14日

Prompt Distribution Learning

Arxiv

14+阅读 · 2022年5月6日

A Survey on Deep Reinforcement Learning for Data Processing and Analytics

Arxiv

24+阅读 · 2022年2月4日

A Survey of Reinforcement Learning Techniques: Strategies, Recent Development, and Future Directions

A Survey of Reinforcement Learning Techniques: Strategies, Recent Development, and Future Directions

Arxiv

79+阅读 · 2020年1月19日

VIP会员

文章信息

相关主题

相关VIP内容

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

专知会员服务

23+阅读 · 2022年3月19日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【新书】人工智能Python代码，227页pdf，Python code for Artificial Intelligence: Foundations of Computational Agents

【新书】人工智能Python代码，227页pdf，Python code for Artificial Intelligence: Foundations of Computational Agents

专知会员服务

102+阅读 · 2020年6月21日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《小型无人机系统侦测追踪技术：声学、计算机视觉与深度学习融合方案》最新98页

《"牧羊人网格"拦截策略：实现无人机集群可靠拦截的新范式》

光纤无人机：反无人机系统的重大挑战

《作战建模与仿真实证研究》

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Packed-Ensembles for Efficient Uncertainty Estimation

Arxiv

0+阅读 · 2022年10月17日

Data-Efficient Pipeline for Offline Reinforcement Learning with Limited Data

Arxiv

0+阅读 · 2022年10月16日

Distributional Reward Estimation for Effective Multi-Agent Deep Reinforcement Learning

Arxiv

0+阅读 · 2022年10月14日

Towards Multi-Agent Reinforcement Learning driven Over-The-Counter Market Simulations

Towards Multi-Agent Reinforcement Learning driven Over-The-Counter Market Simulations

Arxiv

0+阅读 · 2022年10月13日

Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics Belief

Arxiv

0+阅读 · 2022年10月13日

Deep Multiagent Reinforcement Learning: Challenges and Directions

Arxiv

0+阅读 · 2022年10月12日

Transformers are Meta-Reinforcement Learners

Arxiv

15+阅读 · 2022年6月14日

Prompt Distribution Learning

Arxiv

14+阅读 · 2022年5月6日

A Survey on Deep Reinforcement Learning for Data Processing and Analytics

Arxiv

24+阅读 · 2022年2月4日

A Survey of Reinforcement Learning Techniques: Strategies, Recent Development, and Future Directions

A Survey of Reinforcement Learning Techniques: Strategies, Recent Development, and Future Directions

Arxiv

79+阅读 · 2020年1月19日

相关基金

TRIM33在表观遗传水平上对TGF-β信号通路的调控

国家自然科学基金

0+阅读 · 2014年12月31日

智能SB-3CT-NPs 靶向抑制TBI后枢纽蛋白MMP-9的脑保护作用及机制

国家自然科学基金

0+阅读 · 2014年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

MADS-RIN下游基因的鉴定及功能分析

国家自然科学基金

0+阅读 · 2012年12月31日

特定lincRNA在体细胞重编程中的功能与机制

国家自然科学基金

0+阅读 · 2012年12月31日

IS6基因突变导致青少年特发性脊柱侧凸的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

海洋上非球形沙尘气溶胶散射和辐射特性的理论模拟及遥感应用

国家自然科学基金

0+阅读 · 2012年12月31日

琼玉膏延缓衰老的靶蛋白及代谢组学研究

国家自然科学基金

0+阅读 · 2011年12月31日

中红外新波段强场物理前沿开拓

国家自然科学基金

0+阅读 · 2011年12月31日

GmMADS1在大豆花发育中的调控机理研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员