基于课程的强化学习,防止热层叠加的网格地形控制器 (Curriculum Based Reinforcement Learning of Grid Topology Controllers to Prevent Thermal Cascading) - 专知论文

会员服务 ·

0

学成 · Integration · 控制器 · 级联 · 强化学习 ·

2021 年 12 月 18 日

Curriculum Based Reinforcement Learning of Grid Topology Controllers to Prevent Thermal Cascading

翻译：基于课程的强化学习,防止热层叠加的网格地形控制器

Amarsagar Reddy Ramapuram Matavalam,Kishan Prudhvi Guddanti,Yang Weng,Venkataramana Ajjarapu

This paper describes how domain knowledge of power system operators can be integrated into reinforcement learning (RL) frameworks to effectively learn agents that control the grid's topology to prevent thermal cascading. Typical RL-based topology controllers fail to perform well due to the large search/optimization space. Here, we propose an actor-critic-based agent to address the problem's combinatorial nature and train the agent using the RL environment developed by RTE, the French TSO. To address the challenge of the large optimization space, a curriculum-based approach with reward tuning is incorporated into the training procedure by modifying the environment using network physics for enhanced agent learning. Further, a parallel training approach on multiple scenarios is employed to avoid biasing the agent to a few scenarios and make it robust to the natural variability in grid operations. Without these modifications to the training procedure, the RL agent failed for most test scenarios, illustrating the importance of properly integrating domain knowledge of physical systems for real-world RL learning. The agent was tested by RTE for the 2019 learning to run the power network challenge and was awarded the 2nd place in accuracy and 1st place in speed. The developed code is open-sourced for public use.

翻译：本文描述了如何将电力系统操作员的域知识纳入强化学习(RL)框架,以有效学习控制电网地形的物剂,防止热层层升高。典型的 RL 地形控制员由于搜索/优化空间巨大而不能很好地运行。在这里,我们提议一个基于演员的电源系统操作员,以解决该问题的组合性质,并利用法国电信组织所开发的RL环境对代理商进行培训。为了应对大优化空间的挑战,通过利用网络物理来改变环境,加强代理商学习,将奖励调适的课程法纳入培训程序。此外,还采用多种情景平行的培训办法,避免将该物剂偏向少数场景,使其适应电网操作的自然变异性。如果不对培训程序进行这些修改,RL 代理商在多数测试情景中都未能成功,说明适当整合物理系统的域知识对于现实世界RL学习的重要性。该物剂在2019年学习电源网络挑战时,通过RTE测试后被环境调整,并被授予公开使用2号。

0

相关内容

【DeepMind】基于模型的强化学习，174页ppt，Model-Based Reinforcement Learning

【DeepMind】基于模型的强化学习，174页ppt，Model-Based Reinforcement Learning

专知会员服务

89+阅读 · 2021年1月12日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

53+阅读 · 2020年9月7日

【牛津大学博士论文】基于强化学习的无地图机器人导航，Reinforcement Learning Based MRN

【牛津大学博士论文】基于强化学习的无地图机器人导航，Reinforcement Learning Based MRN

专知会员服务

121+阅读 · 2020年5月18日

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

专知会员服务

84+阅读 · 2020年2月18日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

【DeepMind-Nando de Freitas】强化学习教程，102页ppt，Reinforcement Learning

【DeepMind-Nando de Freitas】强化学习教程，102页ppt，Reinforcement Learning

专知会员服务

84+阅读 · 2019年11月15日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

量化金融强化学习论文集合

量化金融强化学习论文集合

专知

14+阅读 · 2019年12月18日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

Ray RLlib: Scalable 降龙十八掌

Ray RLlib: Scalable 降龙十八掌

CreateAMind

9+阅读 · 2018年12月28日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

已删除

将门创投

4+阅读 · 2018年12月10日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

Networked Online Learning for Control of Safety-Critical Resource-Constrained Systems based on Gaussian Processes

Networked Online Learning for Control of Safety-Critical Resource-Constrained Systems based on Gaussian Processes

Arxiv

0+阅读 · 2022年2月23日

A Benchmark Comparison of Learned Control Policies for Agile Quadrotor Flight

Arxiv

0+阅读 · 2022年2月22日

Event-Triggered Tracking Control of Networked Multi-Agent Systems

Arxiv

0+阅读 · 2022年2月22日

Learning Causal Overhypotheses through Exploration in Children and Computational Models

Learning Causal Overhypotheses through Exploration in Children and Computational Models

Arxiv

0+阅读 · 2022年2月21日

CCGL: Contrastive Cascade Graph Learning

Arxiv

0+阅读 · 2022年2月20日

RL4RS: A Real-World Benchmark for Reinforcement Learning based Recommender System

Arxiv

1+阅读 · 2022年2月20日

Nearest-Neighbor-based Collision Avoidance for Quadrotors via Reinforcement Learning

Nearest-Neighbor-based Collision Avoidance for Quadrotors via Reinforcement Learning

Arxiv

0+阅读 · 2022年2月18日

Soft Actor-Critic Deep Reinforcement Learning for Fault Tolerant Flight Control

Arxiv

0+阅读 · 2022年2月16日

CIRL: Controllable Imitative Reinforcement Learning for Vision-based Self-driving

CIRL: Controllable Imitative Reinforcement Learning for Vision-based Self-driving

Arxiv

8+阅读 · 2018年7月10日

Neural Network Based Reinforcement Learning for Audio-Visual Gaze Control in Human-Robot Interaction

Arxiv

6+阅读 · 2018年4月23日

VIP会员

文章信息

相关主题

相关VIP内容

【DeepMind】基于模型的强化学习，174页ppt，Model-Based Reinforcement Learning

【DeepMind】基于模型的强化学习，174页ppt，Model-Based Reinforcement Learning

专知会员服务

89+阅读 · 2021年1月12日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

53+阅读 · 2020年9月7日

【牛津大学博士论文】基于强化学习的无地图机器人导航，Reinforcement Learning Based MRN

【牛津大学博士论文】基于强化学习的无地图机器人导航，Reinforcement Learning Based MRN

专知会员服务

121+阅读 · 2020年5月18日

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

专知会员服务

84+阅读 · 2020年2月18日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

【DeepMind-Nando de Freitas】强化学习教程，102页ppt，Reinforcement Learning

【DeepMind-Nando de Freitas】强化学习教程，102页ppt，Reinforcement Learning

专知会员服务

84+阅读 · 2019年11月15日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【牛津博士论文】零样本强化学习综述

《美军条令：陆军指挥官与规划人员地理空间指南》60页

战术边缘指挥控制：防务面临的核心挑战

迈向开放世界检测：综述

相关资讯

量化金融强化学习论文集合

量化金融强化学习论文集合

专知

14+阅读 · 2019年12月18日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

Ray RLlib: Scalable 降龙十八掌

Ray RLlib: Scalable 降龙十八掌

CreateAMind

9+阅读 · 2018年12月28日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

已删除

将门创投

4+阅读 · 2018年12月10日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Networked Online Learning for Control of Safety-Critical Resource-Constrained Systems based on Gaussian Processes

Networked Online Learning for Control of Safety-Critical Resource-Constrained Systems based on Gaussian Processes

Arxiv

0+阅读 · 2022年2月23日

A Benchmark Comparison of Learned Control Policies for Agile Quadrotor Flight

Arxiv

0+阅读 · 2022年2月22日

Event-Triggered Tracking Control of Networked Multi-Agent Systems

Arxiv

0+阅读 · 2022年2月22日

Learning Causal Overhypotheses through Exploration in Children and Computational Models

Learning Causal Overhypotheses through Exploration in Children and Computational Models

Arxiv

0+阅读 · 2022年2月21日

CCGL: Contrastive Cascade Graph Learning

Arxiv

0+阅读 · 2022年2月20日

RL4RS: A Real-World Benchmark for Reinforcement Learning based Recommender System

Arxiv

1+阅读 · 2022年2月20日

Nearest-Neighbor-based Collision Avoidance for Quadrotors via Reinforcement Learning

Nearest-Neighbor-based Collision Avoidance for Quadrotors via Reinforcement Learning

Arxiv

0+阅读 · 2022年2月18日

Soft Actor-Critic Deep Reinforcement Learning for Fault Tolerant Flight Control

Arxiv

0+阅读 · 2022年2月16日

CIRL: Controllable Imitative Reinforcement Learning for Vision-based Self-driving

CIRL: Controllable Imitative Reinforcement Learning for Vision-based Self-driving

Arxiv

8+阅读 · 2018年7月10日

Neural Network Based Reinforcement Learning for Audio-Visual Gaze Control in Human-Robot Interaction

Arxiv

6+阅读 · 2018年4月23日

微信扫码咨询专知VIP会员