Observer-Feedback-Feedforward Controller Structures in Reinforcement Learning


Feedforward · Feedforward control · State feedback · Structure · Recurrent neural networks

April 20, 2023


Ruoqi Zhang, Per Mattsson, Torbjörn Wigren

From arXiv; IFAC World Congress 2023

The paper proposes structured neural networks for reinforcement learning based nonlinear adaptive control. The focus is on partially observable systems, with separate neural networks for the state and feedforward observer and for the state feedback and feedforward controller. The observer dynamics are modelled by recurrent neural networks, while a standard network is used for the controller. As discussed in the paper, this confines the observer dynamics to the recurrent neural network part and the state feedback to the feedback and feedforward network. Compared with using one single neural network, the structured approach reduces the computational complexity and gives the reinforcement learning based controller an understandable structure. Simulations show that the proposed structure has the additional, and main, advantage that training becomes significantly faster. Two ways to include feedforward structure are presented, one related to state feedback control and one related to classical feedforward control. The latter introduces further structure through a separate recurrent neural network that processes only the measured disturbance. When evaluated by simulation on a nonlinear cascaded double tank process, the method with the most structure performs best, with excellent feedforward disturbance rejection gains.
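The most structured variant described in the abstract can be illustrated with a minimal forward-pass sketch. This is not the authors' implementation: the class names, layer sizes, the Elman-style recurrence, the choice of observer inputs (measured output `y` and previous control `u`), and the reading of "standard network" as a static one-hidden-layer network are all assumptions made for illustration. Weights are random and no reinforcement learning training loop is shown.

```python
import math
import random


def make_matrix(rows, cols, rng, scale=0.1):
    """Return a rows x cols list-of-lists with small random weights."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Multiply a list-of-lists matrix by a list vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


class RecurrentObserver:
    """Hypothetical Elman-style cell: h_next = tanh(W_h h + W_x x)."""

    def __init__(self, input_size, hidden_size, rng):
        self.w_h = make_matrix(hidden_size, hidden_size, rng)
        self.w_x = make_matrix(hidden_size, input_size, rng)
        self.state = [0.0] * hidden_size

    def step(self, inputs):
        pre = [a + b for a, b in zip(matvec(self.w_h, self.state),
                                     matvec(self.w_x, inputs))]
        self.state = [math.tanh(p) for p in pre]
        return self.state


class FeedforwardController:
    """Static one-hidden-layer network producing one bounded control signal."""

    def __init__(self, input_size, hidden_size, rng):
        self.w1 = make_matrix(hidden_size, input_size, rng)
        self.w2 = make_matrix(1, hidden_size, rng)

    def act(self, features):
        hidden = [math.tanh(v) for v in matvec(self.w1, features)]
        return math.tanh(matvec(self.w2, hidden)[0])  # lies in (-1, 1)


class StructuredPolicy:
    """Observer networks hold all dynamics; the controller is memoryless."""

    def __init__(self, state_size=4, dist_size=2, seed=0):
        rng = random.Random(seed)
        # Assumed inputs: measured output y and previous control u.
        self.state_observer = RecurrentObserver(2, state_size, rng)
        # Separate recurrent network that sees only the measured disturbance w.
        self.dist_observer = RecurrentObserver(1, dist_size, rng)
        # Controller sees both estimates plus the reference r (assumed input).
        self.controller = FeedforwardController(state_size + dist_size + 1, 8, rng)
        self.prev_u = 0.0

    def act(self, y, w, r):
        x_hat = self.state_observer.step([y, self.prev_u])
        d_hat = self.dist_observer.step([w])
        self.prev_u = self.controller.act(x_hat + d_hat + [r])
        return self.prev_u


policy = StructuredPolicy()
controls = [policy.act(y=0.5, w=0.1 * k, r=1.0) for k in range(5)]
```

Because the disturbance network receives only `w`, its internal state is unaffected by the measured output, which is the separation the abstract attributes to the classical feedforward variant.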

