Optimisation of Structured Neural Controller Based on Continuous-Time Policy Gradient - 专知 (Zhuanzhi) Papers

Performer · Controller · Lyapunov · Analysis · Neural Networks

26 June 2022

Namhoon Cho, Hyo-Sang Shin

From arXiv. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible.

This study presents a policy optimisation framework for structured nonlinear control of continuous-time (deterministic) dynamic systems. The proposed approach prescribes a structure for the controller based on relevant scientific knowledge (such as Lyapunov stability theory or domain experience) while treating the tunable elements inside the given structure as the point of parametrisation with neural networks. To optimise a cost represented as a function of the neural network weights, the proposed approach utilises the continuous-time policy gradient method based on adjoint sensitivity analysis as a means for correct and performant computation of the cost gradient. This enables combining the stability, robustness, and physical interpretability of an analytically derived structure for the feedback controller with the representational flexibility and optimised performance provided by machine learning techniques. Such a hybrid paradigm for fixed-structure control synthesis is particularly useful for optimising adaptive nonlinear controllers to achieve improved performance in online operation, an area where existing theory dominates the design of the structure but offers little clear analytical understanding of how to tune the gains and the uncertainty model basis functions that govern the performance characteristics. Numerical experiments on aerospace applications illustrate the utility of the structured nonlinear controller optimisation framework.
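
To make the idea of an adjoint-based continuous-time policy gradient concrete, here is a minimal sketch and not the authors' implementation. It assumes a scalar plant x' = a·x + u with the fixed feedback structure u = -k·x and a quadratic running cost q·x² + r·u². A single scalar gain k stands in for the neural network weights. The constants, function names, and the RK4 integrator are all hypothetical choices made for illustration. The gradient dJ/dk is obtained by integrating the state, the adjoint λ (with λ(T) = 0), and the accumulated sensitivity backward in time.

```python
import math

# Hypothetical problem data (not from the paper): plant x' = A*x + u,
# running cost Q*x^2 + R*u^2, horizon T, N integration steps.
A, Q, R = 1.0, 1.0, 0.1
X0, T, N = 1.0, 2.0, 2000


def rk4_step(f, t, z, h):
    """One classical Runge-Kutta step for z' = f(t, z); z is a tuple of floats."""
    k1 = f(t, z)
    k2 = f(t + h / 2, tuple(zi + h / 2 * ki for zi, ki in zip(z, k1)))
    k3 = f(t + h / 2, tuple(zi + h / 2 * ki for zi, ki in zip(z, k2)))
    k4 = f(t + h, tuple(zi + h * ki for zi, ki in zip(z, k3)))
    return tuple(zi + h / 6 * (a + 2 * b + 2 * c + d)
                 for zi, a, b, c, d in zip(z, k1, k2, k3, k4))


def rollout(k):
    """Forward pass: integrate state and running cost, return (x(T), J(k))."""
    def f(t, z):
        x, _ = z
        u = -k * x                      # structured controller with tunable gain k
        return (A * x + u, Q * x * x + R * u * u)

    z, h = (X0, 0.0), T / N
    for i in range(N):
        z = rk4_step(f, i * h, z, h)
    return z


def adjoint_gradient(k):
    """Return (J(k), dJ/dk) using the continuous-time adjoint equations.

    With H = L + lam * f, the adjoint obeys lam' = -dH/dx, lam(T) = 0,
    and dJ/dk is the integral of dH/dk over [0, T].
    """
    x_T, cost = rollout(k)

    def f(t, z):
        x, lam, g = z
        dH_dx = 2 * (Q + R * k * k) * x + lam * (A - k)
        dH_dk = 2 * R * k * x * x - lam * x
        # g(t) is the integral of dH/dk from t to T, so g' = -dH/dk.
        return ((A - k) * x, -dH_dx, -dH_dk)

    z, h = (x_T, 0.0, 0.0), -T / N      # negative step: integrate from T back to 0
    for i in range(N):
        z = rk4_step(f, T + i * h, z, h)
    return cost, z[2]


if __name__ == "__main__":
    cost, grad = adjoint_gradient(2.0)
    print("J(k) =", cost, " dJ/dk =", grad)
```

A gradient computed this way can be handed to any first-order optimiser, for example the update k ← k − η·dJ/dk. With a neural network in place of k, the same backward pass would accumulate one sensitivity per weight.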
