保证地方稳定的优化反馈控制 (Neural Network Optimal Feedback Control with Guaranteed Local Stability)

Recent research shows that supervised learning can be an effective tool for designing optimal feedback controllers for high-dimensional nonlinear dynamic systems. But the behavior of neural network controllers is still not well understood. In particular, some neural networks with high test accuracy can fail to even locally stabilize the dynamic system. To address this challenge we propose several novel neural network architectures, which we show guarantee local asymptotic stability while retaining the approximation capacity to learn the optimal feedback policy semi-globally. The proposed architectures are compared against standard neural network feedback controllers through numerical simulations of two high-dimensional nonlinear optimal control problems: stabilization of an unstable Burgers-type partial differential equation, and altitude and course tracking for an unmanned aerial vehicle. The simulations demonstrate that standard neural networks can fail to stabilize the dynamics even when trained well, while the proposed architectures are always at least locally stabilizing. Moreover, the proposed controllers are found to be nearly optimal in testing.

翻译：最近的研究显示,有监督的学习可以成为设计高维非线性动态系统最佳反馈控制器的有效工具。但神经网络控制器的行为仍然没有得到很好的理解。特别是, 一些测试精度高的神经网络可能甚至无法在当地稳定动态系统。为了应对这一挑战,我们提议了一些新的神经网络结构,我们表明这保证了当地无症状稳定,同时保留了近似能力,以学习最佳反馈政策半全球的最佳反馈政策。通过两个高维非线性非线性最佳控制问题的数字模拟,将拟议的神经网络反馈控制器与标准神经网络反馈控制器进行比较:稳定不稳定的布尔格斯型部分差异方程式,以及无人驾驶飞行器的高度和航道跟踪。模拟表明,标准神经网络即使在经过良好培训后仍可能无法稳定动态,而拟议的结构也总是至少稳定在本地。此外,在测试中发现拟议的控制器几乎是最佳的。

相关内容

Neural Networks

关注 1651

神经网络（Neural Networks）是世界上三个最古老的神经建模学会的档案期刊:国际神经网络学会(INNS)、欧洲神经网络学会(ENNS)和日本神经网络学会(JNNS)。神经网络提供了一个论坛，以发展和培育一个国际社会的学者和实践者感兴趣的所有方面的神经网络和相关方法的计算智能。神经网络欢迎高质量论文的提交，有助于全面的神经网络研究，从行为和大脑建模，学习算法，通过数学和计算分析，系统的工程和技术应用，大量使用神经网络的概念和技术。这一独特而广泛的范围促进了生物和技术研究之间的思想交流，并有助于促进对生物启发的计算智能感兴趣的跨学科社区的发展。因此，神经网络编委会代表的专家领域包括心理学，神经生物学，计算机科学，工程，数学，物理。该杂志发表文章、信件和评论以及给编辑的信件、社论、时事、软件调查和专利信息。文章发表在五个部分之一:认知科学，神经科学，学习系统，数学和计算分析、工程和应用。官网地址：http://dblp.uni-trier.de/db/journals/nn/

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日