As a notable machine learning paradigm, reinforcement learning has progressed rapidly in recent years. Compared with reinforcement learning methods that require a given system model, architectures built on an unknown model generally offer broader applicability. In this work, a new reinforcement learning architecture, termed the neural network iterative linear quadratic regulator (NNiLQR), is developed that requires no prior knowledge of the system model. Relying solely on measurement data, the method provides a non-parametric routine that establishes the optimal policy, without system modeling, through iterative refinement of a neural network. Notably, this approach outperforms the classical iterative linear quadratic regulator (iLQR) in terms of the given objective function, owing to its additional exploration mechanism. Results on two illustrative examples demonstrate these merits of the NNiLQR method.