Adaptive methods are popular within the control literature due to the flexibility and forgiveness they offer in the area of modelling. Neural network adaptive control is favorable specifically for the powerful nature of the machine learning algorithm to approximate unknown functions and for the ability to relax certain constraints within traditional adaptive control. Deep neural networks are large framework networks with vastly superior approximation characteristics than their shallow counterparts. However, implementing a deep neural network can be difficult due to size specific complications such as vanishing/exploding gradients in training. In this paper, a neuro-adaptive controller is implemented featuring a deep neural network trained on a new weight update law that escapes the vanishing/exploding gradient problem by only incorporating the sign of the gradient. The type of controller designed is an adaptive dynamic inversion controller utilizing a modified state observer in a secondary estimation loop to train the network. The deep neural network learns the entire plant model on-line, creating a controller that is completely model free. The controller design is tested in simulation on a 2 link planar robot arm. The controller is able to learn the nonlinear plant quickly and displays good performance in the tracking control problem.
翻译:在控制文献中,适应方法很受欢迎,因为它们在建模领域提供了灵活性和宽恕性。神经网络适应性控制特别有利于机器学习算法的强大性质,以近似未知功能和在传统适应性控制中放松某些限制的能力。深神经网络是大型框架网络,其近似特征优于浅浅对等网络。然而,由于在培训中消失/释放梯度等大小特殊并发症,实施深神经网络可能很困难。在本文中,一个神经适应性控制器正在实施,其特点是一个深神经网络,经过关于新的重量更新法的培训,仅通过纳入梯度的标志来摆脱了消失/爆炸梯度问题。所设计的控制器类型是适应性动态转换控制器,使用经修改的州观察员进行二次估计循环来培训网络。深神经网络在网上学习整个工厂模型,创建一个完全无型的控制器。控制器的设计在2个连接板机器人臂的模拟中测试。控制器能够迅速学习非线工厂,并显示跟踪控制问题的良好表现。