This paper presents a data-driven optimal control policy for a micro flapping wing unmanned aerial vehicle. First, a set of optimal trajectories are computed off-line based on a geometric formulation of dynamics that captures the nonlinear coupling between the large angle flapping motion and the quasi-steady aerodynamics. Then, it is transformed into a feedback control system according to the framework of imitation learning. In particular, an additional constraint is incorporated through the learning process to enhance the stability properties of the resulting controlled dynamics. Compared with conventional methods, the proposed constrained imitation learning eliminates the need to generate additional optimal trajectories on-line, without sacrificing stability. As such, the computational efficiency is substantially improved. Furthermore, this establishes the first nonlinear control system that stabilizes the coupled longitudinal and lateral dynamics of flapping wing aerial vehicle without relying on averaging or linearization. These are illustrated by numerical examples for a simulated model inspired by Monarch butterflies.
翻译:本文展示了对微型扇翼无人驾驶飞行器的数据驱动最佳控制政策。 首先,一组最佳轨迹是根据几何式的动态组合来计算离线的最佳轨迹,这些动态组合将捕捉大角拍动运动和准固定空气动力学之间的非线性联动。 然后,根据模仿学习框架,将其转换为反馈控制系统。 特别是,通过学习过程增加了额外的制约因素, 以增强所产生控制动态的稳定性。 与常规方法相比, 拟议的限制模仿学习消除了在不牺牲稳定性的情况下在网上产生更多最佳轨迹的需要。 因此, 计算效率得到大幅提高。 此外, 这建立了第一个非线性控制系统, 稳定了拍动翼飞行器的双向和横向动态, 而不依赖平均或线性化。 由Monarch 蝴蝶所启发的模拟模型的数字示例说明了这些制约因素。