Estimating and reacting to external disturbances is of fundamental importance for robust control of quadrotors. Existing estimators typically require significant tuning or training with a large amount of data, including the ground truth, to achieve satisfactory performance. This paper proposes a data-efficient differentiable moving horizon estimation (DMHE) algorithm that can automatically tune the MHE parameters online and also adapt to different scenarios. We achieve this by deriving the analytical gradient of the estimated trajectory from MHE with respect to the tuning parameters, enabling end-to-end learning for auto-tuning. Most interestingly, we show that the gradient can be calculated efficiently from a Kalman filter in a recursive form. Moreover, we develop a model-based policy gradient algorithm to learn the parameters directly from the trajectory tracking errors without the need for the ground truth. The proposed DMHE can be further embedded as a layer with other neural networks for joint optimization. Finally, we demonstrate the effectiveness of the proposed method via both simulation and experiments on quadrotors, where challenging scenarios such as sudden payload change and flying in downwash are examined.
翻译:估计和应对外部扰动对于对振动器的稳健控制至关重要。现有的测算器通常需要大量的调整或培训,需要大量的数据,包括地面真相,才能取得令人满意的性能。本文件建议采用数据效率不同的移动地平线估计算法,可以在网上自动调控MHE参数,并适应不同的假设情况。我们通过从MAH得出估计轨迹在调试参数方面的分析梯度,为自动调控提供端到端的学习。最有意思的是,我们显示梯度可以从卡尔曼过滤器中以递归式的方式有效计算。此外,我们开发了基于模型的政策梯度算法,直接从轨迹跟踪错误中学习参数,而不需要地面真相。拟议的DHE可进一步作为一层,与其他神经网络一起进行联合优化。最后,我们通过模拟和对等离子体进行实验,展示了拟议方法的有效性,在其中对诸如突然有效载荷变化和在下层中飞行等具有挑战性的设想进行了研究。