This paper presents a deep reinforcement learning (DRL) algorithm for orientation estimation using inertial sensors combined with magnetometer. The Lyapunov method in control theory is employed to prove the convergence of orientation estimation errors. Based on the theoretical results, the estimator gains and a Lyapunov function are parametrized by deep neural networks and learned from samples. The DRL estimator is compared with three well-known orientation estimation methods on both numerical simulations and real datasets collected from commercially available sensors. The results show that the proposed algorithm is superior for arbitrary estimation initialization and can adapt to very large angular velocities for which other algorithms can be hardly applicable. To the best of our knowledge, this is the first DRL-based orientation estimation method with estimation error boundedness guarantee.
翻译:本文介绍了使用惯性传感器和磁强计进行定向估算的深度强化学习(DRL)算法。 Lyapunov 控制理论方法用于证明定向估算误差的趋同。根据理论结果,测算器的增益和Lyapunov 函数被深层神经网络分解,并从样本中学习。DRL 测算法与在数字模拟和从商业上可得到的传感器收集的真实数据集方面的三种众所周知的定向估算方法进行了比较。结果显示,拟议的算法优于任意估算初始化,能够适应其他算法几乎无法应用的非常大的角速度。据我们所知,这是第一种基于DRL的定向估算法,有估算误差的保证。