We consider the problem of designing a data-driven nonlinear state estimation (DANSE) method that uses (noisy) nonlinear measurements of a process whose underlying state transition model (STM) is unknown. Such a process is referred to as a model-free process. A recurrent neural network (RNN) provides parameters of a Gaussian prior that characterize the state of the model-free process, using all previous measurements at a given time point. In the case of DANSE, the measurement system was linear, leading to a closed-form solution for the state posterior. However, the presence of a nonlinear measurement system renders a closed-form solution infeasible. Instead, the second-order statistics of the state posterior are computed using the nonlinear measurements observed at the time point. We address the nonlinear measurements using a reparameterization trick-based particle sampling approach, and estimate the second-order statistics of the state posterior. The proposed method is referred to as particle-based DANSE (pDANSE). The RNN of pDANSE uses sequential measurements efficiently and avoids the use of computationally intensive sequential Monte-Carlo (SMC) and/or ancestral sampling. We describe the semi-supervised learning method for pDANSE, which transitions to unsupervised learning in the absence of labeled data. Using a stochastic Lorenz-$63$ system as a benchmark process, we experimentally demonstrate the state estimation performance for four nonlinear measurement systems. We explore cubic nonlinearity and a camera-model nonlinearity where unsupervised learning is used; then we explore half-wave rectification nonlinearity and Cartesian-to-spherical nonlinearity where semi-supervised learning is used. The performance of state estimation is shown to be competitive vis-\`a-vis particle filters that have complete knowledge of the STM of the Lorenz-$63$ system.
翻译:本文研究设计一种数据驱动的非线性状态估计(DANSE)方法,该方法利用(含噪声的)非线性测量数据对底层状态转移模型(STM)未知的过程进行估计。此类过程被称为无模型过程。循环神经网络(RNN)利用给定时间点之前的所有测量数据,生成描述无模型过程状态的高斯先验参数。在DANSE中,测量系统为线性,从而可得到状态后验的闭式解。然而,非线性测量系统的存在使得闭式解无法实现。为此,我们利用该时间点观测到的非线性测量数据计算状态后验的二阶统计量。针对非线性测量问题,我们采用基于重参数化技巧的粒子采样方法,并估计状态后验的二阶统计量。所提出的方法称为基于粒子的DANSE(pDANSE)。pDANSE中的RNN能高效利用序列测量数据,并避免了计算密集的序列蒙特卡洛(SMC)和/或祖先采样。我们描述了pDANSE的半监督学习方法,该方法在缺乏标注数据时可过渡至无监督学习。以随机Lorenz-63系统作为基准过程,我们通过实验展示了四种非线性测量系统下的状态估计性能。我们探索了立方非线性与相机模型非线性(采用无监督学习),以及半波整流非线性和笛卡尔坐标到球坐标非线性(采用半监督学习)。实验结果表明,该状态估计性能与完全知晓Lorenz-63系统STM的粒子滤波器相比具有竞争力。