Approximate Bayesian inference methods that scale to very large datasets are crucial in leveraging probabilistic models for real-world time series. Sparse Markovian Gaussian processes combine the use of inducing variables with efficient Kalman filter-like recursions, resulting in algorithms whose computational and memory requirements scale linearly in the number of inducing points, whilst also enabling parallel parameter updates and stochastic optimisation. Under this paradigm, we derive a general site-based approach to approximate inference, whereby we approximate the non-Gaussian likelihood with local Gaussian terms, called sites. Our approach results in a suite of novel sparse extensions to algorithms from both the machine learning and signal processing literature, including variational inference, expectation propagation, and the classical nonlinear Kalman smoothers. The derived methods are suited to large time series, and we also demonstrate their applicability to spatio-temporal data, where the model has separate inducing points in both time and space.
翻译:将贝叶斯的推论方法推到非常大的数据集中对于利用真实世界时间序列的概率模型至关重要。 Sparse Markovian Gaussian 进程将诱变变量的使用与高效的卡尔曼过滤式循环相结合,导致算法的计算和内存要求在引论点数量中线性规模,同时允许平行的参数更新和随机优化。在这个模式下,我们得出一个基于网站的近似推论方法,用本地的高斯语术语,即所谓的站点来比较非高西语的可能性。我们的方法的结果是,机器学习和信号处理文献中的算法有一套新颖的稀有扩展,包括变异推断、预期传播和古典非线性卡尔曼光滑剂。衍生的方法适合大型的时间序列,我们也展示了它们对磁波-时空数据的适用性,模型在时间和空间上都有不同的引论点。