Neural Stochastic Differential Equations (NSDEs) model the drift and diffusion functions of a stochastic process as neural networks. While NSDEs are known to predict time series accurately, their uncertainty quantification properties remain unexplored. Currently, there are no approximate inference methods, which allow flexible models and provide at the same time high quality uncertainty estimates at a reasonable computational cost. Existing SDE inference methods either make overly restrictive assumptions, e.g. linearity, or rely on Monte Carlo integration that requires many samples at prediction time for reliable uncertainty quantification. However, many real-world safety critical applications necessitate highly expressive models that can quantify prediction uncertainty at affordable computational cost. We introduce a variational inference scheme that approximates the posterior distribution of a NSDE governing a latent state space by a deterministic chain of operations. We approximate the intractable data fit term of the evidence lower bound by a novel bidimensional moment matching algorithm: vertical along the neural net layers and horizontal along the time direction. Our algorithm achieves uncertainty calibration scores that can be matched by its sampling-based counterparts only at significantly higher computation cost, while providing as accurate forecasts on system dynamics.
翻译:神经物理差异(NSDEs)模型是神经网络中随机过程的漂移和扩散功能的模型。虽然已知NSDEs可以准确预测时间序列,但其不确定的量化特性尚未探索。目前,没有近似推论方法,允许采用灵活模型,同时以合理的计算成本提供高质量的不确定性估计。现有的SDE推论方法要么作出过于限制性的假设,例如线性,要么依靠蒙特卡洛集成,在预测时需要许多样本才能可靠地量化不确定性。然而,许多现实世界安全关键应用需要高度清晰的模型,能够以可承受的计算成本量化预测不确定性。我们采用了一种变式推论方法,可以近似国家空间局外分布,通过一个确定性操作链管理潜在状态。我们比较了与证据相对应的棘手数据术语,该术语由新颖的双维时相匹配算法所约束:沿着神经网层垂直和横向方向垂直。我们的算法实现了不确定性校准分数,只有以高得多的成本进行精确的计算,同时提供精确的预测。