Implicit processes (IPs) are a generalization of Gaussian processes (GPs). IPs may lack a closed-form expression but are easy to sample from. Examples include, among others, Bayesian neural networks or neural samplers. IPs can be used as priors over functions, resulting in flexible models with well-calibrated prediction uncertainty estimates. Methods based on IPs usually carry out function-space approximate inference, which overcomes some of the difficulties of parameter-space approximate inference. Nevertheless, the approximations employed often limit the expressiveness of the final model, resulting, e.g., in a Gaussian predictive distribution, which can be restrictive. We propose here a multi-layer generalization of IPs called the Deep Variational Implicit process (DVIP). This generalization is similar to that of deep GPs over GPs, but it is more flexible due to the use of IPs as the prior distribution over the latent functions. We describe a scalable variational inference algorithm for training DVIP and show that it outperforms previous IP-based methods and also deep GPs. We support these claims via extensive regression and classification experiments. We also evaluate DVIP on large datasets with up to several million data instances to illustrate its good scalability and performance.
翻译:隐含过程(IPs)是高斯进程(GPs)的概括性。IPs可能缺乏封闭式表达形式,但很容易从中取样。例子包括巴伊西亚神经网络或神经采样器等。IPs可以用作功能的先期用途,从而形成具有精确的预测不确定性估计数的灵活模型。基于IPs的方法通常具有功能-空间近似推理,克服了参数-空间近似推理的某些困难。然而,所使用的近似往往限制了最终模型的清晰度,例如,在高斯预测分布中,结果可能是限制性的。我们在此建议对IPs进行多层次的概括化,称为深变隐性隐性进程(DVIP)。这种概括性类似于深度GPs相对于GPs(GPs)的深度深度推导算,但由于IPs作为潜在函数的先前分布,因此更灵活。我们描述用于培训DVIP的可变性推算法,并显示它超越了前几颗的磁性分析。我们还用大量的数据模型演示了前几百万次的回归性。