Many approaches for scalable GPs have focused on using a subset of data as inducing points. Another promising approach is the Karhunen-Lo\`eve (KL) decomposition, in which the GP kernel is represented by a set of basis functions which are the eigenfunctions of the kernel operator. Such kernels have the potential to be very fast, and do not depend on the selection of a reduced set of inducing points. However KL decompositions lead to high dimensionality, and variable selection thus becomes paramount. This paper reports a new method of forward variable selection, enabled by the ordered nature of the basis functions in the KL expansion of the Bayesian Smoothing Spline ANOVA kernel (BSS-ANOVA), coupled with fast Gibbs sampling in a fully Bayesian approach. The new algorithm determines how high the orders of included terms should reach, balancing model fidelity with model complexity using $L^0$ penalties inherent in Bayesian and Akaike information criteria. The inference speed and accuracy makes the method especially useful for modeling dynamic systems, by modeling the derivative in a dynamic system as a static problem, then integrating the learned dynamics using a high-order scheme. The methods are demonstrated on two dynamic datasets: a `Susceptible, Infected, Recovered' (SIR) toy problem, with the transmissibility used as forcing function, along with the experimental `Cascaded Tanks' benchmark dataset. Comparisons on the static prediction of derivatives are made with a random forest (RF), a residual neural network (ResNet), and the Orthogonal Additive Kernel (OAK) inducing points scalable GP, while for the timeseries prediction comparisons are made with LSTM and GRU recurrent neural networks (RNNs).
翻译:用于可伸缩的 GP 的许多方法都侧重于使用一组数据作为引导点。 另一个有希望的方法是 Karhunen- Lo ⁇ éeve (KL) 变形, 其中GP内核由一组基础函数代表, 这些函数是内核操作员的树皮功能。 这种内核具有非常快的潜力, 并不取决于选择一组减少的引导点。 但是 KL 分解导致高度的维度, 变量选择因此变得至关重要。 本文报告了一种新的前向变变量选择方法, 由Bayesian Splain ANOVA 内核(BS- ANOVA) 扩展的基函数的定序性质所促成, 由一组基础函数代表着一组基础函数功能, 这些功能是Bayesian Splain-S 内核内核内核内核, 新的算法决定其中的顺序应该达到多多少, 平衡模型的精度与模型的复杂性, 使用Bayesal-nal- blance 和Akaik 信息标准。 更精确的速度和精确使该方法特别有助于模拟动态的模型, 。