Conditional Neural Processes (CNP; Garnelo et al., 2018) are an attractive family of meta-learning models which produce well-calibrated predictions, enable fast inference at test time, and are trainable via a simple maximum likelihood procedure. A limitation of CNPs is their inability to model dependencies in the outputs. This significantly hurts predictive performance and renders it impossible to draw coherent function samples, which limits the applicability of CNPs in down-stream applications and decision making. NeuralProcesses (NPs; Garnelo et al., 2018) attempt to alleviate this issue by using latent variables, rely-ing on these to model output dependencies, but introduces difficulties stemming from approximate inference. One recent alternative (Bruinsma et al.,2021), which we refer to as the FullConvGNP, models dependencies in the predictions while still being trainable via exact maximum-likelihood.Unfortunately, the FullConvGNP relies on expensive 2D-dimensional convolutions, which limit its applicability to only one-dimensional data.In this work, we present an alternative way to model output dependencies which also lends it-self maximum likelihood training but, unlike the FullConvGNP, can be scaled to two- and three-dimensional data. The proposed models exhibit good performance in synthetic experiments
翻译:有条件神经过程(CNP;Garnelo等人,2018年)是一个具有吸引力的元学习模型组合,这些模型可以产生经充分校准的预测,在测试时能够快速推断,并且可以通过简单的最大可能性程序加以训练。对CNP的限制是它们无法在产出中模拟依赖性。这极大地伤害了预测性性能,使其无法得出一致的功能样本,从而限制了CNP在下游应用和决策中的适用性。神经工程(NPs;Garnelo等人,2018年)试图通过利用潜在变量来缓解这一问题,利用这些变量来模拟产出依赖性,但从大致的推断中产生困难。最近的一个替代品(Bruinsma等人,2021年)是它们无法在产出上建模,我们称之为完全的国产总值模型,而预测中的模型则仍然可以通过非常相似的模型来进行训练。毫无疑问,全Convultation公司依赖昂贵的二维演算,这限制了其应用性仅用于一维数据,而只是模型的模型,但从模型看,我们提出的三个规模上的数据也不同。