Neural processes (NPs) aim to stochastically complete unseen data points based on a given context dataset. NPs essentially leverage a given dataset as a context representation to derive a suitable identifier for a novel task. To improve the prediction accuracy, many variants of NPs have investigated context embedding approaches that generally design novel network architectures and aggregation functions satisfying permutation invariant. In this work, we propose a stochastic attention mechanism for NPs to capture appropriate context information. From the perspective of information theory, we demonstrate that the proposed method encourages context embedding to be differentiated from a target dataset, allowing NPs to consider features in a target dataset and context embedding independently. We observe that the proposed method can appropriately capture context embedding even under noisy data sets and restricted task distributions, where typical NPs suffer from a lack of context embeddings. We empirically show that our approach substantially outperforms conventional NPs in various domains through 1D regression, predator-prey model, and image completion. Moreover, the proposed method is also validated by MovieLens-10k dataset, a real-world problem.
翻译:神经过程(NPs) 旨在根据特定背景数据集对未知数据点进行彻底分析。 NPs 基本上将给定数据集作为背景代表,为一项新任务获取合适的识别符号。为了提高预测准确性,许多NP变种对总体设计新网络架构和汇总功能满足变异性功能的背景嵌入方法进行了调查。在这项工作中,我们建议为NPs建立一个随机关注机制,以获取适当的背景信息。从信息理论的角度来看,我们证明拟议方法鼓励将背景嵌入与目标数据集区别开来,允许NPs考虑目标数据集和独立嵌入环境的特征。我们观察到,拟议的方法可以适当捕捉环境嵌入即使在噪音数据集和有限任务分布下,而典型NPs因缺乏背景嵌入而受到影响。我们从经验上表明,我们的方法通过 1D 回归、 掠食者- 和图像完成模型,大大超越了不同领域常规的NPs。此外,拟议方法也得到了MeevialLens- 10k数据集、一个现实世界问题的验证。