Predictive coding (PC) is an influential theory in computational neuroscience, which argues that the cortex forms unsupervised world models by implementing a hierarchical process of prediction error minimization. PC networks (PCNs) are trained in two phases. First, neural activities are updated to optimize the network's response to external stimuli. Second, synaptic weights are updated to consolidate this change in activity -- an algorithm called \emph{prospective configuration}. While previous work has shown how in various limits, PCNs can be found to approximate backpropagation (BP), recent work has demonstrated that PCNs operating in this standard regime, which does not approximate BP, nevertheless obtain competitive training and generalization performance to BP-trained networks while outperforming them on tasks such as online, few-shot, and continual learning, where brains are known to excel. Despite this promising empirical performance, little is understood theoretically about the properties and dynamics of PCNs in this regime. In this paper, we provide a comprehensive theoretical analysis of the properties of PCNs trained with prospective configuration. We first derive analytical results concerning the inference equilibrium for PCNs and a previously unknown close connection relationship to target propagation (TP). Secondly, we provide a theoretical analysis of learning in PCNs as a variant of generalized expectation-maximization and use that to prove the convergence of PCNs to critical points of the BP loss function, thus showing that deep PCNs can, in theory, achieve the same generalization performance as BP, while maintaining their unique advantages.
翻译:预测性编码(PC)是计算神经科学中具有影响力的理论,它指出,通过采用预测差错最小化的等级过程,皮层形成不受监督的世界模型,PC网络分两个阶段培训。首先,神经活动得到更新,以优化网络对外部刺激的反应。第二,合成权重得到更新,以巩固活动的变化 -- -- 一种称为emph{Prospect 配置的算法。虽然以前的工作表明在各种限度内如何发现多氯化萘是近似反向调整(BP),但最近的工作表明,在这一标准制度中运行的多氯化萘并不接近BP,但获得了竞争培训和通用性能,在网上、少发和持续学习等任务上超过了网络对外部刺激的反应。尽管这种有希望的经验表现,但从理论上看,对PCN的特性和动态却很少了解。在本文件中,我们对与潜在配置所培训的PCN的特性进行了全面的理论分析,因此,在BTP的理论化方面,我们从一个未知的理论性平面上得出了一种分析结果,在BI级的理论上,我们从一个未知的理论上展示了一种对PI级的理论上的理论性平比级关系,从而提供了一种对PI级的理论的理论的理论的理论上的理论上的精确性平比级关系。