Machine learning algorithms have achieved superhuman performance in specific complex domains. Yet learning online from few examples and efficiently generalizing across domains remains elusive. In humans such learning proceeds via declarative memory formation and is closely associated with consciousness. Predictive processing has been advanced as a principled Bayesian inference framework for understanding the cortex as implementing deep generative perceptual models for both sensory data and action control. However, predictive processing offers little direct insight into fast compositional learning or the mystery of consciousness. Here we propose that through implementing online learning by hierarchical binding of unpredicted inferences, a predictive processing system may flexibly generalize in novel situations by forming working memories for perceptions and actions from single examples, which can become short- and long-term declarative memories retrievable by associative recall. We argue that the contents of such working memories are unified yet differentiated, can be maintained by selective attention and are consistent with observations of masking, postdictive perceptual integration, and other paradigm cases of consciousness research. We describe how the brain could have evolved to use perceptual value prediction for reinforcement learning of complex action policies simultaneously implementing multiple survival and reproduction strategies. 'Conscious experience' is how such a learning system perceptually represents its own functioning, suggesting an answer to the meta problem of consciousness. Our proposal naturally unifies feature binding, recurrent processing, and predictive processing with global workspace, and, to a lesser extent, the higher order theories of consciousness.
翻译:机器学习算法在特定复杂领域取得了超人性的表现。然而,从几个实例中进行在线学习,并有效地在各个领域进行普及仍然难以实现。在人类中,通过宣示性记忆的形成和与意识密切关联,这种学习过程作为原则性的巴伊西亚推论框架被推进,以理解骨皮层,将这种深刻的感官概念模型应用于感官数据和行动控制。然而,预测性处理很少直接洞察到快速的构成学习或意识的神秘性。我们在这里建议,通过对未经预知的推论进行等级约束的在线学习,预测性处理系统可以在新情况中灵活地进行普及,从单一例子中为感知和行动形成工作记忆,这可以成为短期和长期的宣示感框架,通过连带回顾,可以重新认识内涵对感数据和行动控制的深层次。我们认为,这种工作记忆的内容是统一的,但又有区别的,可以通过选择性的注意来维持,并且与关于掩盖、事后感知性整合和其他意识研究的典型案例的观察相一致。我们描述了大脑如何在新情况中运用概念预测性预测性预测性预测性预测性预测性,同时学习复杂的动作,一种不那么自然的理性的逻辑,它是如何学习一种正常的逻辑和复制式的系统,一个正常地学习一种不拘谨的逻辑,它是如何的逻辑性地进行着一种不拘谨的逻辑的逻辑的逻辑性地进行着一种过程。