A new prior is proposed for learning representations of high-level concepts of the kind we manipulate with language. This prior can be combined with other priors in order to help disentangling abstract factors from each other. It is inspired by cognitive neuroscience theories of consciousness, seen as a bottleneck through which just a few elements, after having been selected by attention from a broader pool, are then broadcast and condition further processing, both in perception and decision-making. The set of recently selected elements one becomes aware of is seen as forming a low-dimensional conscious state. This conscious state is combining the few concepts constituting a conscious thought, i.e., what one is immediately conscious of at a particular moment. We claim that this architectural and information-processing constraint corresponds to assumptions about the joint distribution between high-level concepts. To the extent that these assumptions are generally true (and the form of natural language seems consistent with them), they can form a useful prior for representation learning. A low-dimensional thought or conscious state is analogous to a sentence: it involves only a few variables and yet can make a statement with very high probability of being true. This is consistent with a joint distribution (over high-level concepts) which has the form of a sparse factor graph, i.e., where the dependencies captured by each factor of the factor graph involve only very few variables while creating a strong dip in the overall energy function. The consciousness prior also makes it natural to map conscious states to natural language utterances or to express classical AI knowledge in a form similar to facts and rules, albeit capturing uncertainty as well as efficient search mechanisms implemented by attention mechanisms.
翻译:为了学习我们用语言操控的那种高层次概念, 提出了一个新的前置建议, 用于学习我们所操控的那种高层次概念。 这个前置可以与其他前置概念相结合, 以便帮助分解彼此的抽象因素。 它受到认知神经科学意识理论的启发, 被看作一个瓶颈, 只有几个元素在被从更广泛的知识库中挑选出来之后, 才能在视觉和决策中被播放和进行进一步处理。 最近所认识的一组元素被视为形成一个低维度的觉悟状态。 这个自觉状态正在将构成意识思想的少数概念结合在一起, 即人们在特定时刻立即意识到的抽象因素。 我们声称, 这种建筑和信息处理限制与关于高层概念之间联合分布的假设相吻合。 这些假设在被广泛关注之后( 自然语言的形式似乎与它们一致 ), 一个低维度的思维或意识状态类似于一句句子: 它只涉及几个变量, 但是可以做出一个非常有可能真实的表述。 这与一个清晰的概念和信息处理制约性机制, 与一个清晰的自然意识结构的精确度函数形成一个稳定的状态,, 也就是一个精确的分布, 与一个精确的精确的状态, 以及每个直观的精确的状态, 以直观的状态形成一个精确的状态, 与一个精确的状态, 与一个稳定的状态, 与一个稳定的状态形成一个稳定的状态形成一个稳定的状态, 。