A well-known perceptual consequence of categorization in humans and other animals, called categorical perception, is notably characterized by a within-category compression and a between-category separation: two items, close in input space, are perceived closer if they belong to the same category than if they belong to different categories. Elaborating on experimental and theoretical results in cognitive science, here we study categorical effects in artificial neural networks. We combine a theoretical analysis that makes use of mutual and Fisher information quantities, and a series of numerical simulations on networks of increasing complexity. These formal and numerical analyses provide insights into the geometry of the neural representation in deep layers, with expansion of space near category boundaries and contraction far from category boundaries. We investigate categorical representation by using two complementary approaches: one mimics experiments in psychophysics and cognitive neuroscience by means of morphed continua between stimuli of different categories, while the other introduces a categoricality index that, for each layer in the network, quantifies the separability of the categories at the neural population level. We show on both shallow and deep neural networks that category learning automatically induces categorical perception. We further show that the deeper a layer, the stronger the categorical effects. As an outcome of our study, we propose a coherent view of the efficacy of different heuristic practices of the dropout regularization technique. More generally, our view, which finds echoes in the neuroscience literature, insists on the differential impact of noise in any given layer depending on the geometry of the neural representation that is being learned, i.e. on how this geometry reflects the structure of the categories.
翻译:人类和其他动物分类的一个众所周知的认知后果,称为绝对感知,其显著特征是类内压缩和类别间分离:两个在输入空间中接近的物品,如果属于同一类别,则被认为更接近于属于不同类别。关于认知科学的实验和理论结果,我们在这里研究人工神经网络中的绝对影响。我们结合了利用相互和渔业信息数量的理论分析,以及一系列关于日益复杂的网络的数值模拟。这些正式和数字分析为深层神经结构的几何结构提供了洞察力,空间在类别界限附近扩大,缩小到远离类别界限的收缩。我们通过使用两种互补方法来调查绝对代表性:一种是心理物理和认知神经科学的模拟实验,其方法是对不同类别的神经科学进行形态的演化,而另一种则是对网络中每一层的精确度指数,这种精确度的变异性在神经结构的变异性研究中,我们从更深层的层次上看到一个更深层次的变异性结果。