We study the functional task of deep learning image classification models and show that image classification requires extrapolation capabilities. This suggests that new theories have to be developed for the understanding of deep learning as the current theory assumes models are solely interpolating, leaving many questions about them unanswered. We investigate the pixel space and also the feature spaces extracted from images by trained models (in their hidden layers, including the 64-dimensional feature space in the last hidden layer of pre-trained residual neural networks), and also the feature space extracted by wavelets/shearlets. In all these domains, testing samples considerably fall outside the convex hull of training sets, and image classification requires extrapolation. In contrast to the deep learning literature, in cognitive science, psychology, and neuroscience, extrapolation and learning are often studied in tandem. Moreover, many aspects of human visual cognition and behavior are reported to involve extrapolation. We propose a novel extrapolation framework for the mathematical study of deep learning models. In our framework, we use the term extrapolation in this specific way of extrapolating outside the convex hull of training set (in the pixel space or feature space) but within the specific scope defined by the training data, the same way extrapolation is defined in many studies in cognitive science. We explain that our extrapolation framework can provide novel answers to open research problems about deep learning including their over-parameterization, their training regime, out-of-distribution detection, etc. We also see that the extent of extrapolation is negligible in learning tasks where deep learning is reported to have no advantage over simple models.
翻译:我们研究深层学习图像分类模型的功能任务,并表明图像分类需要外推能力。这表示必须开发新的理论,以便理解深层学习,因为目前的理论假设模型只是相互交织,留下许多关于这些模型的疑问。我们调查像素空间以及从经过训练的模型图像中提取的特征空间(在其隐藏层中,包括培训前残余神经网络最后隐藏层中的64维特征空间),以及由波盘/耳机提取的地物空间。在所有这些域中,测试样本大量落在培训组的螺旋外壳之外,而图像分类则需要外推。与深层学习文献、认知科学、心理学和神经科学、外推和学习的相似之处是同时研究。此外,人类视觉认知和行为的许多方面都涉及外推论。我们为深层学习模型的数学研究提出了一个新的外推法框架。在我们的框架中,我们使用深层外推法术语来在培训组(在深层次的外推算中,我们定义的外推法则包括空间或外推理学的外推理学框架中,我们定义的外推法系的外推法则可以解释。我们所定义的外推的外推法系的外推法系中,在特定的空间或外推法系的外推法外推法系的外推法系的外推法系的外推法系可以解释法系中,我们所定义的外推法系的外推法系的外推法系中,在具体的空间外推法系的外推法系的外推法系中,在具体的空间外推法系中,我们所定义的外推法系的外的外推法系的外推法系的外推法系的外推法系的外推法系的外推法系可以解释的外推法系的外推法系的外推法系的外的外的外推法系可以解释,在具体的空间外的外的外的外的外的外的外的外的外的外的外推法系可以解释法系可以解释法系的外的外的外的外的外推法系可以解释法系的外推法系的外的外的外的外推法系,我们的外的外的外的外推法系的外推法系的外的