共同神经网络中的内插、外推和局部概括化 (Interpolation, extrapolation, and local generalization in common neural networks)

There has been a long history of works showing that neural networks have hard time extrapolating beyond the training set. A recent study by Balestriero et al. (2021) challenges this view: defining interpolation as the state of belonging to the convex hull of the training set, they show that the test set, either in input or neural space, cannot lie for the most part in this convex hull, due to the high dimensionality of the data, invoking the well known curse of dimensionality. Neural networks are then assumed to necessarily work in extrapolative mode. We here study the neural activities of the last hidden layer of typical neural networks. Using an autoencoder to uncover the intrinsic space underlying the neural activities, we show that this space is actually low-dimensional, and that the better the model, the lower the dimensionality of this intrinsic space. In this space, most samples of the test set actually lie in the convex hull of the training set: under the convex hull definition, the models thus happen to work in interpolation regime. Moreover, we show that belonging to the convex hull does not seem to be the relevant criteria. Different measures of proximity to the training set are actually better related to performance accuracy. Thus, typical neural networks do seem to operate in interpolation regime. Good generalization performances are linked to the ability of a neural network to operate well in such a regime.

翻译：长期的工程历史表明,神经网络的外推时间比培训范围要难得多。Balestriero等人(2021年)最近的一项研究(2021年)对这一观点提出了挑战:将内推定义为属于培训组的螺旋壳状态,它们表明,无论是投入还是神经空间,测试组不能大部分地存在于这种螺旋壳中,因为数据具有高度的维度,并援引了众所周知的维度诅咒。神经网络随后被假定为必然以外推方式运作。我们在这里研究典型神经网络最后一层隐藏的神经活动。我们利用自动编码来揭示作为神经活动根基的内在空间,我们表明,这种空间实际上是低维度的,而且这种模型越好,这种内在空间的维度越低。在这个空间,测试组的大部分样本实际上都位于培训组的螺旋壳体壳中:根据Convex船体定义,模型因此会发生在内部系统的工作。此外,我们表明,在典型的神经网络中,“良好”的内置性能标准似乎与这种相互连接起来。我们表明,“正值”的内演测标准似乎与“更相”的内。

相关内容

Neural Networks

关注 1648

神经网络（Neural Networks）是世界上三个最古老的神经建模学会的档案期刊:国际神经网络学会(INNS)、欧洲神经网络学会(ENNS)和日本神经网络学会(JNNS)。神经网络提供了一个论坛，以发展和培育一个国际社会的学者和实践者感兴趣的所有方面的神经网络和相关方法的计算智能。神经网络欢迎高质量论文的提交，有助于全面的神经网络研究，从行为和大脑建模，学习算法，通过数学和计算分析，系统的工程和技术应用，大量使用神经网络的概念和技术。这一独特而广泛的范围促进了生物和技术研究之间的思想交流，并有助于促进对生物启发的计算智能感兴趣的跨学科社区的发展。因此，神经网络编委会代表的专家领域包括心理学，神经生物学，计算机科学，工程，数学，物理。该杂志发表文章、信件和评论以及给编辑的信件、社论、时事、软件调查和专利信息。文章发表在五个部分之一:认知科学，神经科学，学习系统，数学和计算分析、工程和应用。官网地址：http://dblp.uni-trier.de/db/journals/nn/