物理概念的可学习性:神经网络能够理解什么是真实的吗? (On the Learnability of Physical Concepts: Can a Neural Network Understand What's Real?)

We revisit the classic signal-to-symbol barrier in light of the remarkable ability of deep neural networks to generate realistic synthetic data. DeepFakes and spoofing highlight the feebleness of the link between physical reality and its abstract representation, whether learned by a digital computer or a biological agent. Starting from a widely applicable definition of abstract concept, we show that standard feed-forward architectures cannot capture but trivial concepts, regardless of the number of weights and the amount of training data, despite being extremely effective classifiers. On the other hand, architectures that incorporate recursion can represent a significantly larger class of concepts, but may still be unable to learn them from a finite dataset. We qualitatively describe the class of concepts that can be "understood" by modern architectures trained with variants of stochastic gradient descent, using a (free energy) Lagrangian to measure information complexity. Even if a concept has been understood, however, a network has no means of communicating its understanding to an external agent, except through continuous interaction and validation. We then characterize physical objects as abstract concepts and use the previous analysis to show that physical objects can be encoded by finite architectures. However, to understand physical concepts, sensors must provide persistently exciting observations, for which the ability to control the data acquisition process is essential (active perception). The importance of control depends on the modality, benefiting visual more than acoustic or chemical perception. Finally, we conclude that binding physical entities to digital identities is possible in finite time with finite resources, solving in principle the signal-to-symbol barrier problem, but we highlight the need for continuous validation.

翻译：我们根据深神经网络生成现实合成数据的非凡能力,重新审视典型的信号到符号屏障。深假和假假冒凸显了物理现实及其抽象表达方式之间联系的微弱性。从广泛适用的抽象概念定义开始,我们显示标准反馈结构不能捕捉,而只是微不足道的概念,尽管是一个极为有效的分类器,尽管其重量和训练数据的数量是极为有效的。另一方面,包含循环的神经网络的架构可以代表一个大得多的概念类别,但可能仍然无法从一个有限的数据集中学习这些概念。我们从质量上描述能够“被数字计算机或生物剂所学习的物理现实及其抽象表达方式所“掩盖”的一类概念。我们从一个广泛适用的抽象概念开始,从一个广泛应用的抽象概念开始,我们用一个(自由能源)拉格朗格结构来衡量信息的复杂性。即使一个概念已经被理解,但一个网络除了通过持续的互动和验证之外,没有向外部代理传递其理解的手段。我们随后将物理物体描述为抽象的概念和使用先前的分析来从一个有限的物理概念中学习这些概念,我们从有限的视觉感知知会如何稳定地定义。我们最终地理解了获取数据的能力。我们最终能够通过一个固定的系统来决定数据控制。