Several recent works have shown separation results between deep neural networks and hypothesis classes with inferior approximation capacity, such as shallow networks or kernel classes. On the other hand, the fact that deep networks can efficiently express a target function does not mean that this target function can be learned efficiently by deep neural networks. In this work we study the intricate connection between learnability and approximation capacity. We show that learnability of a target function with deep networks depends on the ability of simpler classes to approximate the target. Specifically, we show that a necessary condition for a function to be learnable by gradient descent on deep neural networks is that the function can be approximated, at least in a weak sense, by shallow neural networks. We also show that a class of functions can be learned by an efficient statistical query algorithm if and only if it can be approximated in a weak sense by some kernel class. We give several examples of functions which demonstrate depth separation, and conclude that they cannot be efficiently learned, even by a hypothesis class that can efficiently approximate them.