Implicit Acceleration and Feature Learning in Infinitely Wide Neural Networks with Bottlenecks

We analyze the learning dynamics of infinitely wide neural networks with a finite-sized bottleneck. Unlike the neural tangent kernel limit, a bottleneck in an otherwise infinite-width network allows data-dependent feature learning in its bottleneck representation. We empirically show that a single bottleneck in infinite networks dramatically accelerates training when compared to purely infinite networks, with an improved overall performance. We discuss the acceleration phenomena by drawing similarities to infinitely wide deep linear models, where the acceleration effect of a bottleneck can be understood theoretically.
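The architecture described in the abstract can be pictured with a finite-width stand-in. The sketch below is only an illustration under stated assumptions, not the authors' setup: the wide layers use a large but finite width in place of the infinite-width limit, and the ReLU nonlinearity, the 1/sqrt(fan_in) scaling, the linear bottleneck, and all names (`init_bottleneck_mlp`, `forward`, `width`, `d_bottleneck`) are hypothetical choices made for the example.

```python
import numpy as np


def init_bottleneck_mlp(rng, d_in, width, d_bottleneck, d_out):
    """Weights for: input -> wide -> bottleneck -> wide -> output.

    Assumption: `width` is a large finite number standing in for the
    infinite-width layers; `d_bottleneck` is the small finite bottleneck.
    """
    sizes = [d_in, width, d_bottleneck, width, d_out]
    # Standard-normal weights; the 1/sqrt(fan_in) factor is applied in the
    # forward pass (an assumed NTK-style parameterization).
    return [rng.standard_normal((m, n)) for m, n in zip(sizes[:-1], sizes[1:])]


def forward(params, x):
    """Return (network output, bottleneck representation)."""
    h = x
    bottleneck = None
    for i, w in enumerate(params):
        h = h @ w / np.sqrt(w.shape[0])
        if i == 1:
            # Finite-dimensional representation; kept linear here (assumption).
            bottleneck = h
        elif i < len(params) - 1:
            h = np.maximum(h, 0.0)  # ReLU on the wide hidden layers
    return h, bottleneck


rng = np.random.default_rng(0)
params = init_bottleneck_mlp(rng, d_in=10, width=2048, d_bottleneck=4, d_out=1)
x = rng.standard_normal((5, 10))
y, z = forward(params, x)
print(y.shape, z.shape)  # (5, 1) (5, 4)
```

Here `z` is the low-dimensional representation in which, per the abstract, data-dependent features can be learned; the sketch performs only a forward pass and makes no claim about training speed.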


