Recent approaches to the theoretical analysis of model-based deep learning architectures have studied the convergence of gradient descent in shallow ReLU networks arising from generative models with sparse hidden layers. Motivated by the success of architectures that impose structured forms of sparsity, we introduce and study a group-sparse autoencoder that accommodates a variety of generative models and uses a group-sparse ReLU activation function to force the non-zero units at a given layer to occur in blocks. For clustering models, inputs that activate the same group of units belong to the same cluster. We then analyze the gradient dynamics of a shallow instance of the proposed autoencoder, trained on data adhering to a group-sparse generative model. In this setting, we prove that the network parameters converge to a neighborhood of the generating matrix. We validate our model through numerical analysis and show that networks with a group-sparse ReLU outperform networks with traditional ReLUs on both sparse coding and parameter recovery tasks. We also provide experiments on real data to corroborate the simulated results and to emphasize the clustering capabilities of structured sparsity models.
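To make the block-activation idea concrete, here is a minimal sketch of a group-sparse ReLU. The abstract does not fix the gating rule, so this version assumes a group is kept active when the l2 norm of its pre-activations exceeds a per-group bias, with an ordinary ReLU applied inside kept groups; the function name `group_sparse_relu`, the contiguous block structure, and the threshold values are illustrative assumptions, not the paper's exact definition.

```python
import numpy as np

def group_sparse_relu(z, groups, bias):
    """Group-sparse ReLU sketch: units can only be non-zero in blocks.

    Assumed gating rule (not necessarily the paper's): a group is kept
    when the l2 norm of its pre-activations exceeds its bias; within a
    kept group the usual ReLU is applied, all other units are zeroed.

    z      : (n,) pre-activation vector
    groups : list of index arrays partitioning range(n)
    bias   : (len(groups),) non-negative per-group thresholds
    """
    out = np.zeros_like(z)
    for idx, b in zip(groups, bias):
        if np.linalg.norm(z[idx]) > b:          # group passes the gate
            out[idx] = np.maximum(z[idx], 0.0)  # ordinary ReLU inside
    return out

# Toy usage: 6 units in 3 contiguous blocks of 2.
z = np.array([0.9, 0.8, -0.1, 0.2, 0.05, -0.3])
groups = [np.arange(0, 2), np.arange(2, 4), np.arange(4, 6)]
print(group_sparse_relu(z, groups, bias=np.array([0.5, 0.5, 0.5])))
# -> [0.9 0.8 0.  0.  0.  0. ]  (non-zeros occur in a single block)
```

Under the clustering interpretation in the abstract, two inputs whose outputs share the same surviving block would be assigned to the same cluster.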