Learning representations of neural network weights given a model zoo is an emerging and challenging area with many potential applications, from model inspection to neural architecture search and knowledge distillation. Recently, an autoencoder trained on a model zoo was shown to learn a hyper-representation that captures intrinsic and extrinsic properties of the models in the zoo. In this work, we extend hyper-representations for generative use, sampling new model weights. We propose layer-wise loss normalization, which we demonstrate is key to generating high-performing models, as well as several sampling methods based on the topology of hyper-representations. The models generated with our methods are diverse and performant, and outperform strong baselines on several downstream tasks: initialization, ensemble sampling, and transfer learning. Our results indicate the potential of aggregating knowledge from model zoos into new models via hyper-representations, paving the way for novel research directions.
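The abstract names layer-wise loss normalization without spelling it out. Below is a minimal sketch of one plausible reading, assuming the autoencoder's reconstruction error is rescaled per layer by the variance of that layer's ground-truth weights, so layers with small weight magnitudes contribute equally to the loss; the function name and the `layer_slices` bookkeeping structure are hypothetical, not taken from the paper.

```python
import torch

def layerwise_normalized_mse(recon: torch.Tensor,
                             target: torch.Tensor,
                             layer_slices: dict,
                             eps: float = 1e-8) -> torch.Tensor:
    """Reconstruction loss with per-layer normalization.

    recon, target: flattened weight vectors, shape (batch, num_params).
    layer_slices:  maps layer name -> index range in the flat vector
                   (hypothetical bookkeeping derived from the zoo's
                   shared architecture).
    Each layer's MSE is divided by the variance of that layer's true
    weights, so no single layer dominates the training signal.
    """
    total = recon.new_zeros(())
    for sl in layer_slices.values():
        t, r = target[:, sl], recon[:, sl]
        # Normalize the squared error by the per-layer weight variance.
        total = total + ((r - t) ** 2).mean() / (t.var() + eps)
    return total / len(layer_slices)

# Usage on a toy two-layer weight vector:
slices = {"conv1": slice(0, 450), "fc1": slice(450, 1000)}
w_true = torch.randn(8, 1000)            # batch of 8 flattened models
w_rec = w_true + 0.1 * torch.randn_like(w_true)
print(layerwise_normalized_mse(w_rec, w_true, slices))
```

Under this reading, the normalization matters because convolutional and fully connected layers typically have very different weight scales, and an unnormalized MSE would be dominated by whichever layer has the largest magnitudes.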