In the last years, neural networks (NN) have evolved from laboratory environments to the state-of-the-art for many real-world problems. It was shown that NN models (i.e., their weights and biases) evolve on unique trajectories in weight space during training. Following, a population of such neural network models (referred to as model zoo) would form structures in weight space. We think that the geometry, curvature and smoothness of these structures contain information about the state of training and can reveal latent properties of individual models. With such model zoos, one could investigate novel approaches for (i) model analysis, (ii) discover unknown learning dynamics, (iii) learn rich representations of such populations, or (iv) exploit the model zoos for generative modelling of NN weights and biases. Unfortunately, the lack of standardized model zoos and available benchmarks significantly increases the friction for further research about populations of NNs. With this work, we publish a novel dataset of model zoos containing systematically generated and diverse populations of NN models for further research. In total the proposed model zoo dataset is based on eight image datasets, consists of 27 model zoos trained with varying hyperparameter combinations and includes 50'360 unique NN models as well as their sparsified twins, resulting in over 3'844'360 collected model states. Additionally, to the model zoo data we provide an in-depth analysis of the zoos and provide benchmarks for multiple downstream tasks. The dataset can be found at www.modelzoos.cc.
翻译:过去几年来,神经网络从实验室环境演变为处理许多现实世界问题的最先进的神经网络(NN),显示NN模型(即其重量和偏差)在培训期间在重量空间的独特轨迹上演进;随后,这类神经网络模型(称为模拟动物园)的总数将在重量空间形成结构;我们认为,这些结构的几何、曲度和光滑性包含培训状况的信息,并能够揭示单个模型的潜在特性。有了这些模型动物园,人们可以调查(一) 模型分析的新办法,(二) 发现未知的学习动态,(三) 了解这类人群的丰富表现,或(四) 利用模型动物网络模型模拟动物园进行NNE重量和偏差的基因化建模建模。不幸的是,缺乏标准化模型动物园和现有基准,大大增加了进一步研究NNN人口人口的摩擦擦擦。我们出版了一套包含系统生成和多样化的NN模型的新型模型数据集。在总共50个模型中,由经过培训的模型和8个研究所的模型组成。