Self-Supervised Learning (SSL) has been shown to learn useful and information-preserving representations. Neural Networks (NNs) are widely applied, yet their weight space is still not fully understood. Therefore, we propose to use SSL to learn neural representations of the weights of populations of NNs. To that end, we introduce domain-specific data augmentations and an adapted attention architecture. Our empirical evaluation demonstrates that self-supervised representation learning in this domain is able to recover diverse NN model characteristics. Further, we show that the proposed learned representations outperform prior work at predicting hyper-parameters, test accuracy, and generalization gap, and that they transfer to out-of-distribution settings.
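The abstract does not spell out the domain-specific augmentations. As a minimal illustrative sketch (not the paper's actual pipeline; all function names here are hypothetical), one natural weight-space augmentation exploits the permutation symmetry of MLP hidden units: reordering the neurons of a hidden layer, with weights permuted consistently, yields a different weight vector that implements the same function, and thus a valid second "view" for contrastive SSL.

```python
import numpy as np

def permute_hidden_neurons(w_in, b_in, w_out, rng=None):
    """Permute the hidden neurons of one MLP layer.

    For h = act(x @ w_in + b_in) followed by y = h @ w_out, reordering the
    hidden neurons (columns of w_in, entries of b_in, rows of w_out) leaves
    the network function unchanged, so the permuted weights are a
    function-preserving augmented view of the same model.
    """
    rng = np.random.default_rng() if rng is None else rng
    perm = rng.permutation(w_in.shape[1])  # hidden width
    return w_in[:, perm], b_in[perm], w_out[perm, :]

def add_weight_noise(weights, scale=0.01, rng=None):
    """Add small Gaussian noise to a flattened weight vector (another view)."""
    rng = np.random.default_rng() if rng is None else rng
    return weights + rng.normal(0.0, scale, size=weights.shape)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Toy 2-layer ReLU MLP: 4 -> 8 -> 3
    w1, b1 = rng.normal(size=(4, 8)), rng.normal(size=8)
    w2 = rng.normal(size=(8, 3))
    x = rng.normal(size=(5, 4))

    def forward(w1, b1, w2):
        return np.maximum(x @ w1 + b1, 0.0) @ w2

    w1p, b1p, w2p = permute_hidden_neurons(w1, b1, w2, rng)
    # The permuted model computes the same outputs (up to float error).
    assert np.allclose(forward(w1, b1, w2), forward(w1p, b1p, w2p))
```

In a contrastive setup, two such augmented views of the same model's weights would form a positive pair, while views of different models form negatives.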