The time and effort involved in hand-designing deep neural networks is immense. This has prompted the development of Neural Architecture Search (NAS) techniques to automate this design. However, NAS algorithms tend to be slow and expensive; they need to train vast numbers of candidate networks to inform the search process. This could be alleviated if we could partially predict a network's trained accuracy from its initial state. In this work, we examine the overlap of activations between datapoints in untrained networks and motivate how this can give a measure which is usefully indicative of a network's trained performance. We incorporate this measure into a simple algorithm that allows us to search for powerful networks without any training in a matter of seconds on a single GPU, and verify its effectiveness on NAS-Bench-101, NAS-Bench-201, and Network Design Spaces. Finally, our approach can be readily combined with more expensive search methods; we examine a simple adaptation of regularised evolutionary search that outperforms its predecessor. Code for reproducing our experiments is available at https://github.com/BayesWatch/nas-without-training.
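The activation-overlap measure described above can be sketched concretely. The version below is a minimal illustration, not the authors' exact implementation: it assumes each datapoint in a minibatch is summarised by the binary on/off pattern of the network's ReLU units, builds a kernel counting per-pair agreements, and scores the network by the log-determinant of that kernel (higher when activation patterns are more distinct across inputs). The function name `naswot_score` and the array shapes are illustrative choices.

```python
import numpy as np

def naswot_score(codes: np.ndarray) -> float:
    """Score an untrained network from binary ReLU activation codes.

    codes: (N, M) array of 0/1 values, one row per datapoint in the
    minibatch, one column per ReLU unit (1 = unit active).

    K[i, j] counts the units on which inputs i and j agree, so K measures
    the overlap of activation patterns. The log-determinant of K is large
    when patterns are diverse across inputs and collapses towards -inf
    when two inputs share the same pattern.
    """
    codes = codes.astype(float)
    # Agreements on active units plus agreements on inactive units.
    K = codes @ codes.T + (1.0 - codes) @ (1.0 - codes).T
    _, logdet = np.linalg.slogdet(K)
    return float(logdet)

# Usage: distinct random activation patterns yield a finite, high score.
rng = np.random.default_rng(0)
diverse_codes = rng.integers(0, 2, size=(8, 64))
score = naswot_score(diverse_codes)
```

A search loop would then evaluate this score for a batch of randomly sampled architectures at initialisation and keep the highest-scoring one, which is what makes the search run in seconds rather than GPU-days.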